An intelligent visual question answering method based on deep neural network
A deep neural network, intelligent vision technology, applied in the field of intelligent visual question answering based on deep neural network, can solve the problems of unknown answer reasons and lack of training data.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0069] Embodiments of the present invention include the following steps:
[0070] 1. Intelligent Q&A data preprocessing
[0071] 1.1 Adjust all image scales to 448*448 resolution.
[0072] 1.2 Remove stop words from the text content in all training data, and lowercase all English words. Then segment the text content, and select the 8000 words with the highest frequency as the answer dictionary, and select the 20000 words with the highest frequency as the image description dictionary.
[0073] 2. Image depth convolution feature extraction
[0074] Use the residual deep convolutional network to process the image convolution features, and obtain the feature map of each image, expressed as F I ∈ R 14×14×2048 . Here 14×14 is the feature area of the image, and 2048 is the feature dimension of each feature block.
[0075] 3. Depth Feature Extraction for Text Questions
[0076] Use the bidirectional recurrent neural network to extract the problem features, and the processing ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


