Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Visual question and answer method based on network structure search

A network structure and visual technology, applied in the field of visual question answering, can solve the problem of high trial and error costs of manually designed networks

Active Publication Date: 2021-08-20
NANJING UNIV
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current network is artificially designed by experts. These deep learning network designs have obvious "prior" traces, such as the RCNN series network in the field of image classification, from RCNN network to Fast-RCNN network, to Faster-RCNN network, and then to Mask- RCNN network, each upgrade integrates the "prior" design advantages of the previous network, and then improves it, but as the network structure becomes more and more complex, the cost of manual design network trial and error is getting higher and higher

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Visual question and answer method based on network structure search
  • Visual question and answer method based on network structure search
  • Visual question and answer method based on network structure search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is an embodiment of a part of the application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the scope of protection of this application.

[0024] It should be noted that the terms "first" and "second" in the specification and claims of the present application and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific order or sequence. It should be understood that the data so used may be interchanged under appropriate circumstances for ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a visual question and answer method based on network structure search. The method comprises the following steps: performing feature extraction on an original picture by adopting a first artificial neural network model; performing feature extraction on the text information by adopting a second artificial neural network model; a to-be-searched network structure framework being a coding-decoder framework, defining three search operators for framework network search, and the input of the search operators being image features or text features extracted based on the original picture or / and the text information; searching the architecture weight of the network structure and the operation weight of an operator by using a gradient-based alternative optimization strategy; and enabling the search network to output candidate word vectors according to a multi-classification method, and selecting the word vector with the maximum probability as an answer to be output. The visual question and answer method based on network structure search has the beneficial effect that a better effect can be searched in a larger space.

Description

technical field [0001] This application relates to the field of visual question answering, in particular, to a visual question answering system method based on network structure search. Background technique [0002] With the development of deep learning, visual question answering has been widely used. However, the traditional visual question answering system still has certain defects. The multimodal feature fusion part of the traditional visual question answering system is a very skillful network structure designed by experts. Through the network structure Only when the search technology automatically designs the network structure can it find the optimal network structure in a sufficiently large space. [0003] Specifically, in recent years, with the rapid development and important success of artificial intelligence, Visual Question Answering (VQA), as an intersection of computer vision and natural language processing, has attracted widespread attention. VQA tasks widely ex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06F16/338G06N3/04
CPCG06F16/332G06F16/3344G06F16/338G06N3/045
Inventor 俞扬詹德川周志华乔康管聪秦熔均袁雷张云天胡毅奇
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products