Incremental data enhancement method for visual question and answer model training and application
A model training and incremental technology, applied in the field of model training, can solve problems such as increasing the difficulty of reasoning, difficulty in achieving recognition effect, increasing the amount of data in semantic expression, and answering conflicts, etc., to achieve data diversity, Improve the classification accuracy and improve the effect of the effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0037] This embodiment provides an incremental data enhancement method for visual question answering model training, the method includes a data statistics step, a threshold value determination step and a data expansion step, specifically: obtaining the original training data set, the training samples in the data set The form of is , the text is formed by natural language sequence; obtain the sentence length distribution of the natural language sequence in the original training data set and the word frequency distribution of each word, and determine based on the sentence length distribution Minimum sentence length threshold and maximum sentence length threshold; according to the minimum sentence length threshold, maximum sentence length threshold and word frequency distribution, the natural language sequence in the training sample is expanded to realize data enhancement.
[0038] Data statistics include the length statistical distribution and word frequency distribution of text ...
Embodiment 2
[0044] This embodiment provides a training method for a visual question answering model, which adopts end-to-end training. The process of the training method is as follows figure 1 shown, including:
[0045] (1) Model initialization.
[0046] (2) Expand the original training data set with the incremental data enhancement method described in Embodiment 1, obtain the expanded training data set, and realize text data enhancement.
[0047] (3) Feature extraction is performed on the training samples in the expanded training data set to obtain text features and image features.
[0048] During the training process of the model, the maximum length of the text language sequence is cut so that the maximum length is less than the maximum length limit of the sequential neural network model, and then the sequence is sent to the query table module, and then the output result is sent to the sequential neural network to extract the text In the test stage, the original text language sequence...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com