Unlock instant, AI-driven research and patent intelligence for your innovation.

Cross-language description-oriented adversarial data enhancement method, system and storage medium

An adversarial, cross-language technology, applied in the field of adversarial data enhancement for cross-language description, can solve problems such as affecting sentence accuracy, inaccurate and comprehensive information, and text problems.

Pending Publication Date: 2021-05-18
WUHAN INSTITUTE OF TECHNOLOGY
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Both geometric transformations and randomly cropped images can affect the accuracy of generated sentences
When it comes to orientation information, such as a person standing on the left side of the table, flipping or cropping the picture may cause the information obtained by the model to be inaccurate and comprehensive, resulting in problems with the generated text
In terms of text, it is also challenging to propose general language conversion rules, and general data enhancement techniques in Natural Language Processing (NLP) have not been fully explored

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language description-oriented adversarial data enhancement method, system and storage medium
  • Cross-language description-oriented adversarial data enhancement method, system and storage medium
  • Cross-language description-oriented adversarial data enhancement method, system and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention will be further described below in conjunction with accompanying drawing:

[0050] The invention discloses an adversarial data enhancement method for cross-language image description, the steps of which are as follows: firstly, for the image, a gradient attack adversarial algorithm is used to generate an adversarial sample by adding as small a disturbance as possible; secondly, for the text, by The sequence-to-sequence idea is used to generate adversarial samples with the same semantics as the original sentence; finally, the adversarial samples are used as additional samples and put into the network training together with the clean samples, and the adversarial samples of the text are generated before training. Images are continuously generated during training. Four pairs of additional data can be generated once each image-text pair is generated, which effectively increases the scale of the data set, and the invention can improve the model performan...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-language description-oriented adversarial data enhancement method, a system and a storage medium. The method comprises the following steps of obtaining a clean image-text pair data set; using the sequence-to-sequence model to generate a text antagonism sample; training an image description generation model: if the current training stage is an adversarial training stage, generating an image adversarial sample, then expanding an image-text pair, then training the model by using an expanded image-text pair data set, and optimizing the model according to a joint loss function; if the current training stage is a non-adversarial training stage, training the model by using a clean image-text pair data set, and optimizing the model according to a loss function; and obtaining a trained image description generation model and an expanded image-text pair data set. Through an easy-to-operate data enhancement mode, the data set is expanded, and the robustness and performance of the image description generation model are improved.

Description

technical field [0001] The invention belongs to the technical field of data enhancement, and in particular relates to an adversarial data enhancement method, system and storage medium for cross-language description. Background technique [0002] Algorithms can directly benefit from the size of the data set, and the fit and robustness of models trained on large-scale data sets are often better than models obtained from small-scale data sets. For image description tasks in small languages, if you want to achieve the same performance as the English image description task, the first challenge you need to face is the acquisition of large-scale data sets. [0003] In order to ensure the quality, manually labeling the dataset is the best method, but this method is extremely time-consuming. In order to balance the model performance and the cost of manually labeling datasets, data augmentation methods are usually used to expand datasets. Data augmentation has been widely used in the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/08G06N3/04G06F16/33G06F40/30G06F40/289G06F40/216
CPCG06N3/084G06F16/3344G06F40/30G06F40/289G06F40/216G06N3/044G06N3/045G06F18/214
Inventor 肖宇鲁统伟
Owner WUHAN INSTITUTE OF TECHNOLOGY