Method, apparatus, device, and medium for generating difference description sentences

A technology for generating difference description sentences, applied in the field of artificial intelligence. It addresses problems such as a computer's inability to reason normally when human language contains errors, and it improves accuracy.

Active Publication Date: 2022-05-17
SUZHOU LANGCHAO INTELLIGENT TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] In view of this, the purpose of the present invention is to provide a method, apparatus, device, and medium for generating difference description sentences that solve the problem of a computer being unable to reason normally because of human language errors, and that further improve the human-computer interaction experience. The specific solution is as follows:



Examples


Embodiment Construction

[0070] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0071] In the multimodal field, errors in human language leave the computer model unable to determine what problem the human intends to describe, so the computer cannot reason normally.

[0072] For this reason, an embodiment of the present application proposes a scheme for generating difference description sentences, which can solve the problem of a computer being unable to reason normally because of human language errors, and further enhance the human-...


Abstract

The invention discloses a method, apparatus, device, and medium for generating difference description sentences, and relates to the technical field of artificial intelligence. The method comprises: splicing image coding features with text coding features; inputting the spliced coding features into a preset image-text alignment unit, constructed on a preset self-attention mechanism, to obtain spliced alignment features; splitting the spliced alignment features into image alignment features and text alignment features; processing the image alignment features, the text coding features, and the text alignment features with a preset noise monitoring unit, constructed on a preset self-attention mechanism and a preset cross-attention mechanism, to extract a difference signal; and generating the difference description sentence from the difference signal using a preset difference description generation algorithm. In this way, the part of the human-language text that cannot be aligned with the image is located through the preset cross-attention mechanism and a corresponding explanatory description is given, which solves the problem of a computer being unable to reason normally because of human language errors.
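The data flow in the abstract can be illustrated with a minimal, dependency-free Python sketch. This is not the patented implementation: the attention here uses identity projections with no learned weights, and the mismatch score (one minus cosine similarity) and the helper names `difference_signal` and `locate_mismatch` are assumptions introduced purely for illustration. The patent's preset noise monitoring unit and its generation algorithm are not specified in this summary.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention with identity projections (no learned weights)."""
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, values))
                    for d in range(len(values[0]))])
    return out

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def difference_signal(image_feats, text_feats):
    """Hypothetical stand-in for the alignment and noise monitoring units."""
    # 1. Feature splicing: concatenate along the sequence axis.
    spliced = image_feats + text_feats
    # 2. Image-text alignment: self-attention over the spliced sequence.
    aligned = attention(spliced, spliced, spliced)
    # 3. Split the spliced alignment features back into the two modalities.
    image_aligned = aligned[:len(image_feats)]
    text_aligned = aligned[len(image_feats):]
    # 4. Noise monitoring (assumed form): self-attention on the text coding
    #    features, then cross-attention from text to the image alignment features.
    text_self = attention(text_feats, text_feats, text_feats)
    text_cross = attention(text_self, image_aligned, image_aligned)
    # Assumed score: tokens poorly explained by the image get a higher value.
    return [1.0 - cosine(t, c) for t, c in zip(text_aligned, text_cross)]

def locate_mismatch(tokens, scores):
    """Return the token with the largest difference score."""
    return tokens[max(range(len(scores)), key=scores.__getitem__)]

# Toy features: two image regions and two text tokens, dimension 3.
image_feats = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
text_feats = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]
scores = difference_signal(image_feats, text_feats)
flagged = locate_mismatch(["dog", "red"], scores)
```

In a real system the features would come from trained image and text encoders, and the flagged token would feed a generation model rather than a simple argmax; the sketch only shows the order of splicing, aligning, splitting, and cross-attending.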

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, and in particular to a method, apparatus, device, and medium for generating difference description sentences.

Background

[0002] In recent years, multimodality has become an emerging research direction in artificial intelligence; areas such as Visual Commonsense Reasoning (VCR) and Visual Question Answering (VQA) have become key research directions in the industry. Existing work in the multimodal field assumes that human language is infallible in multimodal processing, i.e., that the human language will necessarily match the image. For humans, however, slips of the tongue are unavoidable. Human language errors are usually not egregious, that is, the text and the image remain relatively close, but the wrong use of a certain subject or attributive wi...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V30/40, G06V30/418, G06V30/18
CPC: G06V30/418, G06V30/40, G06V30/18
Inventor: 李晓川, 李仁刚, 郭振华, 赵雅倩, 范宝余
Owner: SUZHOU LANGCHAO INTELLIGENT TECH CO LTD