
Image natural language description generation method and device with cross-linguistic learning ability

A method with cross-language learning ability, based on natural language technology and applied in the field of image recognition. It addresses the high operating cost, low efficiency, and poor readability of existing cross-language image description.

Active Publication Date: 2017-12-15
RENMIN UNIVERSITY OF CHINA
2 Cites, 18 Cited by

AI Technical Summary

Problems solved by technology

[0003] However, the description sentences of large image sets are currently written mostly in English. For cross-language image description, one must either collect training data in the target language through manual annotation, or translate the English description sentences into target-language description sentences by machine translation. These methods suffer from high operating costs, low efficiency of image language description, and poor readability.



Examples


Embodiment Construction

[0091] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0092] Figure 1 is a flow chart of a method for generating natural language descriptions of images with cross-language learning capabilities, provided by an embodiment of the present invention. As shown in Figure 1, the method in this embodiment may include:

[0093] S101. Obtain an English description sentence corresponding to the image se...


Abstract

The invention provides a method and device for generating natural language descriptions of images with cross-language learning ability. In the method, English description sentences are machine-translated into target-language description sentences, and part of the target-language description sentences are selected at random to form a training sample set. A fluent sample set and a non-fluent sample set are used to train a sentence fluency model. The sentence fluency model then evaluates the fluency of the target-language description sentences in a candidate data set, and a strategy for training an image description sentence generation model is set according to the fluency probability of each target-language description sentence. The image description sentence generation model is trained according to that strategy, yielding the trained model. In this way, training of the target-language image description sentence generation model is guided by the fluency evaluation results, the influence of non-fluent target-language description sentences on the training process is reduced, and the accuracy of the target-language image description sentence generation model is improved.
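The step of turning per-sentence fluency ("smooth degree") probabilities into a training strategy can be illustrated with a minimal Python sketch. The abstract does not say which strategy is used, so everything here is an assumption made for illustration: the names `Caption`, `assign_training_weights`, and `weighted_loss`, the 0.5 threshold, and the rule itself (sentences at or above the threshold keep full weight, the rest are down-weighted to their fluency probability). The fluency model is represented only by a callable that returns a probability in [0, 1].

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass
class Caption:
    """A machine-translated target-language description of one image (hypothetical type)."""
    image_id: str
    text: str


def assign_training_weights(
    captions: Sequence[Caption],
    fluency_prob: Callable[[str], float],
    threshold: float = 0.5,  # assumed cut-off, not taken from the source
) -> List[Tuple[Caption, float]]:
    """Pair each caption with a training weight derived from its fluency probability.

    Assumed rule: captions judged fluent (p >= threshold) get weight 1.0;
    the others are down-weighted to p, so they influence training less.
    """
    weighted = []
    for caption in captions:
        p = fluency_prob(caption.text)
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"fluency probability out of range: {p}")
        weight = 1.0 if p >= threshold else p
        weighted.append((caption, weight))
    return weighted


def weighted_loss(losses: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted mean of per-sentence losses; returns 0.0 if all weights are zero."""
    if len(losses) != len(weights):
        raise ValueError("losses and weights must have the same length")
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(l * w for l, w in zip(losses, weights)) / total


# Toy usage with a stand-in fluency model (a lookup table, purely illustrative).
toy_probs = {"fluent sentence": 0.9, "sentence fluent not": 0.2}
captions = [
    Caption("img1", "fluent sentence"),
    Caption("img2", "sentence fluent not"),
]
pairs = assign_training_weights(captions, lambda s: toy_probs[s])
weights = [w for _, w in pairs]
batch_loss = weighted_loss([2.0, 4.0], weights)
```

In this sketch the non-fluent sentence still contributes to the batch loss, but with less influence than the fluent one. Discarding sentences below the threshold entirely would be another possible strategy under the same description.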

Description

Technical field

[0001] The invention relates to the technical field of image recognition, in particular to a method and device for generating natural language descriptions of images with cross-language learning capabilities.

Background art

[0002] Image captioning means that, given a picture, the computer automatically generates a natural sentence describing the main visual content of that picture. The sentence describes the specific behavior of the main object in the picture within a specific scene. Image description sentences can carry more semantic information than traditional image description tags, which helps in managing image data.

[0003] However, at present, the description sentences of large image sets are generally written in English. When it comes to cross-language image description, it is necessary to use manual annotation to collect training data in the corresponding language for ...


Application Information

IPC(8): G06F17/28, G06F17/27, G06K9/62
CPC: G06F40/253, G06F40/58, G06F18/24
Inventor: 李锡荣, 蓝玮毓, 董建锋
Owner: RENMIN UNIVERSITY OF CHINA