Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method and device for image description

A technology of image features and target images, applied in the field of image processing, can solve problems such as sentence deviation, and achieve the effect of accurate translation of sentences

Active Publication Date: 2019-12-03
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this case, the extraction of global information by the feature extraction model depends on the performance of the feature extraction model itself. Some feature extraction models will pay attention to one type of information in the image, and some feature extraction models will pay attention to another type of information in the image. One type of information, which will cause the translation model to often fail to use the complete global information corresponding to the image as a reference in the subsequent process, resulting in deviations in the output sentences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for image description
  • A method and device for image description
  • A method and device for image description

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0066] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a method and device for image description. The method includes: using multiple first feature extraction models to perform feature extraction on a target image to obtain image features generated by each first feature extraction model; The image features generated by the feature extraction model are fused to generate the global image features corresponding to the target image; the second feature extraction model is used to extract the features of the target image to obtain the target detection features corresponding to the target image; the global image features corresponding to the target image and The target detection features are input to the translation model, and the generated translation sentences are used as the description sentences of the target image, so that in the subsequent process of inputting the global image features and target detection features corresponding to the target image into the translation model to generate translation sentences, there are more The global image features of the image information are used as a reference to make the output translation sentences more accurate.

Description

technical field [0001] The present application relates to the technical field of image processing, and in particular to an image description method and device, a computing device, and a computer-readable storage medium. Background technique [0002] Image description is to automatically generate a descriptive text based on the image, similar to "talking through pictures". For humans, image description is a simple and natural thing, but for machines, this task is full of challenges. The reason is that the machine must not only detect the objects in the image, but also understand the relationship between the objects, and finally express them in a reasonable language. [0003] In the prior art, the process of image description requires a machine to extract local information and global information from the target image, input the global information and local information into the translation model, and use the sentence output by the translation model as the description informati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/46G06K9/62
CPCG06V10/457G06V10/44G06V2201/07G06F18/2414G06F18/253G06V10/82G06V20/70G06V10/806G06F40/40G06V10/424
Inventor 宋振旗李长亮廖敏鹏
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products