Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Picture-based text generation method and device

A picture and text technology, applied in the field of picture-based text generation and computer-readable storage media, can solve problems such as difficulty in guaranteeing accuracy, misunderstanding of pictures, affecting the accuracy of article expression and writing efficiency, etc. The effect of usability

Active Publication Date: 2020-06-19
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] When users create articles such as writing, such as news writing, they need to convert the picture information taken on the spot into text. This process requires users to have high observation ability and prior knowledge. If the picture information in the picture cannot be accurately identified , there will be errors in the user's understanding of the picture, and it will affect the accuracy of the article expression and writing efficiency
In the existing technology, the main ways to convert picture information into text include manual description and open-domain picture recognition and text conversion. Among them, manual description means that users manually edit the content of pictures by observing pictures, and open-domain pictures Recognition and text conversion means to list the content in the picture for the user to use by detecting the target of the picture. However, the problem with the existing technology is that the manual description requires the user to have a lot of professional knowledge, and the accuracy is difficult to guarantee. The image recognition and text conversion in the open domain is only a simple list of targets, and cannot return accurate image information for a certain vertical field. At the same time, because the returned text content is a simple list of target names in the image, it cannot be directly used by users. less efficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Picture-based text generation method and device
  • Picture-based text generation method and device
  • Picture-based text generation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0045] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a picture-based text generation method and device, and the method comprises the steps: determining at least one target vertical field, and obtaining the picture content information corresponding to a to-be-edited picture and the picture classification information corresponding to the to-be-edited picture according to the target vertical field to which the to-be-edited picture belongs; generating picture accurate information corresponding to the to-be-edited picture according to the picture content information corresponding to the to-be-edited picture and the picture classification information corresponding to the to-be-edited picture; and generating a picture editing text corresponding to the to-be-edited picture through the text generation model according to the picture accurate information corresponding to the to-be-edited picture. According to the method, the picture accurate information corresponding to the to-be-edited picture is acquired for the target vertical field, and the picture editing text with professional reference value is edited in combination with the target vertical field, so that a user is assisted in efficiently and accurately completingprofessional document editing and writing work in the field.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to an image-based text generation method, device, computing device, and computer-readable storage medium. Background technique [0002] When users create articles such as writing, such as news writing, they need to convert the picture information taken on the spot into text. This process requires users to have high observation ability and prior knowledge. If the picture information in the picture cannot be accurately identified , there will be errors in the user's understanding of the picture, and it will affect the accuracy of the article expression and writing efficiency. In the existing technology, the main ways to convert picture information into text include manual description and open-domain picture recognition and text conversion. Among them, manual description means that users manually edit the content of pictures by observing pictures, and open...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06T11/60G06F16/55G06N3/04
CPCG06T11/60G06F16/55G06N3/045G06F18/214G06F18/24
Inventor 李长亮樊骏锋汪美玲
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products