A method and device for generating text based on pictures

A picture and text technology, applied in the field of picture-based text generation and computer-readable storage media, can solve the problems of difficulty in guaranteeing accuracy, affecting the accuracy of article expression and writing efficiency, and the efficiency of direct use by users, so as to ensure accuracy. and usability effects

Active Publication Date: 2020-09-01
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] When users create articles such as writing, such as news writing, they need to convert the picture information taken on the spot into text. This process requires users to have high observation ability and prior knowledge. If the picture information in the picture cannot be accurately identified , there will be errors in the user's understanding of the picture, and it will affect the accuracy of the article expression and writing efficiency
In the existing technology, the main ways to convert picture information into text include manual description and open-domain picture recognition and text conversion. Among them, manual description means that users manually edit the content of pictures by observing pictures, and open-domain pictures Recognition and text conversion means to list the content in the picture for the user to use by detecting the target of the picture. However, the problem with the existing technology is that the manual description requires the user to have a lot of professional knowledge, and the accuracy is difficult to guarantee. The image recognition and text conversion in the open domain is only a simple list of targets, and cannot return accurate image information for a certain vertical field. At the same time, because the returned text content is a simple list of target names in the image, it cannot be directly used by users. less efficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for generating text based on pictures
  • A method and device for generating text based on pictures
  • A method and device for generating text based on pictures

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0045] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application provides a picture-based text generation method and device, wherein the method includes: determining at least one target vertical field, and obtaining the picture content information corresponding to the picture to be edited according to the target vertical field to which the picture to be edited belongs. The picture classification information corresponding to the picture to be edited; according to the picture content information corresponding to the picture to be edited and the picture classification information corresponding to the picture to be edited, generate the accurate picture information corresponding to the picture to be edited; according to the picture to be edited For the accurate information of the picture corresponding to the edited picture, the picture editing text corresponding to the picture to be edited is generated through the text generation model described in this application. This application obtains the accurate information of the picture corresponding to the picture to be edited according to the target vertical field, and edits the picture editing text with professional reference value in combination with the target vertical field, so as to assist the user to efficiently and accurately complete the professional document editing and writing work in the field.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to an image-based text generation method, device, computing device, and computer-readable storage medium. Background technique [0002] When users create articles such as writing, such as news writing, they need to convert the picture information taken on the spot into text. This process requires users to have high observation ability and prior knowledge. If the picture information in the picture cannot be accurately identified , there will be errors in the user's understanding of the picture, and it will affect the accuracy of the article expression and writing efficiency. In the existing technology, the main ways to convert picture information into text include manual description and open-domain picture recognition and text conversion. Among them, manual description means that users manually edit the content of pictures by observing pictures, and open...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06T11/60G06F16/55G06N3/04
CPCG06T11/60G06F16/55G06N3/045G06F18/214G06F18/24
Inventor 李长亮樊骏锋汪美玲
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products