OCR-based image conversion method, apparatus, device and readable storage medium

An OCR-based image conversion technology, applied to methods, apparatuses, devices and readable storage media, that addresses low document recognition efficiency, high resource consumption, and the inability to recognize text layout, table content and image content together with the text. It improves conversion accuracy and avoids format confusion.

Active Publication Date: 2019-06-25
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0005] The embodiments of the present application provide an OCR-based image conversion method, apparatus, device and readable storage medium, which can solve the problem that text layout, table content, image content and the like cannot be recognized together with the text, so that recognition consumes a lot of resources and document recognition efficiency is low.



Embodiment Construction

[0090] To make the purpose, technical solution and advantages of the present application clearer, the implementations of the present application are described in further detail below with reference to the accompanying drawings.

[0091] First, the terms used in the embodiments of this application are briefly introduced:

[0092] Optical Character Recognition (OCR): the process of converting the text in a document to be recognized into a text format through character recognition. The OCR process usually goes through steps such as inputting the file to be recognized, extracting text features, and comparison and recognition.
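The three stages named in [0092] (input, feature extraction, comparison and recognition) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the 3x3 glyph bitmaps, the `TEMPLATES` table and the function names are hypothetical and are not taken from the patent, and real OCR engines use far richer features and classifiers.

```python
# Toy illustration of the OCR stages named above. The 3x3 glyph templates
# and all function names are hypothetical, not taken from the patent.
TEMPLATES = {
    "I": ("010", "010", "010"),
    "L": ("100", "100", "111"),
    "T": ("111", "010", "010"),
}


def extract_features(glyph):
    """Feature extraction: flatten the bitmap rows into a tuple of 0/1 ints."""
    return tuple(int(px) for row in glyph for px in row)


def recognize(glyph):
    """Comparison and recognition: return the label of the template that
    differs from the input glyph in the fewest pixels."""
    feats = extract_features(glyph)

    def distance(label):
        template_feats = extract_features(TEMPLATES[label])
        return sum(a != b for a, b in zip(feats, template_feats))

    return min(TEMPLATES, key=distance)


def ocr(glyphs):
    """Input stage: take a sequence of glyph bitmaps and return the text."""
    return "".join(recognize(g) for g in glyphs)
```

Nearest-template matching is used only because it keeps the comparison step visible; a glyph with a single flipped pixel still maps to its closest template.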

[0093] Image to be converted: the image whose image content is to be converted into a target document. Optionally, the image to be converted may be at least one of a photo, a picture, and a Portable Document Format (PDF) file....


Abstract

The invention discloses an OCR-based image conversion method, apparatus, device and readable storage medium, and relates to the field of artificial intelligence. The method comprises: obtaining an image to be converted; performing layout segmentation on the image to be converted according to its image content to obtain n image layouts, each image layout corresponding to a content type, n being a positive integer; processing the image content in each image layout according to the content type corresponding to that layout, to obtain the converted content corresponding to the layout; and adding the converted content corresponding to the n image layouts to an electronic document to obtain a target document. Because the image to be converted is segmented by image content into n image layouts with corresponding content types, and the content of each layout is processed according to the layout's type, different types of content in the image are recognized and processed in different ways, which improves the accuracy of converting the image into a document.
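The flow in the abstract (segment into n typed layouts, process each layout according to its content type, append the results to a document) can be sketched as a simple dispatch table. This is a minimal sketch under stated assumptions, not the patented implementation: the type names `"text"`, `"table"` and `"image"`, the `Layout` class, the handler functions and the stubbed `segment` step are all hypothetical, and the input "image" is represented as a pre-labelled list of regions because the actual segmentation algorithm is not given in this excerpt.

```python
from dataclasses import dataclass


@dataclass
class Layout:
    content_type: str  # assumed type names: "text", "table", "image"
    content: object    # stand-in for the pixels inside this layout region


# Hypothetical per-type processors; a real system would run OCR, table
# reconstruction, or image cropping here.
def process_text(content):
    return {"kind": "paragraph", "text": str(content)}


def process_table(content):
    return {"kind": "table", "rows": [list(row) for row in content]}


def process_image(content):
    return {"kind": "picture", "data": content}


HANDLERS = {
    "text": process_text,
    "table": process_table,
    "image": process_image,
}


def segment(image):
    """Placeholder for layout segmentation: the 'image' here is already a
    list of (content_type, content) pairs, one per layout region."""
    return [Layout(content_type, content) for content_type, content in image]


def convert(image):
    """Segment the image, process each layout by its type, and append the
    converted content to the target document in layout order."""
    document = []
    for layout in segment(image):
        handler = HANDLERS[layout.content_type]
        document.append(handler(layout.content))
    return document
```

Keeping the handlers in a dictionary keyed by content type means a new type can be supported by adding one entry, without touching `convert`.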

Description

Technical field

[0001] The embodiments of the present application relate to the field of artificial intelligence, and in particular to an OCR-based image conversion method, apparatus, device, and readable storage medium.

Background

[0002] Optical Character Recognition (OCR) is a function for recognizing characters in an image. Usually, a user inputs an image containing characters into an optical character recognition module and obtains an output result, which includes the characters recognized in the image. OCR technology can be applied in many fields, such as license plate recognition and document conversion, where document conversion refers to converting an image that includes characters into an editable document.

[0003] In related technologies, in the process of document conversion, after an image with characters is input into the document conversion module, the document conversion module recognizes the characters in the image through OCR, and pa...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/22, G06K9/00, G06T5/00, G06T5/40, G06T7/11, G06V10/44, G06V30/10, G06V30/146
CPC: G06F40/106, G06F40/279, G06F40/177, G06V30/414, G06V10/44, G06V30/10, G06V30/146, G06V30/1463, G06V30/15, G06F18/214
Inventor: 陈星耀黄灿芦胡文灿陈贻东林汉权黄飞柯戈扬杨志权
Owner: TENCENT TECH (SHENZHEN) CO LTD