PDF drawing character recognition method, system and device

A text recognition and image recognition technology, applied in the field of image processing, that addresses problems such as the lack of a good existing solution and the lack of special-area extraction and recognition.

Pending Publication Date: 2020-07-10
深圳新致软件有限公司

AI Technical Summary

Problems solved by technology

With the popularization of smart phones, the traditional method does not have a good solution for the low-quality PDF images taken by personal mobile ph...


Embodiment Construction

[0036] The following description serves to disclose the present invention to enable those skilled in the art to carry out the present invention. The preferred embodiments described below are only examples, and those skilled in the art can devise other obvious variations. The basic principles of the present invention defined in the following description can be applied to other embodiments, variations, improvements, equivalents and other technical solutions without departing from the spirit and scope of the present invention.

[0037] It can be understood that the term "a" should be read as "at least one" or "one or more". That is, in one embodiment the number of an element can be one, while in another embodiment the number of that element can be more than one; the term "a" is not to be understood as a limitation on quantity.

[0038] The present invention relates to a computer program. Figure 1 shows the flow chart of a kind of ...

Abstract

The invention provides a PDF drawing character recognition method, system and device. The method comprises three steps: an optical character recognition step based on deep learning; a customized recognition and universal recognition step; and a low-quality mobile device image recognition step. The deep-learning optical character recognition step comprises detecting the areas of a scene that contain characters and recognizing the characters in those areas, where text detection is based on the CTPN, SegLink, TextBoxes, FTSN, PixelLink and CRAFT algorithms, and character recognition is based on the CNN and CRNN algorithms. The customized recognition step comprises identifying the PDF drawing type from the table text or frame contents in the PDF, extracting the contents of the region according to its structured features, extracting the key area, and then either recognizing the characters in that area or extracting the key characters through a deep neural network.
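
The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the `detect` and `recognize` callables stand in for a text detector (for example CTPN or CRAFT) and a recognizer (for example CRNN), and every name here, including `TYPE_KEYWORDS`, `classify_drawing` and `recognize_drawing`, is hypothetical. The keyword-overlap rule for picking the drawing type is also an assumption, since the source only says the type is identified from table text or frame contents.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


@dataclass
class TextRegion:
    box: Box
    text: str


# Hypothetical keywords expected in the table or frame text of each drawing type.
TYPE_KEYWORDS: Dict[str, set] = {
    "floor_plan": {"scale", "floor"},
    "wiring_diagram": {"circuit", "voltage"},
}


def run_ocr(image, detect: Callable, recognize: Callable) -> List[TextRegion]:
    """Detect text areas, then recognize the characters inside each one."""
    return [TextRegion(box, recognize(image, box)) for box in detect(image)]


def classify_drawing(regions: List[TextRegion]) -> str:
    """Pick the drawing type whose keywords overlap most with the OCR text."""
    words = {w.lower() for r in regions for w in r.text.split()}
    best, best_hits = "generic", 0
    for name, keywords in TYPE_KEYWORDS.items():
        hits = len(words & keywords)
        if hits > best_hits:
            best, best_hits = name, hits
    return best


def inside(box: Box, area: Box) -> bool:
    """True if the center of `box` lies within `area`."""
    cx, cy = (box[0] + box[2]) / 2, (box[1] + box[3]) / 2
    return area[0] <= cx <= area[2] and area[1] <= cy <= area[3]


def recognize_drawing(image, detect, recognize, key_areas: Dict[str, Box]):
    """Customized recognition for known types, universal recognition otherwise."""
    regions = run_ocr(image, detect, recognize)
    kind = classify_drawing(regions)
    if kind in key_areas:
        # Customized path: keep only text inside the key area for this type.
        regions = [r for r in regions if inside(r.box, key_areas[kind])]
    # Universal path: all recognized text is returned unchanged.
    return kind, [r.text for r in regions]
```

Passing the detector and recognizer in as callables keeps the sketch independent of any particular model, so either stage can be swapped without touching the type-dispatch logic.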

Description

Technical field

[0001] The invention relates to the field of image processing, in particular to a method, system and equipment for character recognition of PDF drawings.

Background technique

[0002] Artificial intelligence has achieved rapid development in terms of data, algorithms, and computing power, ushering in a new wave of development in the context of the digital transformation of the global economy. The influence of this wave of artificial intelligence is far greater than before, and the most notable feature is that the influence has spread from the professional field to the popular field.

[0003] PDF high-precision recognition is a mature technology in today's market, and methods based on traditional OCR and deep learning are also used in various industries. The recognition of bank notes, PDF form recognition and industrial drawing recognition are all widely used and mature technologies. The recognition of formatted and templated PDFs has achieved remarkable res...


Application Information

IPC(8): G06K9/00, G06K9/32, G06N3/04
CPC: G06V30/422, G06V30/414, G06V20/62, G06V30/10, G06N3/045
Inventor: 张东锋, 曾雏鹏, 李俊波
Owner: 深圳新致软件有限公司