Table identification method adaptive to multiple types of OCR recognition interfaces and related equipment

A recognition method and table technology, applied in the field of image recognition, can solve problems such as inability to accurately and effectively identify content

Active Publication Date: 2021-06-04
数库(上海)科技有限公司
View PDF10 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention aims at the technical problem that the content cannot be accurately and effectively recognized when the traditional OCR recognition a...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Table identification method adaptive to multiple types of OCR recognition interfaces and related equipment
  • Table identification method adaptive to multiple types of OCR recognition interfaces and related equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the present invention will be further described below in conjunction with specific diagrams.

[0059] A form recognition method adapted to multiple types of OCR recognition interfaces, comprising the following steps:

[0060] S1, receiving a request: receiving an extraction request, which includes documents and recognition patterns.

[0061] In bond disclosure announcements, many financial and annotated data tables usually include two categories, one is in the form of table documents, and more is in the form of pictures. Therefore, the documents received in this step include corresponding general form documents and pictures. The corresponding recognition mode includes one of the general form extraction mode, the picture normal form extraction mode and the picture frameless form extraction mode.

[0062] Before making an extraction request, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of picture recognition, and particularly relates to a table recognition method adaptive to multiple types of OCR recognition interfaces and related equipment. The method comprises the following steps: receiving an extraction request, wherein the extraction request comprises a document and an identification mode; according to the recognition mode, calling a preset external OCR interface, performing recognition processing on the document through the external OCR interface, and receiving recognition data returned by the external OCR interface; and generating table data from the identification data, and returning the table data. According to the invention, through the adaptation of a plurality of identification modes, most of the encountered OCR tables in the disclosed bond announcement can be identified basically, the coverage range is wider, and the identification rate is higher.

Description

technical field [0001] The invention belongs to the technical field of image recognition, and in particular relates to a form recognition method and related equipment adapted to multiple types of OCR recognition interfaces. Background technique [0002] In the bond disclosure announcement, many financial and annotated data tables are disclosed and displayed in the form of pictures, and OCR technology is required to extract information from these data tables. Adapting to multiple types of OCR interfaces to recognize picture tables is mainly a technology to deal with complex and diverse table styles according to different OCR recognition modes, and to process the recognition results accordingly, and finally generate unified table structure data. Usually, if you want to identify the table content in the image, most of the processing process is to first identify all the lines and text blocks, use the line information to calculate the table area and cell area, and then correspond...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/32G06K9/00G06F40/177
CPCG06F40/177G06V30/412G06V30/414G06V20/62G06V30/10
Inventor 曹峰黄夫龙
Owner 数库(上海)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products