Table parsing method and device in document image

A table parsing method for document images, applied in the field of data processing. It addresses the poor adaptability of existing approaches and their inability to parse varied kinds of tables, and improves parsing efficiency and accuracy.

Active Publication Date: 2018-08-17
BEIJING ABC TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the poor adaptability of existing methods and their inability to effectively parse various kinds of tables, and to provide a method and device for parsing tables in document images.


Examples


Detailed Description of the Embodiments

[0034] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0035] Referring to Figure 1, a method for parsing a table ...



Abstract

The invention relates to a method and device for parsing tables in document images. The method comprises: detecting a table area in a document image to be parsed by using a pre-trained table detection model; detecting the character blocks inside the table area by using a pre-trained character detection model; determining the spatial structure of the table; and, according to that spatial structure, performing character recognition on the character blocks in each table cell to obtain editable structured data. The method and device apply to various tables, such as lined tables, unlined tables, and black-and-white tables, and provide a simple and effective solution for table parsing in document images.
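The last two steps of the abstract (determining the spatial structure, then recognizing text per cell) can be illustrated with a minimal sketch. It is not the patented procedure: the source does not say how the spatial structure is determined, so the sketch assumes a simple heuristic in which character blocks whose vertical extents overlap share a row and blocks whose horizontal extents overlap share a column. The names `infer_grid`, `parse_table`, and `recognize` are hypothetical, the two detection models are not implemented, and `recognize` stands in for whatever character recognizer is used.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels; assumed block format


def _merge_bands(intervals: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Merge overlapping 1-D intervals into bands, sorted by start."""
    bands: List[List[int]] = []
    for lo, hi in sorted(intervals):
        if bands and lo < bands[-1][1]:  # overlaps the current band
            bands[-1][1] = max(bands[-1][1], hi)
        else:
            bands.append([lo, hi])
    return [(lo, hi) for lo, hi in bands]


def _band_index(bands: List[Tuple[int, int]], lo: int, hi: int) -> int:
    """Return the index of the band that contains the interval's midpoint."""
    mid = (lo + hi) / 2
    for i, (b_lo, b_hi) in enumerate(bands):
        if b_lo <= mid <= b_hi:
            return i
    raise ValueError("interval lies outside every band")


def infer_grid(boxes: List[Box]) -> List[List[List[Box]]]:
    """Assign each character block to a (row, column) cell.

    Assumed heuristic: rows are bands of vertically overlapping blocks,
    columns are bands of horizontally overlapping blocks.
    """
    rows = _merge_bands([(y0, y1) for _, y0, _, y1 in boxes])
    cols = _merge_bands([(x0, x1) for x0, _, x1, _ in boxes])
    grid: List[List[List[Box]]] = [[[] for _ in cols] for _ in rows]
    for box in boxes:
        x0, y0, x1, y1 = box
        grid[_band_index(rows, y0, y1)][_band_index(cols, x0, x1)].append(box)
    return grid


def parse_table(boxes: List[Box], recognize: Callable[[Box], str]) -> List[List[str]]:
    """Recognize the text of every cell and return rows of strings."""
    grid = infer_grid(boxes)
    return [[" ".join(recognize(b) for b in sorted(cell)) for cell in row]
            for row in grid]


# Made-up character blocks for a 2x2 table; the dict stands in for a recognizer.
texts = {
    (10, 10, 50, 30): "Name", (100, 12, 160, 28): "Qty",
    (12, 50, 40, 70): "Pen", (105, 52, 150, 68): "3",
}
boxes = list(texts)
table = parse_table(boxes, texts.__getitem__)
print(table)
```

The overlap heuristic relies only on block positions, not on ruling lines, which is why it can be applied to lined and unlined tables alike. It does not handle merged cells or skewed scans.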

Description

Technical field

[0001] The invention relates to the technical field of data processing, and in particular to a method and device for parsing tables in document images.

Background

[0002] In recent years, as information has become increasingly digitized, the volume of data held as document images has grown massively. Extracting information from document images into structured data makes it easier to build indexes and support search, and supplies quantitative data for scientific research, engineering, statistics, strategy formulation, market research, and similar work.

[0003] As the most compact way of recording and summarizing data, tables are the basis of data analysis. Automatically recognizing table data in document images and restoring the table content in the picture to structured data can improve the efficiency of data collection.

[0004] Generally, there are two types of tables in a document, ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06V30/10
CPC: G06V30/412, G06V30/10, G06V30/153, G06V30/413, G06V30/414
Inventor: 余宙, 杨永智, 汪贤
Owner: BEIJING ABC TECH CO LTD