Table picture analysis method and system

An analysis method and table technology, applied in special data processing applications, instruments, network data indexing, etc., can solve the problems of manual sorting and time-consuming

Active Publication Date: 2020-06-05
望海康信(北京)科技股份公司
View PDF11 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, there is no good technical solution to automatically parse table images into s...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Table picture analysis method and system
  • Table picture analysis method and system
  • Table picture analysis method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Embodiments and examples of the present invention will be described in detail below with reference to the drawings.

[0025] The scope of applicability of the present invention will become apparent from the detailed description given below. It should be understood, however, that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are given for purposes of illustration only.

[0026] figure 2 A flow chart of a preferred embodiment of the method for parsing tables and pictures according to the present invention is shown.

[0027] In step S202, the table picture to be parsed, for example figure 1 The table picture shown in uses OCR (Optical Character Recognition) technology to identify the text content and text position in the table picture to obtain a triplet set. Each triplet is composed of the text and the horizontal and vertical coordinates of the text location. The OCR technology can use any suitable OCR tech...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a table picture analysis method and system, and the method comprises the steps: recognizing the text content and text position in a table picture, obtaining a triple set, whereeach triple consists of a text, a position horizontal coordinate and a position vertical coordinate; inputting the table picture into a segmentation model containing the trained convolutional neuralnetwork to obtain segmentation line information of the table; and combining the segmentation line information with the triple set according to a preset rule to form a structured table document. According to the method, the table picture can be automatically converted into the structured data, so that the labor cost and time are greatly saved.

Description

technical field [0001] The present application relates to the field of electrical digital data processing, in particular to a form and picture analysis method and system. Background technique [0002] A lot of data on the Internet is presented in the form of pictures. In the process of crawling data from the Internet, you will encounter a large number of tables in the form of pictures, which look very regular. However, after the data is captured, it is necessary to convert it into tabular data or enter it into the database. The image needs to be parsed. At present, there is no good technical solution to automatically parse table images into standard text data (structured) format, which can only be sorted manually, which is time-consuming. Therefore, a method is urgently needed to solve this problem. Contents of the invention [0003] In order to overcome the deficiencies in the prior art, the present invention provides a table image analysis method and system, which can...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/951G06K9/00G06K9/34G06N3/04
CPCG06F16/951G06V30/412G06V10/267G06N3/045Y02D10/00
Inventor 齐昱曹海峰
Owner 望海康信(北京)科技股份公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products