Supercharge Your Innovation With Domain-Expert AI Agents!

A Table Structure Recognition Method Based on Graph Neural Network

A neural network and table structure technology, applied in the field of image recognition, can solve problems such as loss of information in pdf files, inability to accurately parse, and inability to accurately identify complex tables, and achieve the effect of optimizing classification results

Active Publication Date: 2021-09-03
杭州火石数智科技有限公司
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For tables in pdf, currently existing table parsing methods generally include parsing tables by reading xml information of pdf (such as xpdf tool), converting pdf to xml, html, word and other formats and then parsing (such as pdf-docx tool ), converting the pdf into an image and then performing structural recognition. The first two methods cannot be accurately analyzed due to the information loss of the pdf file itself. The third method mainly relies on image recognition algorithms. The existing methods are not suitable for complex forms. can accurately identify

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Table Structure Recognition Method Based on Graph Neural Network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] In order to make the above objects, features and advantages of the present invention more comprehensible, specific implementations of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0036] In the following description, a lot of specific details are set forth in order to fully understand the present invention, but the present invention can also be implemented in other ways different from those described here, and those skilled in the art can do it without departing from the meaning of the present invention. By analogy, the present invention is therefore not limited to the specific examples disclosed below.

[0037] Such as figure 1 As shown, the present invention provides a form structure recognition method based on a graph neural network, which converts documents in other formats into images, identifies the position of the form for each input image, and intercepts the form area. The table area identifies text blo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a form structure recognition method based on a graph neural network. The method converts each page of a pdf document into an image, recognizes the position of the form for each input image, and intercepts the form area; for the form Region recognition text blob blocks; find adjacent blob sets for each blob, thereby establishing a blob graph structure; establish a dual graph structure for a blob graph, and transform the graph node connection prediction problem into a graph node classification problem; train graph node classification model; organize the relationship between blobs to obtain the cell structure of the table; the present invention applies the graph neural network to the table structure recognition, models the table structure recognition as graph node classification, and adds feedback adjustment network and condition random For airports, the classification results of graph nodes are corrected based on the rationality of the overall structure of the table, which improves the recognition accuracy.

Description

technical field [0001] The invention relates to image recognition technology, in particular to a table structure recognition method based on a graph neural network. Background technique [0002] In the application scenarios of big data and artificial intelligence, it is necessary to collect, process, and analyze a large amount of information, structure the data, and discover the laws in the data to guide production. Information exists in various and unstructured ways. A large amount of information exists in tables, and tables may exist in pdfs, web pages, and images. [0003] For tables in pdf, currently existing table parsing methods generally include parsing tables by reading xml information of pdf (such as xpdf tool), converting pdf to xml, html, word and other formats and then parsing (such as pdf-docx tool ), converting the pdf into an image and then performing structural recognition. The first two methods cannot be accurately analyzed due to the information loss of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/00G06K9/62
CPCG06V30/413G06V30/10G06F18/2414
Inventor 杨红飞金霞韩瑞峰
Owner 杭州火石数智科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More