Picture table content extraction method based on computer vision and natural language processing

A natural language processing and computer vision technology, applied in natural language data processing, computer components, computing, etc., to solve problems such as inability to "understand" tables

Active Publication Date: 2022-01-28
南京跑码地计算技术有限公司
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a table content extraction method based on computer vision and natural language processing, using border detection, OCR, text classification, etc. Technology, develo

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Picture table content extraction method based on computer vision and natural language processing
  • Picture table content extraction method based on computer vision and natural language processing
  • Picture table content extraction method based on computer vision and natural language processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to further understand the structure, features and purpose of the present invention, it is now described as follows in conjunction with the accompanying drawings. The implementation illustrated in the drawings is only used to illustrate the technical solution of the present invention, not to limit the present invention.

[0035] Such as figure 1 As shown, the present invention discloses a table content extraction method based on computer vision and natural language processing, including five aspects of table border recognition, cell character recognition, table content classification, table layout reasoning, and structured table data. Proceed as follows:

[0036] Step 1: Input the picture containing the table into the table border recognition model, and recognize the table border in the picture. The recognition of the table border includes three parts: table area detection, cell area detection and table border recognition. Such as figure 2 As shown, the spec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a picture table content extraction method based on computer vision and natural language processing. The method comprises the following steps: 1, inputting a picture into a table frame identification model, identifying a table frame, and calculating coordinates of each cell in a table; 2, extracting the text content of each cell; 3, according to the extracted text content, labeling according to three types of keys, values and mixed values, constructing a table content classification data set, and training a cell content classification model based on the data set; 4, inferring a table layout according to the table coordinates, the cell coordinates and the category of each cell text; and 5, organizing the data in the table in a JSON format according to the layout information of the table, the content of the table cells and the category information. A natural language processing technology is introduced, the category of the content of each cell in the table is marked, the table layout is reasoned in combination with the position information of the cells, and finally the table content is output in a structured mode.

Description

technical field [0001] The invention relates to the technical field of table data extraction, in particular to a method for extracting table content from pictures based on computer vision and natural language processing. Background technique [0002] The application of information extraction based on computer vision and natural language processing technology is becoming more and more widespread, such as recognizing text from pictures, extracting entities such as names, place names, and phone numbers from text, and extracting key information from invoices, insurance policies, and other forms. Wait. At the same time, major cloud vendors also provide identification services based on cloud platforms for form data such as bills and contracts. [0003] Existing techniques for extracting tabular data mainly focus on two aspects. First, through traditional image processing methods, such as erosion, expansion, edge detection, contour recognition, etc., first identify the table in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06V30/412G06V30/414G06V30/19G06K9/62G06F16/35G06F16/31G06F40/289
CPCG06F16/353G06F16/313G06F40/289G06F18/217G06F18/214
Inventor 王国栋
Owner 南京跑码地计算技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products