Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Table recognition method and device, computer device and storage medium

A table and format technology, applied in the field of image recognition, can solve problems such as difficulty in extracting table data, and achieve the effect of improving processing efficiency

Pending Publication Date: 2019-10-15
PING AN TECH (SHENZHEN) CO LTD
View PDF7 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This makes it difficult to extract tabular data from PDF documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Table recognition method and device, computer device and storage medium
  • Table recognition method and device, computer device and storage medium
  • Table recognition method and device, computer device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0037] The form identification method provided by this application can be applied to such as figure 1 shown in the application environment. Wherein, the terminal 110 communicates with the server 120 through a network. The user can send the target document in PDF format to the server 120 through the terminal 110, and the server 120 acquires the target document and executes the form recognition method. Wherein, the terminal 110 can be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers and portable wearable devices, and the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a table recognition method and device, a computer device and a storage medium. The method comprises the steps of obtaining a target document of which the document format is aPDF format; determining a table area where table content in the target document is located through a pre-trained table positioning model; cutting table content in the table area from the target document, and generating a corresponding table picture according to the table content; performing image recognition on the table picture, and determining characters in the table picture and position information of the characters; and generating a corresponding table file in a preset format according to the character and the position information. By adopting the method based on the image detection technology, the table data can be accurately extracted from the PDF document.

Description

technical field [0001] The present application relates to the technical field of image recognition, in particular to a form recognition method, device, computer equipment and storage medium. Background technique [0002] With the development of computer technology, more and more document formats have been developed and widely used, such as documents in PDF (Portable Document Format, Portable Document Format) format. Among them, PDF is a widely used electronic document format. Now more and more professional materials, e-books, product descriptions and e-mails are beginning to use PDF format documents. [0003] A document in PDF format is a document that cannot be directly edited, and many professional data are displayed in the form of PDF files. When it comes to tabular data, it is common to convert the table into an image in advance, and then embed the table image into the PDF document. PDF documents also have no special definition of tabular data, but only the combinatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/20G06N3/04G06N3/08
CPCG06N3/08G06V30/412G06V10/22G06N3/045
Inventor 高梁梁孙双双
Owner PING AN TECH (SHENZHEN) CO LTD
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More