
A method and apparatus for parsing a document table in a portable document format

A technique for parsing tables in Portable Document Format (PDF) documents, applied in the field of data recognition, that improves the utilization efficiency and the utilization value of the data.

Pending Publication Date: 2019-03-08
ZHONGKE DINGFU BEIJING TECH DEV

AI Technical Summary

Problems solved by technology

However, because a stored PDF document does not record the location of the tables it contains, parsing the data in those tables so as to improve the utilization efficiency and value of that data remains a technical difficulty.




Embodiment Construction

[0056] To make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of this application, not all of them. The components of the embodiments, as generally described and illustrated in the figures herein, may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments provided in the accompanying drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments of the present application, all other embodiments obtained by those skilled in the art without...



Abstract

A method and apparatus for parsing a table in a Portable Document Format (PDF) document are provided. The method includes: determining a PDF page that contains a table in a PDF document; converting the PDF page into a picture; identifying the cells of the table in the picture; determining the coordinate information of each cell in the PDF page; and identifying the data in each cell according to that cell's coordinate information. The method and apparatus can effectively improve the efficiency of data utilization.
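The last two steps of the abstract (locating a cell in the PDF page, then reading the data inside it) can be illustrated with a minimal Python sketch. The abstract does not say how the coordinates are computed, so everything here is an assumption made for illustration: the picture is taken to be rendered at a known DPI with its origin at the top-left, the PDF page is taken to use the default 72 points per inch with its origin at the bottom-left, and the page text is taken to be already available as word centers with positions. The names `Box`, `pixel_box_to_pdf` and `text_in_cell` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass

POINTS_PER_INCH = 72.0  # default PDF user-space unit (assumed, no page scaling)


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle with x0 <= x1 and y0 <= y1."""
    x0: float
    y0: float
    x1: float
    y1: float

    def contains(self, x: float, y: float) -> bool:
        return self.x0 <= x <= self.x1 and self.y0 <= y <= self.y1


def pixel_box_to_pdf(cell_px: Box, page_height_pt: float, dpi: float) -> Box:
    """Map a cell rectangle from picture pixels to PDF page points.

    Assumes the picture has its origin at the top-left with y growing
    downward, and the PDF page has its origin at the bottom-left with
    y growing upward, so the y-axis is flipped after scaling.
    """
    scale = POINTS_PER_INCH / dpi
    return Box(
        x0=cell_px.x0 * scale,
        y0=page_height_pt - cell_px.y1 * scale,
        x1=cell_px.x1 * scale,
        y1=page_height_pt - cell_px.y0 * scale,
    )


def text_in_cell(cell_pt: Box, words: list[tuple[float, float, str]]) -> str:
    """Join the words whose center (x, y), in PDF points, lies inside the cell."""
    hits = [w for w in words if cell_pt.contains(w[0], w[1])]
    # Reading order: top of the page first (larger y), then left to right.
    hits.sort(key=lambda w: (-w[1], w[0]))
    return " ".join(text for _, _, text in hits)


# Usage: a cell detected at pixels (100, 200)-(300, 260) on a page
# rendered at 144 DPI, where the page is 792 points tall.
cell = pixel_box_to_pdf(Box(100, 200, 300, 260), page_height_pt=792.0, dpi=144.0)
words = [(60.0, 680.0, "Revenue"), (100.0, 680.0, "2018"), (200.0, 680.0, "other")]
print(text_in_cell(cell, words))
```

Detecting the cells in the picture and extracting positioned words from the PDF are left out on purpose; both depend on tooling that the abstract does not specify.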

Description

Technical field

[0001] The present application relates to the technical field of data recognition, and in particular to a method and device for parsing tables in Portable Document Format (PDF) documents.

Background

[0002] PDF is an electronic document format that is independent of hardware and applications. It is cross-platform and secure, and it has become one of the most widely used electronic document formats. Today, a large number of enterprises and institutions store documents in PDF format.

[0003] With the widespread use of PDF documents, a large amount of valuable data is stored and presented in the form of tables in PDF documents. For data in a PDF document such as text and characters, some algorithms can parse the data in the stored document so that the parsed data can be reused, improving the utilization efficiency of the data. However, for the data contained in the table of the...


Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F17/21, G06F17/22, G06F17/24
CPC: G06F40/183, G06F40/106, G06F40/151
Inventors: 房平会, 尚继耀, 杨宇
Owner ZHONGKE DINGFU BEIJING TECH DEV