Supercharge Your Innovation With Domain-Expert AI Agents!

PDF (Portable Document Format) file information analysis method and device

An analysis method and technology of file information, applied in the field of data analysis, can solve problems such as low analysis efficiency and information loss, and achieve more information, improve the logic and readability of files

Pending Publication Date: 2021-06-25
善诊(上海)信息技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of the above problems in the prior art, the purpose of this paper is to provide a PDF file information analysis method and device to solve the problems of low analysis efficiency and loss of a large amount of information when analyzing and processing PDF file information in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • PDF (Portable Document Format) file information analysis method and device
  • PDF (Portable Document Format) file information analysis method and device
  • PDF (Portable Document Format) file information analysis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] The following will clearly and completely describe the technical solutions in the embodiments herein in conjunction with the accompanying drawings in the embodiments herein. Obviously, the described embodiments are only some of the embodiments herein, not all of them. Based on the embodiments herein, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts fall within the scope of protection herein.

[0060]It should be noted that the terms "first" and "second" in the description and claims herein and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments herein described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a PDF file information analysis method and device.The method comprises the steps that a PDF file to be processed is analyzed, a plurality of elements and position information and feature information of the elements are obtained, and the elements comprise character elements and non-character elements; And according to the position information of the element, the feature information is inserted into the corresponding element of the PDF file. According to the PDF file information analysis method and device provided by the invention, each element in the PDF file can be identified, and the identified feature information of each element can be inserted beside the corresponding element according to the position information of the element, so that the information of the file is more comprehensive, and the logicality and readability of the file are improved.

Description

technical field [0001] The invention relates to the technical field of data analysis, in particular to a PDF file information analysis method and device. Background technique [0002] PDF (Portable Document Format, portable document format) file is a widely used electronic file format, which can encapsulate information such as text, font, format, color and graphics, and has the advantages of less storage space, easy transmission, and high compatibility. , not easy to be tampered with and so on. PDF files are mainly used to represent (view or print) document typesetting on a two-dimensional plane, not to edit (similar to word) or save and transmit structured data. It is difficult to restore the data to the original PDF file for production structured data. For example, a text paragraph in a PDF file appears to the reader to be composed of lines of text, but in fact these texts are independently positioned on the plane in the form of characters combined with two-dimensional c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/166G06K9/00G06K9/34
CPCG06F40/166G06V30/40G06V30/153G06V30/10
Inventor 方政
Owner 善诊(上海)信息技术有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More