Atlas data reduction method based on PDF file analysis
A file analysis and spectrum technology, applied in the fields of text database querying, unstructured text data retrieval, and special data processing applications. It addresses the problems of incomplete report data and poor analysis of spectra, and it facilitates automatic analysis, unified management, and rapid analysis of results.
Examples
Embodiment 1
[0036] For the target PDF page, see Figure 2. The PDF spectrum in this embodiment contains a coordinate axis frame 2 drawn by an LTCurve object and an integral line 5 drawn by an LTLine object; see Figure 3.
[0037] 1. Use software to parse the PDF. Among the path objects (Path Object) generated in the file with the PDF page as the reference, find those that Pdfminer defines as LTRect objects. Calculate the maximum values of x1-x0 and y1-y0 among the properties of these objects, then analyze the position information of the qualifying LTRect object to obtain the spectrum range 1.
[0038] 2. Use software to parse the PDF and find the path objects (Path Object) that display the spectrum, generated in the file with the PDF page as the reference. Pdfminer defines this type of path object as an LTCurve object. Identify the LTCurve objects and distinguish the axis frame 2 from the spectrum curve 4; see Figure 5. Analyze the path ...
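Step 1 above can be illustrated with a minimal sketch. It is one possible reading of that step, not the patented implementation: it assumes the qualifying rectangle is the one whose width (x1-x0) and height (y1-y0) both equal the maxima over all rectangles. The names `Box`, `spectrum_range`, and `page_rects` are hypothetical. `page_rects` assumes the pdfminer.six package and that the rectangles are top-level objects on the page.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class Box:
    """Hypothetical stand-in for any object exposing x0, y0, x1, y1,
    as pdfminer layout objects such as LTRect do."""
    x0: float
    y0: float
    x1: float
    y1: float


def spectrum_range(rects: Iterable) -> Optional[Tuple[float, float, float, float]]:
    """Return (x0, y0, x1, y1) of the rectangle whose width and height
    both equal the maxima over all rectangles, or None if there is none."""
    rects = list(rects)
    if not rects:
        return None
    max_w = max(r.x1 - r.x0 for r in rects)
    max_h = max(r.y1 - r.y0 for r in rects)
    for r in rects:
        # Assumption: one rectangle attains both maxima (the plot area).
        if abs((r.x1 - r.x0) - max_w) < 1e-6 and abs((r.y1 - r.y0) - max_h) < 1e-6:
            return (r.x0, r.y0, r.x1, r.y1)
    return None


def page_rects(pdf_path: str, page_index: int = 0) -> list:
    """Collect top-level LTRect objects from one page (requires pdfminer.six)."""
    from pdfminer.high_level import extract_pages
    from pdfminer.layout import LTRect

    for i, page in enumerate(extract_pages(pdf_path)):
        if i == page_index:
            return [obj for obj in page if isinstance(obj, LTRect)]
    return []


if __name__ == "__main__":
    demo = [Box(0, 0, 10, 10), Box(50, 40, 550, 400), Box(5, 5, 20, 8)]
    print(spectrum_range(demo))
```

With a real file, `spectrum_range(page_rects("report.pdf"))` would be the equivalent call, where `report.pdf` is a placeholder path. Note that in pdfminer.six, LTRect and LTLine are subclasses of LTCurve, so a check such as `isinstance(obj, LTCurve)` also matches rectangles and lines.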
Embodiment 2
[0048] The analyzed spectrum is the same as in Embodiment 1 and the implementation is similar. The difference is that the specific points selected for calculation are specific point 13 on the ordinate axis and specific point 11, a point on the abscissa axis with an identifiable scale mark (see Figure 3), instead of finding the absolute coordinates of a specific point by reading the data summary table.
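The role of the two specific points can be sketched as a per-axis linear calibration from relative (PDF page) coordinates to absolute (data) coordinates. This is an illustrative assumption, not a formula stated in the text: it presumes linear axes and that both the relative and absolute coordinates of each point are fully known. The function names and all numbers below are hypothetical.

```python
from typing import Callable, Tuple

Point = Tuple[float, float]


def axis_map(rel_a: float, abs_a: float, rel_b: float, abs_b: float) -> Callable[[float], float]:
    """Build a linear map for one axis from two (relative, absolute) pairs."""
    if rel_a == rel_b:
        raise ValueError("reference points must differ along this axis")
    scale = (abs_b - abs_a) / (rel_b - rel_a)
    return lambda rel: abs_a + (rel - rel_a) * scale


def calibrate(p_rel: Point, p_abs: Point, q_rel: Point, q_abs: Point) -> Callable[[Point], Point]:
    """Return a function converting relative page coordinates to absolute ones."""
    fx = axis_map(p_rel[0], p_abs[0], q_rel[0], q_abs[0])
    fy = axis_map(p_rel[1], p_abs[1], q_rel[1], q_abs[1])
    return lambda pt: (fx(pt[0]), fy(pt[1]))


if __name__ == "__main__":
    # Hypothetical values: a point on the ordinate axis and a point on the
    # abscissa axis, each with known page position and known data value.
    to_abs = calibrate(
        p_rel=(100.0, 300.0), p_abs=(0.0, 50.0),   # point on the ordinate axis
        q_rel=(300.0, 100.0), q_abs=(100.0, 0.0),  # point on the abscissa axis
    )
    print(to_abs((200.0, 200.0)))
```

Once such a map exists, it can be applied to every relative coordinate of the spectrum curve to recover the absolute data series. Two points that share the same x or the same y position leave that axis undetermined, which is why the sketch raises an error in that case.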
Embodiment 3
[0050] The analyzed spectrum is the same as in Embodiment 1 and the implementation is similar. The difference is that one of the specific points selected for calculation is the starting point of the spectrum, specific point 12. In a spectrum similar to the one in this embodiment, the starting point is usually the origin by default, so its relative coordinate is the first coordinate in the relative coordinate data of the spectrum, and its absolute coordinates are (0, 0). The other specific point is specific point 11, a point on the abscissa axis with an identifiable scale mark (see Figure 3). These points are used instead of finding the absolute coordinates of a specific point by reading the data summary table.