Method for converting PDF file to XML file
A document conversion and document technology, applied in the field of information conversion, can solve problems such as not seeing, and achieve the effect of improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] 1. The specific design and implementation of the module
[0026] 1. Intermediate document generation module:
[0027] The intermediate document generation module 7 is designed to convert the PDF source document 1 into an easy-to-handle intermediate format, and then perform rule-based automatic XML document conversion on the intermediate format.
[0028] The implementation of this module has two key points:
[0029] (1) Definition of the structure of the intermediate document.
[0030] The requirements for the structure design of the intermediate document are as follows: first, it can describe the format characteristics and layout structure information of the source document, which is the basis for the automatic extraction module 9 rule matching; second, the conversion from the PDF document to the intermediate document should preferably be relatively easy conduct.
[0031] (2) Design a parser for PDF documents to generate intermediate documents that meet the above req...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com