Method and device for extracting document structure
A document structure and document technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve problems such as cumbersome operations and achieve the effect of improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0013] The present invention will be described in detail below with reference to the accompanying drawings and in combination with embodiments.
[0014] figure 1 A flowchart showing a method for extracting a document structure according to an embodiment of the present invention, including:
[0015] Step S10, obtaining the object of the document;
[0016] Step S20, converting the object into a predefined standard format;
[0017] Step S30, identifying and labeling each item in the object in the standard format;
[0018] Step S40, extracting the content of each matched item to organize structured data about the document.
[0019] Commonly used electronic documents are in various formats such as PDF and WORD. The existing document structure recognition technology cannot identify objects in documents of different formats at the same time. Therefore, different processing methods and systems can only be used for many different document formats. It is cumbersome, heavy workload, ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com