Method for extracting structured information from continuous-page layout documents
The invention applies document structuring technology to the extraction of structured information from continuous-page layout documents. It addresses the ineffective processing, failure to consider the relationship between pages, and low accuracy of existing approaches, and aims to improve both the accuracy and the efficiency of structuring.
Detailed Description of the Embodiments
[0013] Exemplary embodiments of the present invention are described below with reference to the accompanying drawings. The description includes various details to aid understanding, and these details should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the invention. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted below. A method for structuring continuous-page layout documents includes the following steps:
[0014] 1. Parse the layout document and obtain its page information and text block information page by page, where:
[0015] a) Page information includes page size information
[0016] b) Text block information includes character internal code,...
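Step 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Only the page size and the character internal codes come from the text above; the class names, the `extra` mapping (a placeholder for the attributes the source elides), the `raw_pages` input shape, and the assumption that internal codes are Unicode code points are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable, List


@dataclass
class PageInfo:
    """Page-level information; the source names only the page size."""
    width: float
    height: float


@dataclass
class TextBlock:
    """Text block information; the source names the character internal codes.

    Attributes elided in the source are not guessed here. They can be
    carried in the open-ended `extra` mapping (hypothetical).
    """
    char_codes: List[int]
    extra: Dict[str, Any] = field(default_factory=dict)

    @property
    def text(self) -> str:
        # Assumption: internal codes are Unicode code points.
        return "".join(chr(code) for code in self.char_codes)


@dataclass
class ParsedPage:
    number: int
    info: PageInfo
    blocks: List[TextBlock]


def parse_layout_document(raw_pages: Iterable[Dict[str, Any]]) -> List[ParsedPage]:
    """Collect page and text block information page by page.

    `raw_pages` is a hypothetical, already-decoded representation: one dict
    per page with "size" as (width, height) and "blocks" as a list of strings.
    """
    parsed: List[ParsedPage] = []
    for number, raw in enumerate(raw_pages, start=1):
        width, height = raw["size"]
        blocks = [TextBlock([ord(ch) for ch in s]) for s in raw["blocks"]]
        parsed.append(ParsedPage(number, PageInfo(width, height), blocks))
    return parsed


if __name__ == "__main__":
    demo = [
        {"size": (595.0, 842.0), "blocks": ["Chapter 1", "First paragraph"]},
        {"size": (595.0, 842.0), "blocks": ["continued text"]},
    ]
    for page in parse_layout_document(demo):
        print(page.number, page.info, [b.text for b in page.blocks])
```

Keeping the per-page records in document order preserves the page sequence, which any later step that relates content across consecutive pages would need.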