Method for analyzing reading order of electronic layout file
A technique for analyzing the reading order of layout documents, applied in the information field, which can solve problems such as ambiguous block division.
Embodiment Construction
[0030] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be described in further detail below with reference to the embodiments and accompanying drawings.
[0031] As shown in Figure 1, the method for analyzing the reading order of an electronic layout file includes the following steps:
[0032] Extract original information from PDF files;
[0033] Identify headers and footers, and merge adjacent text content to obtain text line content;
[0034] Merge the text line content into blocks to obtain text block content;
[0035] Merge adjacent pictures to obtain image block content;
[0036] Analyze the path information to obtain the dividing lines in the horizontal direction;
[0037] Project the text block content and the image block content in the X direction to obtain the horizontally separated block content;
[0038] Using text block content, image block conte...
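The merging and projection steps above can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Box` type, the function names `merge_into_lines` and `horizontal_bands`, and the tolerances `y_tol` and `x_gap` are all hypothetical. The sketch also assumes that "projecting in the X direction" means collapsing each block onto the Y axis, so that the gaps between occupied y-intervals separate the page into horizontal bands.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Box:
    # Axis-aligned bounding box; y is assumed to increase down the page.
    x0: float
    y0: float
    x1: float
    y1: float
    text: str = ""


def merge_into_lines(frags: List[Box], y_tol: float = 2.0,
                     x_gap: float = 5.0) -> List[Box]:
    """Merge fragments on the same row whose horizontal gap is small."""
    # Group fragments into rows by comparing top edges (hypothetical rule).
    rows: List[List[Box]] = []
    for f in sorted(frags, key=lambda b: b.y0):
        if rows and abs(f.y0 - rows[-1][0].y0) <= y_tol:
            rows[-1].append(f)
        else:
            rows.append([f])

    lines: List[Box] = []
    for row in rows:
        row.sort(key=lambda b: b.x0)  # left-to-right within a row
        cur = row[0]
        for f in row[1:]:
            if f.x0 - cur.x1 <= x_gap:
                # Close enough: extend the current line's box and text.
                cur = Box(cur.x0, min(cur.y0, f.y0), max(cur.x1, f.x1),
                          max(cur.y1, f.y1), cur.text + f.text)
            else:
                lines.append(cur)
                cur = f
        lines.append(cur)
    return lines


def horizontal_bands(blocks: List[Box]) -> List[Tuple[float, float]]:
    """Project blocks onto the Y axis and return the occupied y-intervals."""
    bands: List[Tuple[float, float]] = []
    for b in sorted(blocks, key=lambda b: b.y0):
        if bands and b.y0 <= bands[-1][1]:
            # Overlaps the current band: extend its lower edge.
            bands[-1] = (bands[-1][0], max(bands[-1][1], b.y1))
        else:
            bands.append((b.y0, b.y1))
    return bands


fragments = [
    Box(0, 10, 20, 20, "Hel"),
    Box(21, 10.5, 40, 20, "lo"),
    Box(100, 10, 120, 20, "World"),
    Box(0, 40, 30, 50, "Next"),
]
lines = merge_into_lines(fragments)
bands = horizontal_bands(lines)
```

Here the first two fragments merge into one line because their gap is below `x_gap`, while "World" stays separate. The same interval-merging idea could be reused for merging lines into blocks or adjacent pictures into image blocks, with thresholds tuned to the document.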