Layout analyzing method and system
A layout analysis technique, applied to the layout analysis of layout documents, that addresses the limitation of relying on a single layout analysis method.
Examples
Embodiment 1
[0145] This embodiment provides a layout analysis method which, as shown in Figure 1, includes the following process:
[0146] Obtain the logical paragraph information of the layout document, where the logical reference information of each paragraph includes character objects, dynamic area objects, and static area objects arranged in logical order, and obtain the basic primitive data of the current page as the basic primitive data to be analyzed.
[0147] Collect primitives for static area objects; collect primitives for character objects through character analysis, line analysis, segment analysis, and paragraph result screening; and collect primitives for dynamic area objects. This completes primitive collection for the basic primitive data to be analyzed.
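The three collection passes in [0147] can be pictured as a dispatch over object types. The following is a minimal sketch, not the patented implementation: the names `Kind`, `LogicalObject`, `collect_all`, and the `collectors` mapping are hypothetical, the pass order simply follows the order in which the text lists the steps, and removing a primitive from the pool once it has been collected is an assumption made for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class Kind(Enum):
    """Object types named in the text (the enum itself is hypothetical)."""
    CHARACTER = auto()
    DYNAMIC_AREA = auto()
    STATIC_AREA = auto()


@dataclass
class LogicalObject:
    kind: Kind
    name: str
    primitives: list = field(default_factory=list)  # filled during collection


def collect_all(paragraphs, pool, collectors):
    """Run one collection pass per object type and return unclaimed primitives.

    paragraphs: list of paragraphs, each a list of LogicalObject in logical order.
    pool: the basic primitives of the current page that are to be analyzed.
    collectors: maps each Kind to a function (obj, remaining) -> collected list.
    """
    remaining = list(pool)
    # Assumed order: static areas, then characters, then dynamic areas.
    for kind in (Kind.STATIC_AREA, Kind.CHARACTER, Kind.DYNAMIC_AREA):
        for paragraph in paragraphs:
            for obj in paragraph:
                if obj.kind is not kind:
                    continue
                taken = collectors[kind](obj, remaining)
                obj.primitives.extend(taken)
                # Assumption: a collected primitive is no longer available.
                remaining = [p for p in remaining if p not in taken]
    return remaining
```

Each collector is free to implement its own analysis, for example the character, line, and segment analysis mentioned for character objects; the sketch only fixes how the passes are sequenced.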
[0148] The layout analysis method of the present invention collects primitives for different types of logical reference information, adopts the method of combining logical reference information with basic graph eleme...
Embodiment 2
[0150] This embodiment provides a layout analysis method that includes the following process (see the flow charts in Figures 2 and 3):
[0151] (1) Extraction process: Obtain the logical paragraphs of one page of an existing layout document, where each paragraph includes characters, dynamic area objects, and static area objects, together with the basic primitive data of the current page obtained through the layout document engine, including basic character primitives, image primitives, and graphics primitives. Before layout analysis, all logical paragraph information of the document already exists from the earlier layout document processing, and all logical paragraphs are in logical order; this is the logical information available before layout analysis.
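One way to picture the output of the extraction process is as a small data model. This is a hedged sketch only: the class names, field names, and the `(left, top, right, bottom)` rectangle layout are assumptions introduced for illustration and do not come from the layout document engine described in the text.

```python
from dataclasses import dataclass, field
from enum import Enum


class PrimitiveKind(Enum):
    """The three basic primitive types named in the extraction step."""
    CHARACTER = "character"
    IMAGE = "image"
    GRAPHICS = "graphics"


@dataclass(frozen=True)
class Primitive:
    kind: PrimitiveKind
    bbox: tuple  # assumed layout: (left, top, right, bottom)


@dataclass
class LogicalParagraph:
    characters: list = field(default_factory=list)
    dynamic_areas: list = field(default_factory=list)
    static_areas: list = field(default_factory=list)


@dataclass
class PageExtraction:
    paragraphs: list  # LogicalParagraph items, already in logical order
    primitives: list  # Primitive items of the current page

    def primitives_of(self, kind):
        """Return the page primitives of one kind, preserving their order."""
        return [p for p in self.primitives if p.kind is kind]
```

Keeping the logical paragraphs and the page primitives side by side reflects the idea in the text that the logical information exists before layout analysis, while the primitives are what the analysis assigns to it.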
[0152] A page contains a center rectangle and multiple logical paragraphs, and the logical paragraphs are sorted according to the natural logical order of the page. The center rectangle here refers to the area where the main co...
Embodiment 3
[0167] This embodiment provides a layout analysis method, including the following process:
[0168] (1) Extraction process. Same as Embodiment 1.
[0169] (2) Collection of primitives for static area objects. Same as Embodiment 1, except that in this embodiment, when filtering all basic primitives in the page for each static area object, the collection strategy class corresponding to the logic type of the static area object is used. The specific strategies are:
[0170] ① Image collection strategy: collect only basic image primitives, and require that the enclosing rectangle of the basic image primitive intersects the target collection area and that the ratio of the intersection area to the area of the primitive's enclosing rectangle is greater than an empirical threshold.
[0171] ② Form collection strategy: collect basic primitives of characters, graphics, and images, and require the enclosing rectangle of the basic primitives...
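The image collection strategy in [0170] reduces to a rectangle overlap test. The sketch below is illustrative only: the function names, the `(kind, bbox)` tuple representation, the `(left, top, right, bottom)` rectangle layout, and the default threshold of 0.5 are all assumptions, since the text speaks only of an empirical threshold without giving a value. The form collection strategy is not sketched because its description is truncated in the source.

```python
def rect_area(rect):
    """Area of a (left, top, right, bottom) rectangle; zero if degenerate."""
    width = rect[2] - rect[0]
    height = rect[3] - rect[1]
    return width * height if width > 0 and height > 0 else 0.0


def intersection_area(a, b):
    """Area of the overlap between two rectangles; zero if they do not overlap."""
    overlap = (max(a[0], b[0]), max(a[1], b[1]), min(a[2], b[2]), min(a[3], b[3]))
    return rect_area(overlap)


def collect_images(primitives, target, threshold=0.5):
    """Keep image primitives that lie mostly inside the target collection area.

    primitives: iterable of (kind, bbox) pairs, with kind given as a string.
    target: rectangle of the target collection area.
    threshold: placeholder for the empirical threshold (assumed value).
    """
    collected = []
    for kind, bbox in primitives:
        if kind != "image":
            continue  # this strategy collects only basic image primitives
        area = rect_area(bbox)
        if area <= 0:
            continue  # skip degenerate rectangles to avoid dividing by zero
        # Ratio of the overlap to the primitive's own enclosing rectangle.
        if intersection_area(bbox, target) / area > threshold:
            collected.append((kind, bbox))
    return collected
```

Dividing by the primitive's own area rather than the target's means a small image fully inside a large target area still passes, while a large image that only clips the target does not.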