Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Layout analyzing method and system

A technology for layout analysis and analysis, which is applied in the field of layout analysis of layout documents, and can solve the problem of single layout analysis method

Active Publication Date: 2015-04-15
NEW FOUNDER HLDG DEV LLC +1
View PDF6 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For this reason, the technical problem to be solved by the present invention is that the layout analysis method in the prior art is single, thereby proposing a layout analysis method that combines logical structure information into the existing layout analysis method and effectively improves the layout document analysis results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Layout analyzing method and system
  • Layout analyzing method and system
  • Layout analyzing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0145] This embodiment provides a layout analysis method, such as figure 1 As shown, including the following process:

[0146] Obtain the logical paragraph information of the layout document, the logical reference information of each paragraph includes character objects, dynamic area objects, and static area objects arranged in a logical order, and obtain the basic graphic metadata of the current page as the basic graphic metadata to be analyzed.

[0147] Collect primitives for static area objects, collect primitives for character objects through character analysis, line analysis, segment analysis, and paragraph result screening, and collect primitives for dynamic area objects to complete the graph of the basic primitive data to be analyzed meta collection.

[0148] The layout analysis method of the present invention collects graph elements for different types of logical reference information, adopts the method of combining logical reference information with basic graph eleme...

Embodiment 2

[0150] This embodiment provides a layout analysis method, including the following process, see the flow chart figure 2 and image 3 :

[0151](1) Extraction process: Obtain the logical paragraphs of the existing one-page layout document, each paragraph includes characters, dynamic area objects, static area objects, and the basic graphic metadata of the current page obtained through the layout document engine, including basic character graphics Elements, image primitives, graphics primitives. Before the layout analysis, all the logical paragraph information of the document already exists in the early layout document processing process, and all the logical paragraphs are logically ordered, which is the logical information before the layout analysis.

[0152] A page contains a center rectangle and multiple logical paragraphs, and the logical paragraphs are sorted according to the natural logical order of the page. The center rectangle here refers to the area where the main co...

Embodiment 3

[0167] This embodiment provides a layout analysis method, including the following process:

[0168] (1) Extraction process. Same as Example 1.

[0169] (2) Collection of object primitives in the static area. Same as Embodiment 1, and in this embodiment, when filtering all basic graphic elements in the page for each static area object, according to the logic type of the static area object, use the corresponding collection strategy class to collect, the specific strategy is:

[0170] ① Image collection strategy: only collect the basic image primitives, and require the enclosing rectangle of the basic image primitives to intersect with the target collection area, and the ratio of the area of ​​the interlaced area to the area of ​​the enclosing rectangle of the basic image primitives is greater than an empirical threshold.

[0171] ② Form collection strategy: collect basic primitives of characters, graphics, and images, and require the enclosing rectangle of the basic primitives...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention provide a layout analysis method, comprising: extraction, collection of basic elements with respect to static area objects, analysis sequence determination and logical paragraph analysis, wherein the logical paragraph analysis comprises character analyzing, logical connection edge generating, line forming analyzing, paragraph forming analyzing, paragraph result filtering, basic elements collecting with respect to the dynamic area objects and basic element removing. According to the embodiments of the present invention, logical reference information and basic element data information are combined, and the logical reference information is fully used during layout analysis, such that a more accurate layout analysis result with respect to a fixed-layout document is acquired, and the layout analysis result is effectively improved.

Description

technical field [0001] The invention relates to the field of information processing and pattern recognition, in particular to a layout analysis method of a layout document. Background technique [0002] The layout document format is an electronic document format with a fixed layout rendering effect. The presentation of the layout document has nothing to do with the device. When reading, printing or printing on various devices, the layout rendering results are consistent. Format documents are mainly used in the release, dissemination and archiving of written documents. The feature of the layout document is that the layout is fixed and does not run, that is, what you see is what you get (WYSIWYG for short), so that the presentation effect of the electronic document will not be affected by the change of the software and hardware environment or the operator during the use process. And changes, in terms of layout, layout, font, font size, etc., are completely consistent with pap...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F17/211G06F17/24G06V30/413
Inventor 张军董宁王长胜
Owner NEW FOUNDER HLDG DEV LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products