Method and device for extracting document structure
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications (China)
- Current Assignee / Owner
- PEKING UNIV FOUNDER GRP CO LTD
- Publication Date
- 2013-01-02
- Estimated Expiration
- Not applicable · inactive patent
Abstract
Description
Technical field
[0001] The present invention relates to the field of digital publishing, and in particular to a method and device for extracting document structure.
Background art
[0002] In traditional publishing, the document formats of books and newspapers serve only the needs of printing. They describe content solely through visual elements such as text, graphics, image outlines, color, and position, and carry no information about the document's logical content or its internal relationships. Digital publishing, by contrast, is concerned with the logical content of documents, the associations between content items, and content granularity. Structured processing of documents is a prerequisite for reusing digital content.
[0003] At present, structured processing of document content is carried out mainly by hand. Following predefined rules, processing personnel visually identify the document content in the document that conforms to...