Method and system for identifying form in layout file
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications (China)
- Current Assignee / Owner
- NEW FOUNDER HLDG DEV LLC
- Publication Date
- 2010-07-07
Abstract
Description
Technical field
[0001] The invention belongs to the field of pattern recognition within computer information processing, and in particular relates to a method and system for recognizing forms in layout files.
Background art
[0002] In industries such as newspapers and publishing houses, once a layout has been produced with typesetting software, the articles and their related metadata must be extracted from it for further use; this is the reconstruction and indexing of article information. To reproduce the layout faithfully, indexing extracts not only the content of the article itself (such as the title, citation, subtitle, author, and body) but also information such as the position of each text block and the font size.
[0003] The Chinese patent application with the application number 200710179938.4 "An indexing method for complex layouts based on PDF" discloses ...