Form image classification method

A form image classification method in the field of instrumentation, computing, and character and pattern recognition. It addresses the difficulty of classifying forms with similar layouts by increasing the weight of distinguishing layout information and reducing the weight of randomly filled-in user information.

Active Publication Date: 2015-09-09
PEKING UNIV

AI Technical Summary

Problems solved by technology

[0017] Existing form classification techniques mainly handle forms with different layouts. For forms with identical or similar layouts, these algorithms treat them as the same type of form.
To solve the problem that similar forms are difficult to classify, the present invention proposes a simple and effective Chinese form classification method based on weighted distance. It reduces the impact of the randomness of the information filled in by users and, at the same time, amplifies the importance of distinguishing information in the form layout, in order to achieve better classification performance for Chinese forms with similar layouts.

Examples

Embodiment Construction

[0081] In order to make the above objects, features and advantages of the present invention more obvious and understandable, the present invention will be further described below through specific embodiments and accompanying drawings.

[0082] This embodiment describes the specific implementation process of the form query input method for the application scenario in which a form image is input as the query condition into the form classification system. The output is usually the text areas in the form, on which the next step (text recognition and entry of the corresponding area information) is performed. The input can be a scanned form or a form photo of good image quality, and form classification in multiple languages is supported. Preprocessing of the input image uses the Hough transform for line detection and tilt correction, among other steps, and normalizes the form image to the same scale and angle as the training form images. User U's equipment (scanner, handhel...
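The tilt-correction step mentioned in this paragraph can be illustrated with a minimal sketch. It is not the patent's implementation: the function names `estimate_skew_degrees` and `rotate_points`, the search range of ±10 degrees, and the 0.5 degree step are all assumptions chosen for illustration. The sketch runs a small Hough-style vote over near-horizontal lines on a list of dark-pixel coordinates and returns the angle whose accumulator has the strongest peak.

```python
import math


def estimate_skew_degrees(points, angle_range=10.0, angle_step=0.5):
    """Estimate the tilt of the dominant near-horizontal line (hypothetical helper).

    points: iterable of (x, y) coordinates of dark pixels, e.g. ruling lines.
    Returns the candidate angle (degrees) with the highest Hough vote.
    """
    best_angle, best_votes = 0.0, -1
    steps = int(round(2 * angle_range / angle_step)) + 1
    for i in range(steps):
        angle = -angle_range + i * angle_step
        theta = math.radians(angle)
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        votes = {}
        for x, y in points:
            # Signed distance of the point from a line through the origin
            # tilted by `angle`; collinear points share the same bin.
            rho = round(-x * sin_t + y * cos_t)
            votes[rho] = votes.get(rho, 0) + 1
        peak = max(votes.values())
        if peak > best_votes:
            best_votes, best_angle = peak, angle
    return best_angle


def rotate_points(points, degrees):
    """Rotate points about the origin by `degrees` (hypothetical helper)."""
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [(x * c - y * s, x * s + y * c) for x, y in points]


# Synthetic ruling line tilted by 3 degrees (illustrative data only).
line = [(x, round(50 + x * math.tan(math.radians(3.0)))) for x in range(200)]
skew = estimate_skew_degrees(line)
deskewed = rotate_points(line, -skew)
```

Rotating by the negative of the estimated angle brings the ruling line back to horizontal. A production pipeline would more likely rely on an image library's line detector (for example OpenCV's `HoughLines`) and would also rescale the image to the training resolution, which this sketch omits.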

Abstract

The present invention relates to a form image classification method. For the training images, forms belonging to the same type are first averaged to obtain a mean image, in which each pixel is the mean of that pixel over the training images of the type; the mean images obtained for each type constitute the mean templates. Three weights are then computed: a consistency weight, a randomness weight and a vibration weight. During form classification, the three weights and the mean templates are used for the classification calculation. Alternatively, the mean form can be replaced by a mode form, in which each pixel takes the most frequent value (the mode) at that position; the variance and the weights are then calculated with respect to the mode form. The method reduces the influence of the randomness of user-filled information and, at the same time, amplifies the importance of distinguishing information in the form layout, so that very good classification performance is obtained for Chinese forms with similar layouts.
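The idea in the abstract (per-type mean templates plus per-pixel weights used in a distance calculation) can be sketched as follows. This excerpt does not define the consistency, randomness and vibration weights, so the sketch substitutes a simpler two-factor weight as a stated assumption: between-type variance of the templates divided by within-type variance of the training images. All function names, the `eps` parameter and the toy data are hypothetical.

```python
def mean_template(images):
    """Per-pixel mean over a list of equally sized, flattened images."""
    n = len(images)
    return [sum(px) / n for px in zip(*images)]


def pixel_variance(images, template):
    """Per-pixel variance of the images around the given template."""
    n = len(images)
    return [sum((v - m) ** 2 for v in px) / n
            for px, m in zip(zip(*images), template)]


def build_model(training, eps=0.01):
    """Build mean templates and one weight per pixel.

    ASSUMPTION: weight = between-type variance / (within-type variance + eps).
    This stands in for the patent's three weights, which are not given here.
    """
    templates = {label: mean_template(imgs) for label, imgs in training.items()}
    within = mean_template(
        [pixel_variance(training[label], templates[label]) for label in templates])
    template_list = list(templates.values())
    between = pixel_variance(template_list, mean_template(template_list))
    weights = [b / (w + eps) for b, w in zip(between, within)]
    return templates, weights


def weighted_distance(image, template, weights):
    """Weighted squared Euclidean distance between an image and a template."""
    return sum(w * (x - t) ** 2 for x, t, w in zip(image, template, weights))


def classify(image, templates, weights):
    """Return the label whose template is nearest under the weighted distance."""
    return min(templates,
               key=lambda label: weighted_distance(image, templates[label], weights))


# Toy data: pixel 0 is printed layout (differs by type), pixels 1-3 are user fill.
TRAINING = {
    "A": [[1, 0, 0, 0], [1, 1, 0, 0], [1, 0, 1, 0], [1, 0, 0, 1]],
    "B": [[0, 1, 1, 1], [0, 0, 1, 1], [0, 1, 0, 1], [0, 1, 1, 0]],
}
templates, weights = build_model(TRAINING)
query = [1, 1, 1, 1]  # a type-A layout with every field filled in
label = classify(query, templates, weights)
```

In this toy case an unweighted distance (all weights equal to 1) assigns the query to type B because the filled fields dominate, while the weighted distance assigns it to type A because the layout pixel carries most of the weight. That is the effect the abstract describes, though the actual weights in the patent may behave differently.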

Description

Technical field

[0001] The invention belongs to the technical field of document classification and pattern recognition, and in particular relates to a form image classification method based on distance measurement.

Background

[0002] At present, in many businesses (such as banking, insurance and statistics), large numbers of Chinese forms are produced by printing or copying and then handed to customers to be filled in by printing or by hand. As a result, a large number of Chinese forms exist on paper, which brings many challenges and difficulties to later automated form processing. On the other hand, in order to further automate office work and to extract and mine useful information from forms, the demand for form automation is increasingly strong.

[0003] The automated processing of forms usually includes a series of processes such as scanning, reading, classification, layout analysis, identification and editing of paper forms....

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/00, G06K9/62, G06K9/46
CPC: G06V30/413, G06V10/48, G06V30/287, G06F18/2415
Inventor: 王思萌, 高良才, 王悦涵, 汤帜
Owner: PEKING UNIV