Method and device for recognizing table cells in scanned image

A technology for scanning images and table units, applied in the field of image recognition, can solve problems such as limited application occasions, large amount of calculation, slow operation speed, etc., and achieve the effect of solving table unit adhesion, improving recognition speed, and high recognition rate

Inactive Publication Date: 2010-03-24
PEKING UNIV FOUNDER GRP CO LTD +2
View PDF0 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage of this method is that it is difficult to deal with thin and slightly skewed or complex tables
[0005] The search method is to traverse along the table lines. The disadvantage of this method is that it is difficult to deal with burrs, broken lines and character adhesion.
The disadvantage of this type of method is that the recognition success rate of the table unit is very high, but the disadvantage is that the calculation amount is large and the calculation speed is slow, which limits its application occasions.
[0007] Therefore, in the current prior art, there is no automatic identification scheme for table units that can improve the recognition speed of table units in scanned images under the premise of ensuring a high recognition rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for recognizing table cells in scanned image
  • Method and device for recognizing table cells in scanned image
  • Method and device for recognizing table cells in scanned image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In practice, the present invention provides a method for identifying table units in a scanned image. The idea in practice of the present invention is to scan the image area from top to bottom and from left to right to obtain all straight line segments in the image, and then use Quick screening algorithm to filter out other content in the table, only keep long horizontal and vertical line segments, these line segments constitute the table unit, and then use these line segments to identify the structure of the table unit, by obtaining the position and size of the table unit, Thus, the table units in the scanned image are identified, specifically, the structure and position of the table units are identified through each line segment and their intersection points.

[0065] Specific embodiments of the present invention will be described below in conjunction with the accompanying drawings.

[0066] figure 1 It is a schematic diagram of the implementation process of the metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for recognizing table cells in a scanned image, comprising the following steps: acquiring horizontal line segments and vertical line segments in the scanned image of a table document; removing the horizontal line segments and the vertical segments which are less than the first threshold value in the scanned image, wherein, the first threshold value isset according to the height of the minimum character in the scanned image and the resolution ratio of the scanned image; and recognizing the table cells in the scanned image according to the remainedhorizontal line segments and vertical line segments. The invention not only has the characteristic of high recognition success rate of the traditional line detection algorithm, but also can improve the recognition speed of the table cells in the scanned image under the premise of ensuring high recognition rate.

Description

technical field [0001] The invention belongs to the technical field of image recognition, and in particular relates to a method and a device for recognizing table units in a scanned image. Background technique [0002] Tables are commonly used data carriers in documents and are widely used in various occasions. In order to facilitate the automation and electronic processing of paper forms, a fast form automatic recognition method is needed to determine the position and size of each unit in the form, the purpose of which is to facilitate the next step to obtain the contents of the form units and send them to subsequent modules Carry out OCR (Optical Character Recognition, optical character recognition), automatic form filling and other processing. [0003] Form recognition methods commonly used in the prior art include projection method, search method, line detection method and the like. [0004] The projection method is to project the table image vertically and horizontall...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/20
Inventor 亓文法李晓龙
Owner PEKING UNIV FOUNDER GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products