Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for removing character adhesion

A character and character recognition technology, which is applied in character and pattern recognition, instruments, calculations, etc., can solve the problem that the effect of debonding is not very high, and two characters are conglutinated, so as to achieve a good effect of debonding

Inactive Publication Date: 2015-05-20
PEKING UNIV +2
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage of this method is that it can only solve the problem of sticking two characters
[0004] Therefore, most of the existing character debonding methods can only deal with the situation of 2 character conglutinations, and the effect of debonding is not very high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for removing character adhesion
  • Method and system for removing character adhesion
  • Method and system for removing character adhesion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0093] The character image to be processed in this embodiment is as image 3 The digital image to be debonded shown in is a binary image. If the character image to be processed is not a binary image, it needs to be binarized first.

[0094] In the first step, the prior knowledge set of digital strings of digital images is first set. The prior knowledge set specifically includes: the maximum aspect ratio Ratio of characters max , the minimum aspect ratio Ratio min ;Characteristics: numbers are of the same height, and other numbers are of the same width except 1; layout guidelines: other numbers are arranged at equal intervals except for 1, and the character spacing is d times the character height.

[0095] The second step is to perform connected domain analysis on the character image to be processed, and determine the connected domains that need to be split in the connected domain analysis results, as follows:

[0096] Connected domain analysis is performed on the binary ima...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method and system for removing character adhesion. The method includes: setting the prior knowledge set of a to-be-processed character image, performing connected domain analysis on the to-be-processed character image, calculating character height H and character width W, and determining connected domains, which need to be separated, in a connected domain analysis result; performing separation point positioning on the connected domains which need to be separated to obtain all separation schemes, using the separation schemes to respectively separate the connected domains which need to be separated, performing OCR on the character separation result corresponding to each separation scheme to obtain the character recognition result of the image, evaluating the character separation result corresponding to each separation scheme according to prior knowledge evaluation function, and using the character recognition result with the most matched evaluation as the recognition result after adhesion removing. By the method, the problem of multi-character adhesion under the condition of unknown adhesion number is solved, and good adhesion removing effect can be achieved.

Description

technical field [0001] The invention belongs to the technical field of character processing in images, and in particular relates to a method and system for removing stickiness of characters. Background technique [0002] When recognizing the text in the area to be recognized in the image, there will be adhesion between the characters. In order to obtain a more accurate recognition result, it is necessary to remove the adhesion of the characters before sending the characters to OCR. The adhesion between characters is very complicated. On the one hand, the situation of character adhesion varies greatly, and on the other hand, the number of character adhesion is not fixed. [0003] The most commonly used degluing method in existence is the projection method. The projection method uses the place where the minimum value is projected as the segmentation point. This method will lead to segmentation errors when the glue points are relatively thick, such as when 0 and 0 are glued to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/54G06K9/20
Inventor 李平立史培培
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products