Method for splitting and identifying character strings at complex interference

A recognition method and string technology, applied in the field of image processing, can solve the problems of no obvious difference between interference and character features, failure to obtain learning effect, low string recognition rate, etc., to achieve good universality, good learning effect, The effect of good robustness and recognition effect

Inactive Publication Date: 2012-10-10
HEFEI UNIV OF TECH
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (2) Characters stick together and are not easy to distinguish;
[0007] (3) There is no obvious difference between the interference and the characteristics of the character itself
This type of method has the ability of machine learning, can automatically count the characteristics of samples, and has a high recogniti

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for splitting and identifying character strings at complex interference
  • Method for splitting and identifying character strings at complex interference
  • Method for splitting and identifying character strings at complex interference

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In this embodiment, the segmentation and recognition method of character strings under complex interference is carried out as follows:

[0041] Ⅰ. Learning stage: use the multi-instance machine learning method to learn the string under complex interference as follows;

[0042] Step 1. Obtain each package of multi-instance learning;

[0043] Automatically segment m character images containing interference into m pictures; each picture contains and only contains one complete character; m pictures are used as each package of multi-instance learning to form learning samples for multi-instance learning, Each package is stored separately; separate storage means that the same character is put into the same folder as the same class, and n folders are obtained that are consistent with the number of classes, and n is not greater than m.

[0044] In specific implementation such as figure 1 As shown, the image containing four characters and interference is divided into four chara...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for splitting and identifying character strings at complex interferences, which is characterized by comprising a learning phase and an identifying phase. The learning phase comprises the following steps of: splitting an image containing m characters into m pictures to form a multi-example learning packet, taking the same characters as a category, and classifying the packet and inputting to a base; and calculating an integrogram of the packet, extracting haar-like characteristics of the packet as an example of the packet, finding out key examples of each category by using a diversity density algorithm, and finally learning the key examples by using classified performances of an SVM (Support Vector Machine). The identifying phase comprises the following steps of: predicting the type of a new packet by using a learning result to identify the character strings. By using the method, the function of automatically identifying the character strings at the complex interferences can be realized, and the identifying speed and efficiency are higher.

Description

technical field [0001] The invention relates to the field of image processing, in particular to a segmentation and recognition technology of character strings under complex interference. Background technique [0002] After years of development, Optical Character Recognition (OCR) technology has made great progress and has been widely used in fields such as handwriting input, automatic license plate recognition, and automatic text scanning and recognition. However, the existing OCR technology is still difficult to robustly segment and recognize character strings under complex interference. Because of this, a character string subject to certain disturbances is usually used as a verification code on the network to identify whether an operation is a manual behavior or an automatic computer behavior. [0003] At present, the recognition methods for character strings are mainly divided into two categories, one is based on Euclidean space distance methods, such as template matchin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/66G06K9/34
Inventor 汪荣贵戴经成周良李想游生福查炜
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products