Character segmentation method for offline handwriting Uighur words

A word and character technology applied in the field of character segmentation, achieving accurate selection, integrity assurance, and accurate positioning.

Inactive Publication Date: 2010-06-23
XIDIAN UNIV
Cites: 0; Cited by: 23

AI Technical Summary

Problems solved by technology

[0014] In short, the character segmentation techniques described above still have many deficiencies when applied to offline handwritten Uyghur character segmentation. How to develop a robust method for offline handwritten Uyghur character segmentation has therefore become a new topic of concern for practitioners in the field.

Embodiment Construction

[0033] In order to make the object, technical solution and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings.

[0034] Referring to Figure 1, the character segmentation method of the present invention comprises the following steps:

[0035] Step 1: extract the connected features, attribution features, position features and local peak features of the offline Uighur words.

[0036] (1.1) For the input binary image, connected domain analysis is used to extract all independent fields (connected domains). Pixels in a binary image can be treated as adjacent under either 4-adjacency or 8-adjacency; the present invention adopts 8-adjacency for the connected domain analysis and obtains the set Φ of all fields. The number of foreground points in each field is used as the area attribute of the connected feature, and the position of the circumscribed rectangle of the field a...
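
The connected domain analysis in (1.1) can be illustrated with a minimal sketch. It assumes the binary image is a list of rows with 1 for foreground and 0 for background; the function name `connected_fields` and the dictionary keys `area` and `bbox` are hypothetical names chosen for this example and do not come from the patent text.

```python
from collections import deque


def connected_fields(image):
    """Extract 8-connected foreground regions ("fields") of a binary image.

    `image` is a list of rows; 1 marks foreground, 0 marks background
    (an assumption of this sketch). Each field is returned as a dict with
    its area (number of foreground points) and its circumscribed
    rectangle as (top, left, bottom, right), inclusive.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    # 8-adjacency: horizontal, vertical and diagonal neighbours.
    neighbours = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)
                  if (dy, dx) != (0, 0)]
    fields = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            area = 0
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy, dx in neighbours:
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            fields.append({"area": area, "bbox": (top, left, bottom, right)})
    return fields
```

Under 8-adjacency two pixels that touch only diagonally belong to the same field, which would not be the case under 4-adjacency. Fields are returned in raster-scan order of their first pixel.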

Abstract

The invention discloses a character segmentation method for offline handwritten Uighur words, belonging to the field of character segmentation in optical character recognition. The method is implemented in the following steps: a multi-feature analysis method is adopted to extract the connected feature, attribution feature, position feature and local peak feature of a word; the word is divided into a set of fields according to the connected feature, and all fields are divided into master fields and slave fields; several sub-field sets are obtained through field clustering according to the attribution feature; multi-feature combination-guided segmentation is carried out on each sub-field set, in which potential master-slave segmentation points are extracted according to the local peak feature and the connected feature, and the position feature is then used to decide between an independent and a combined segmentation mode; finally, the overall optimal character segmentation result is obtained by optimizing the segmentation lines according to the connected feature and the position feature. The invention segments Uighur words well, has simple, easily implemented steps with low computational complexity, and can be ported to a mobile phone platform to perform character segmentation.
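
The master/slave division and field clustering named in the abstract can be sketched as follows. The text available here does not state the actual criteria, so everything in this example is an illustrative assumption: a field counts as a slave field when its area is below a fraction (`area_ratio`, hypothetical) of the largest field, and each slave field is attributed to the master field with the largest horizontal overlap, with the nearest horizontal centre as a tie-breaker. Fields are assumed to be dicts with `area` and `bbox` = (top, left, bottom, right), which is also an assumption of this sketch.

```python
def split_master_slave(fields, area_ratio=0.2):
    """Split fields into master and slave lists.

    Assumption (not from the patent): a field is a slave field if its
    area is below `area_ratio` times the largest field area.
    """
    if not fields:
        return [], []
    threshold = area_ratio * max(f["area"] for f in fields)
    masters = [f for f in fields if f["area"] >= threshold]
    slaves = [f for f in fields if f["area"] < threshold]
    return masters, slaves


def horizontal_overlap(a, b):
    """Width in pixels shared by two inclusive bboxes (top, left, bottom, right)."""
    return max(0, min(a[3], b[3]) - max(a[1], b[1]) + 1)


def centre_x(bbox):
    """Horizontal centre of a bbox."""
    return (bbox[1] + bbox[3]) / 2


def cluster_fields(fields, area_ratio=0.2):
    """Group each slave field with one master field.

    Assumed attribution rule: largest horizontal overlap wins; ties are
    broken by the smallest distance between horizontal centres.
    """
    masters, slaves = split_master_slave(fields, area_ratio)
    clusters = [{"master": m, "slaves": []} for m in masters]
    for s in slaves:
        best = max(
            clusters,
            key=lambda cl: (
                horizontal_overlap(cl["master"]["bbox"], s["bbox"]),
                -abs(centre_x(cl["master"]["bbox"]) - centre_x(s["bbox"])),
            ),
        )
        best["slaves"].append(s)
    return clusters
```

Each resulting cluster plays the role of one sub-field set: a main stroke body together with the dots or marks attributed to it. A real implementation would replace the assumed threshold and overlap rule with the attribution feature defined in the full description.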

Description

Technical field

[0001] The invention relates to a digital image processing method, specifically a character segmentation method, which can be used for character segmentation of offline handwritten Uyghur words in optical character recognition.

Background technique

[0002] With the continuous expansion of the application field of handwritten character recognition and the improvement of the recognition ability of classifiers, character segmentation technology has become a key issue in optical character recognition research. Practice has shown that inaccurate character segmentation is one of the main causes of misrecognition, and that improving the correct recognition rate of a single character depends largely on the accuracy of character segmentation.

[0003] The Uyghur language is an important minority language in China, belonging to the Western Hunnic branch of the Turkic group of the Altaic language family. In Xinjiang alone, more t...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/20, G06K9/46
Inventors: 李静, 卢朝阳, 阿地力·依米提, 曹琎, 谭福秀
Owner: XIDIAN UNIV