Computer-based offline handwritten Uyghur word recognition method based on word segmentation

A word recognition method based on computer technology, applied in character and pattern recognition, computer components, computing, and related fields. It addresses problems such as character over-segmentation errors and the poor extensibility of whole-word recognition strategies, and it achieves a small feature dimension, an improved word recognition rate, and good topological shape and structure.

Active Publication Date: 2018-05-29
LANZHOU UNIVERSITY OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the problem that in the existing Uyghur word recognition technology, the segmentation recognition strategy is prone to character over-segmentation erro




Embodiment Construction

[0020] The present invention is a computer-based offline handwritten Uyghur word recognition method based on grapheme segmentation. For offline handwritten Uyghur words, it decomposes and recognizes words at the grapheme level (a grapheme being a part of a character). First, a grapheme library for Uyghur words is established. Word images are then over-segmented to form grapheme sequences, and different feature extractors and classifiers are designed for the graphemes in different sequences. Finally, a Bayesian network model of Uyghur words is built, and model inference fuses the grapheme recognition information with prior information on word formation to obtain the word recognition result.
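The fusion step in the last sentence can be illustrated with a small sketch. It is not the patent's Bayesian network: it assumes a much simpler naive Bayes model in which each grapheme position is scored independently, and the function name `fuse_word_posterior`, the lexicon layout, and the smoothing floor are all hypothetical choices made for illustration.

```python
import math

def fuse_word_posterior(word_priors, word_graphemes, grapheme_scores, floor=1e-9):
    """Combine per-grapheme classifier scores with word-formation priors.

    word_priors:     {word: P(word)}, assumed prior from a lexicon
    word_graphemes:  {word: [expected grapheme label at each position]}
    grapheme_scores: one {label: P(observation | label)} dict per position
    Returns {word: posterior}, normalised over the words whose grapheme
    count matches the observed sequence length.
    """
    log_post = {}
    for word, prior in word_priors.items():
        labels = word_graphemes[word]
        if len(labels) != len(grapheme_scores):
            continue  # this word cannot explain the observed sequence
        total = math.log(prior)
        for label, scores in zip(labels, grapheme_scores):
            # Unseen labels get a small floor instead of log(0).
            total += math.log(scores.get(label, floor))
        log_post[word] = total
    if not log_post:
        return {}
    # Normalise in log space for numerical stability.
    peak = max(log_post.values())
    norm = sum(math.exp(v - peak) for v in log_post.values())
    return {w: math.exp(v - peak) / norm for w, v in log_post.items()}


# Hypothetical two-word lexicon with placeholder grapheme labels.
priors = {"w1": 0.5, "w2": 0.5}
spelling = {"w1": ["a", "b"], "w2": ["a", "c"]}
observed = [{"a": 0.9}, {"b": 0.8, "c": 0.2}]
posterior = fuse_word_posterior(priors, spelling, observed)
best_word = max(posterior, key=posterior.get)
```

A full Bayesian network would also model dependencies between graphemes, connected segments, and words; the independence assumption here only shows how classifier evidence and a word prior multiply together.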

[0021] The invention is a character recognition method for offline handwritten Uyghur words. Uyghur words have a distinctive glyph structure and are written from right to left and from top to bottom. The structural rules of handwritten Uyghur words are as follows figu...



Abstract

A computer-based offline handwritten Uyghur word recognition method based on word segmentation, belonging to character processing within pattern recognition. The method includes the following steps. First, a Uyghur grapheme library is built according to Uyghur writing rules and morphological structure; it contains three types of graphemes: main bodies, additions, and dots. Second, a word image is segmented to obtain three grapheme sequences, and different feature extractors and classifiers are designed for the different graphemes. Finally, a hierarchical matching model of graphemes, connected segments, and words is built with a Bayesian network; inference over the grapheme features yields a recognition confidence for each word class, grapheme recognition information is fused with word-formation prior information, and the word recognition result is obtained. The method robustly recognizes Uyghur words that are written freely, naturally, and fluently; the set of training classes required by the algorithm is fixed, and the algorithm is highly extensible.
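As a rough illustration of the second step, the sketch below groups already-labelled segments into three sequences, one per grapheme type. The `Segment` class, its `x_right` field, and the right-to-left ordering by bounding-box edge are assumptions made for this example; the source does not specify how segments are represented, ordered, or assigned a type.

```python
from dataclasses import dataclass
from enum import Enum

class GraphemeType(Enum):
    MAIN_BODY = "main body"
    ADDITION = "addition"
    DOT = "dot"

@dataclass(frozen=True)
class Segment:
    x_right: int        # right edge of the bounding box in pixels (assumed field)
    kind: GraphemeType  # type label, assumed to be known already

def split_into_sequences(segments):
    """Return one sequence per grapheme type, ordered right to left."""
    # Larger x_right first, since the script is written right to left.
    ordered = sorted(segments, key=lambda s: s.x_right, reverse=True)
    return {t: [s for s in ordered if s.kind is t] for t in GraphemeType}


# Hypothetical segments from one word image.
segments = [
    Segment(10, GraphemeType.MAIN_BODY),
    Segment(50, GraphemeType.DOT),
    Segment(30, GraphemeType.MAIN_BODY),
    Segment(40, GraphemeType.ADDITION),
]
sequences = split_into_sequences(segments)
main_body_positions = [s.x_right for s in sequences[GraphemeType.MAIN_BODY]]
```

Keeping the three sequences separate is what allows a different feature extractor and classifier to be applied to each type.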

Description

Technical field

[0001] The invention belongs to character processing within pattern recognition, specifically the field of offline handwritten character recognition, and is used to recognize offline handwritten Uyghur word images.

Background

[0002] Uyghur belongs to the Western Hunnic branch of the Turkic group of the Altaic language family. It is the language of the Uyghur people, an important ethnic minority in China. Processing and recognizing Uyghur text benefits the development of information technology in ethnic minority areas. Modern Uyghur has 32 letters. Depending on its position in a word, each letter can be written in a front-connected, double-connected, back-connected, or independent form, giving 128 character forms in total. Uyghur script has a long history, and the deformation of handwritten characters ...


Application Information

IPC(8): G06K9/34, G06K9/46, G06K9/62
CPC: G06V30/153, G06V10/267, G06V10/44, G06F18/24155
Inventor: 许亚美, 徐志刚, 何继爱, 陈海燕, 朱宁宁
Owner: LANZHOU UNIVERSITY OF TECHNOLOGY