Computer-based offline handwritten Uyghur word recognition method based on grapheme segmentation

A word recognition and computer technology, applied in computer parts, character and pattern recognition, calculation, etc., can solve the problems of poor algorithm scalability and over-segmentation errors of the whole word recognition strategy, so as to improve the word recognition rate and feature dimension. The effect of small numbers and simple calculations

Active Publication Date: 2021-06-29
LANZHOU UNIVERSITY OF TECHNOLOGY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the problem that in the existing Uyghur word recognition technology, the segmentation recognition strategy is prone to character over-segmentation errors, and the algorithm scalability of the whole word recognition strategy is poor, and to provide a computer-based detachment method based on grapheme segmentation. Recognition method of handwritten Uyghur words

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Computer-based offline handwritten Uyghur word recognition method based on grapheme segmentation
  • Computer-based offline handwritten Uyghur word recognition method based on grapheme segmentation
  • Computer-based offline handwritten Uyghur word recognition method based on grapheme segmentation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention is a computer-based offline handwritten Uighur word recognition method based on grapheme segmentation. Aiming at offline handwritten Uyghur words, it is proposed to decompose and recognize words on the grapheme (that is, a part of a character) level, and first establish Uyghur words The grapheme library, over-segmenting word images to form grapheme sequences, and then designing different feature extraction and classifiers for graphemes in different sequences, and finally building a Bayesian network model for Uyghur words, and fusing grapheme recognition information through model reasoning And the prior information of word formation to get the word recognition result.

[0021] The invention is aimed at a character recognition method for off-line handwritten Uighur words. Uyghur words have a unique font structure and adopt a writing method from right to left and from top to bottom. The structural rules of handwritten Uyghur words are as follows figu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A computer-based off-line handwritten Uighur word recognition method based on grapheme segmentation belongs to the word processing technology of character pattern recognition. The steps are as follows: firstly, according to the Uyghur rules and morphological structure, a Uyghur word grapheme database is established, including the main body, additional and Click three types of graphemes; then, obtain three grapheme sequences by over-segmenting the word image, and design different feature extraction and classifiers for each type of grapheme; finally, construct graphemes, connected segments and words through the Bayesian network The layered matching model among them, reasoning and calculating the recognition confidence from grapheme features to word categories, and fusing grapheme recognition information and word formation prior information to obtain word recognition results. Utilizing the off-line handwritten Uyghur word recognition method of the present invention, unconstrained, natural and fluently written Uyghur words can be recognized robustly, and the training category required by the algorithm is fixed, and the algorithm has strong expansibility.

Description

technical field [0001] The invention belongs to the word processing technology of character pattern recognition in pattern recognition, specifically belongs to the field of off-line handwritten character recognition, and is used for recognizing off-line handwritten Uighur word images. Background technique [0002] The Uyghur script belongs to the West-Hungarian branch of the Turkic language family of the Altaic language family. It is the language of the Uyghur minority, an important ethnic minority in my country. The processing and recognition of Uyghur is beneficial to the development of information and technology in ethnic areas. The modern Uyghur language is composed of 32 letters. According to the position in the word, each letter can be written in the form of front connection, double connection, back connection, independent writing, etc., and has evolved into a total of 128 characters. Uighur characters have a long history, and the deformation of handwritten characters ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/34G06K9/46G06K9/62
CPCG06V30/153G06V10/267G06V10/44G06F18/24155
Inventor 许亚美徐志刚何继爱陈海燕朱宁宁
Owner LANZHOU UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products