Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Online handwritten Tibetan syllable recognition method and device

A Tibetan language and syllable technology, applied in the field of character recognition, can solve the problems that Tibetan syllables cannot be recognized efficiently, and cannot meet the writing habits and needs of Tibetan users, so as to meet the writing habits and needs and achieve the effect of efficient recognition.

Active Publication Date: 2015-11-04
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing online handwritten Tibetan syllable recognition method cannot efficiently recognize the Tibetan syllables input by users' continuous handwriting, and cannot meet the writing habits and needs of Tibetan users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Online handwritten Tibetan syllable recognition method and device
  • Online handwritten Tibetan syllable recognition method and device
  • Online handwritten Tibetan syllable recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] The present embodiment provides a kind of online handwritten Tibetan syllable recognition method, such as figure 1 As shown, the method includes:

[0027] S11. Preprocessing the dot track of the Tibetan syllables continuously handwritten by the user.

[0028] S12. Segment the preprocessed Tibetan syllables successively from the horizontal direction and the vertical direction to obtain a sub-structural block sequence of two-layer marking results.

[0029] S13. Using a segmentation hypothesis verification method based on a semi-Markov conditional random field, perform segmentation hypothesis verification on the substructure block sequence of the two-layer marking result, and obtain an optimal segmentation path and a component string identification result.

[0030] S14. Determine the handwritten Tibetan syllable category input by the user according to the optimal segmentation path and the recognition result of the component string.

[0031] The online handwritten Tibetan...

Embodiment 2

[0037] This embodiment provides an online handwritten Tibetan syllable recognition method. This embodiment uses the MRG-OHTC sample database of the Multilingual Processing Research Group of the National Engineering Research Center for Basic Software of the Institute of Software, Chinese Academy of Sciences. The database includes handwritten Tibetan syllable samples of 150 different writers. Each writer completed the writing of 827 high-frequency syllables pre-selected, including 456 two-character syllables, 309 three-character syllables, and 62 four-character syllables. indivual. Select 130 sets of (writer) samples for training, and the remaining 20 sets of samples for testing. In addition, the 150 sets of samples are labeled at the character level and syllable level with semi-supervised calibration tools.

[0038] The specific process of the online handwritten Tibetan syllable recognition method provided by the present embodiment is as follows:

[0039] (1) Point trajector...

Embodiment 3

[0075] An embodiment of the present invention provides an online handwritten Tibetan syllable recognition device, such as Figure 6 As shown, the device includes:

[0076] The preprocessing unit 11 is used to preprocess the dot track of the Tibetan syllables continuously handwritten by the user;

[0077] The over-segmentation unit 12 is used to successively over-segment the preprocessed Tibetan syllables from the horizontal direction and the vertical direction to obtain the substructure block sequence of the two-layer marking result;

[0078] The segmentation hypothesis verification unit 13 is used to adopt the segmentation hypothesis verification method based on the semi-Markov conditional random field to perform segmentation hypothesis verification on the sub-structural block sequence of the two-layer marking result, and obtain the optimal segmentation path and component string recognition result;

[0079] The determining unit 14 is configured to determine the handwritten ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an online handwritten Tibetan syllable recognition method and an online handwritten Tibetan syllable recognition device, relates to the technical field of character recognition, and is used to solve the problem that Tibetan syllables input by a user in a continuously handwriting manner cannot be recognized efficiently in the prior art. The method comprises performing preprocessing on loca of points of syllables of Tibetan input by the user in the continuous handwriting manner; performing over-segmentation on the Tibetan syllables after the preprocessing successively in the horizontal direction and the vertical direction, and the obtaining sub structure block sequences of two layers of mark results; adopting a segmentation hypothesis verification method based on a semi-Markov condition random field, performing segmentation hypothesis verification on the sub structure block sequences of the two layers of the mark results, and then obtaining an optimal segmentation path and a recognition result of a part string; and, according to the optimal segmentation path and the recognition result of the part string, determining types of syllables of the handwritten Tibetan input by the user. The online handwritten Tibetan syllable recognition method and the online handwritten Tibetan syllable recognition device are suitable for the recognition of the syllables of the Tibetan input by the user in the continuously handwritten manner.

Description

technical field [0001] The invention relates to the technical field of character recognition, in particular to an online handwritten Tibetan syllable recognition method and device. Background technique [0002] Tibetan input methods mainly include handwriting input and keyboard input. Compared with keyboard input, handwriting input is more in line with people's expression habits. It is an effective real-time tool that is easy to use by users, and it is easy to carry and easy to operate. With the advancement and wide application of mobile terminal devices such as smartphones, tablet computers, electronic whiteboards, and iPads, research on online handwritten Tibetan input (pen input) algorithms has received more and more applications and attention. It mainly focuses on Tibetan character recognition, and there is already a handwriting input method that supports Tibetan characters as input units. However, due to the particularity of the Tibetan language itself, people in Tibe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/72G06K9/62
CPCG06V10/768G06V30/293G06F18/295
Inventor 马龙龙吴健
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products