Unlock instant, AI-driven research and patent intelligence for your innovation.

Tibetan character component analysis method, Tibetan character sorting method and corresponding device

A sorting method, Tibetan technology, applied in word processing, electronic digital data processing, special data processing applications, etc., can solve problems such as inconvenient use of computer automatic sorting of Tibetan, complicated and error-prone, and imperfect sorting algorithms and models

Active Publication Date: 2019-07-23
尼玛扎西
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, because the existing sorting algorithms and models are not perfect, and are too complicated and error-prone, the existing Tibetan sorting methods are not universal or compatible, and it is not convenient for the use of computerized Tibetan automatic sorting

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tibetan character component analysis method, Tibetan character sorting method and corresponding device
  • Tibetan character component analysis method, Tibetan character sorting method and corresponding device
  • Tibetan character component analysis method, Tibetan character sorting method and corresponding device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] Such as figure 1 As shown, the embodiment of the present invention provides a method for analyzing components of Tibetan characters, including:

[0036] Step 101, acquire the Tibetan text to be analyzed.

[0037] In this embodiment, the Tibetan text acquired through step 101 may contain only one Tibetan character, or may contain multiple Tibetan characters, which is not limited here. Specifically, when the Tibetan text contains multiple Tibetan characters, the acquired Tibetan text can be firstly segmented in units of characters to obtain at least one Tibetan character; Tokens, double pendents and spaces divide the acquired Tibetan text into characters.

[0038] In particular, when a Tibetan text contains multiple Tibetan characters, it may also be a Tibetan word composed of multiple Tibetan characters. At this time, the obtained Tibetan text can be divided according to specific separators and other symbols, which will not be done here limit.

[0039] Step 102, usin...

Embodiment 2

[0706] like figure 2 As shown, the embodiment of the present invention provides a Tibetan sorting method, including:

[0707] Step 201, obtain at least two Tibetan characters to be sorted.

[0708] In this embodiment, the at least two Tibetan characters acquired through step 201 may be independent Tibetan characters, or a Tibetan text composed of multiple Tibetan characters, which is not limited here. In particular, when obtaining Tibetan texts of at least two Tibetan characters, the Tibetan texts can be segmented first, and the segmentation process is the same as figure 1 The division method of step 101 shown is similar, and will not be repeated here.

[0709]In step 202, at least two Tibetan characters to be sorted are respectively used as inputs of a preset finite state machine group.

[0710] Step 203, when the target finite state automaton in the finite state automata group determines that the spelling of the input Tibetan characters is correct, obtain the components ...

Embodiment 3

[0718] like image 3 As shown, the Tibetan sorting method provided by the embodiment of the present invention includes:

[0719] Step 301, obtaining at least two Tibetan words to be sorted.

[0720] Step 302, acquiring Tibetan characters in the at least two Tibetan words respectively.

[0721] In this embodiment, at least two Tibetan words can be segmented to obtain Tibetan characters; at least two Tibetan words can also be segmented according to signs such as specific separators to obtain Tibetan characters. repeat.

[0722] In step 303, the Tibetan characters in the at least two Tibetan words are respectively used as the input of the preset finite state automaton group.

[0723] Step 304, when the target finite state automaton in the finite state automaton group determines that the spelling of the input Tibetan character is correct, obtain the components of the Tibetan character according to the target finite state machine.

[0724] In this embodiment, the process of obt...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Tibetan character component analysis method, a Tibetan character sorting method and a corresponding device, and relates to the field of natural language processing. Invented to solve the problem that the existing Tibetan sorting methods do not have universality or compatibility, and are not convenient for the use of computer automatic sorting of Tibetan. The technical solution provided by the present invention includes: S10, obtaining the Tibetan text to be analyzed; S20, using the Tibetan characters in the Tibetan text as the input of the preset finite state automata group; S30, when the finite state automaton When the target finite state automaton in the group determines that the spelling of the Tibetan characters in the Tibetan text is correct, the components of the Tibetan characters are obtained according to the target finite state automata.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a method for analyzing components of Tibetan characters, a method for sorting Tibetan characters and a corresponding device. Background technique [0002] Like other languages, computerized Tibetan automatic sorting is also widely used in various fields of Tibetan information technology, including Tibetan dictionaries and dictionary sorting, information retrieval, text sorting, etc. Since the beginning of the research on Tibetan information technology in the early 1980s, the research work on automatic sorting of Tibetan by computer has never stopped. With the development of Tibetan information technology, Tibetan automatic sorting algorithms are generally used to sort Tibetan in the prior art. [0003] However, because the existing sorting algorithms and models are not perfect, and are too complicated and error-prone, the existing Tibetan sorting methods are not univer...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27G06F17/22
CPCG06F40/12G06F40/242G06F40/253G06F40/279G06F7/06G06F40/129G06F40/232G06F40/268G06F40/284G06F40/289
Inventor 尼玛扎西完么扎西
Owner 尼玛扎西