Uygur language word alignment method

A Uyghur word alignment technology, applied in fields such as special-purpose data processing and electronic digital data processing, that addresses gaps in minority-language support and in the technical level of existing tools.

Inactive Publication Date: 2014-07-02
新疆电力信息通信有限责任公司

AI Technical Summary

Problems solved by technology

[0004] In recent years, as informatization among ethnic minorities has developed, the construction of minority-language corpora in Xinjiang has made new progress. Most of these corpora are Uyghur, however, and there are still gaps in the level of support and technology available for other minority languages.



Embodiment Construction

[0011] A Uyghur word alignment method that: 1. realizes automatic alignment of Uyghur words, with the alignment relationship between Uyghur words and Chinese words divided into five types, namely one-to-one, one-to-many, many-to-one, many-to-many, and one-to-empty; 2. allows words that the automatic step aligned incorrectly to be aligned manually, which improves the accuracy of the system in processing Uyghur; 3. realizes the splitting and merging of Uyghur words according to the characteristics of Uyghur.
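The five alignment types can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `AlignmentType` and `classify_link`, and the representation of a link as two collections of token positions, are assumptions made for illustration only.

```python
from enum import Enum


class AlignmentType(Enum):
    """The five Uyghur-to-Chinese alignment relationships named in [0011]."""
    ONE_TO_ONE = "one-to-one"
    ONE_TO_MANY = "one-to-many"
    MANY_TO_ONE = "many-to-one"
    MANY_TO_MANY = "many-to-many"
    ONE_TO_EMPTY = "one-to-empty"


def classify_link(uyghur_idx, chinese_idx):
    """Classify one alignment link by how many tokens sit on each side.

    uyghur_idx and chinese_idx are collections of token positions; this
    representation is hypothetical, the source does not specify one.
    """
    n_ug = len(set(uyghur_idx))
    n_zh = len(set(chinese_idx))
    if n_ug == 0:
        raise ValueError("a link needs at least one Uyghur token")
    if n_zh == 0:
        # Only a single unaligned Uyghur word is covered by the five types.
        if n_ug != 1:
            raise ValueError("several Uyghur tokens aligned to nothing")
        return AlignmentType.ONE_TO_EMPTY
    if n_ug == 1:
        return AlignmentType.ONE_TO_ONE if n_zh == 1 else AlignmentType.ONE_TO_MANY
    return AlignmentType.MANY_TO_ONE if n_zh == 1 else AlignmentType.MANY_TO_MANY
```

For example, `classify_link([3], [])` yields `AlignmentType.ONE_TO_EMPTY`, the case in which a Uyghur word has no Chinese counterpart.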

[0012] As shown in figure 1, the role of the user is determined first, and the approved sentence is then obtained. Words are split and merged according to the characteristics of Uyghur words, words that the automatic alignment got wrong are aligned manually, and the alignment results are then saved while the erroneous sentences are registered.
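The workflow in [0012] might be sketched roughly as follows. Everything here is an assumption made for illustration: the role name `"annotator"`, the function names, the dictionary-based storage, and the reading that a sentence is registered as erroneous whenever a manual correction was needed. The source does not describe how words are split or merged, so the helpers simply cut or join strings at a caller-supplied position.

```python
ALLOWED_ROLES = {"annotator"}  # hypothetical role name, not from the source


def split_token(tokens, i, pos):
    """Split tokens[i] at character offset pos into two tokens."""
    word = tokens[i]
    if not 0 < pos < len(word):
        raise ValueError("split position must fall inside the word")
    return tokens[:i] + [word[:pos], word[pos:]] + tokens[i + 1:]


def merge_tokens(tokens, i):
    """Merge tokens[i] and tokens[i + 1] into a single token."""
    if not 0 <= i < len(tokens) - 1:
        raise IndexError("no following token to merge with")
    return tokens[:i] + [tokens[i] + tokens[i + 1]] + tokens[i + 2:]


def review_sentence(role, approved, sentence_id, auto_links, corrections,
                    store, error_register):
    """Check role and approval, apply manual fixes, save, register errors.

    auto_links and corrections map a Uyghur token index to a tuple of
    Chinese token indices (an assumed representation).
    """
    if role not in ALLOWED_ROLES:
        raise PermissionError(f"role {role!r} may not align sentences")
    if not approved:
        return None  # only approved sentences are processed
    final_links = dict(auto_links)
    final_links.update(corrections)  # manual alignment overrides automatic
    store[sentence_id] = final_links  # save the alignment result
    if corrections:
        error_register.append(sentence_id)  # sentence had automatic errors
    return final_links
```

As a usage illustration with a Latin-script transliteration, `split_token(["kitablar"], 0, 5)` separates the stem `kitab` from the plural suffix `lar`, and `merge_tokens` reverses the operation.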


Abstract

The invention discloses a Uyghur word alignment method. The method realizes automatic alignment of Uyghur words, with five alignment relationships between Uyghur words and Chinese words: one-to-one, one-to-many, many-to-one, many-to-many, and one-to-none. Words that were aligned incorrectly by the automatic step are aligned manually, which improves the accuracy of the system in processing Uyghur. Uyghur words are split and merged according to the characteristics of the Uyghur language. The method provides assistance for Chinese-Uyghur machine translation and for building electronic Uyghur dictionaries, and it lays a foundation for developing electronic dictionaries and machine-aided translation systems for Uzbek, Kazakh, Kyrgyz, and Turkish.

Description

Technical field

[0001] The invention relates to language information processing technology, in particular to a method for aligning Uyghur words.

Background technique

[0002] Today, with the informatization of the national economy and society, people expect faster and better information acquisition, query, and translation across languages. Various electronic dictionary products and machine translation systems have been developed in response and have been welcomed by users. In machine translation, the quality of the corpus directly affects the quality of the translation, and the Uyghur word alignment system is an auxiliary tool for machine translation and corpus construction.

[0003] In the practical application of machine translation systems and natural language processing systems, machine dictionaries and machine translation systems have become the focus of development, and the speed and quality of corpus construction are ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/28, G06F17/30
Inventor: 尼加提·纳吉米, 买合木提·买买提, 帕肉克·司地克, 马斌
Owner: 新疆电力信息通信有限责任公司