Old-Chinese bilingual corpus construction method and device with Thai language as pivot
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- KUNMING UNIV OF SCI & TECH
- Publication Date
- 2020-01-21
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to a method and device for constructing an old-Chinese bilingual corpus with Thai as a pivot, and belongs to the technical field of natural language processing. Background technique
[0002] Corpus construction is the premise of natural language processing research. Lao-Chinese bilingual corpus is an important data resource for Chinese-Lao machine translation and cross-language retrieval. Lao language is a language with scarce resources among Southeast Asian languages. Lao-Chinese bilingual parallel Resources are relatively scarce, and it is difficult to directly obtain parallel bilingual resources of Old-Chinese from the Internet.
[0003] Both Laotian and Thai belong to the Zhuang-Dai branch of the Zhuang-Dong language family of the Sino-Tibetan language family. The basic vocabulary is almost the same or similar, and there is also a great similarity in the syntactic structure. The Chinese-Thai parallel corpus is relatively easy ...