Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Phrase tree to dependency tree transformation method capable of combining Vietnamese grammatical features

A conversion method and phrase tree technology, which is applied in special data processing applications, natural language data processing, instruments, etc., can solve the problems of scarcity of Vietnamese dependency treebanks and difficulties in manual labeling of Vietnamese dependency treebanks, and save tree construction library time, save manpower, and improve accuracy

Active Publication Date: 2016-07-06
KUNMING UNIV OF SCI & TECH
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a conversion method from a phrase tree to a dependency tree that integrates Vietnamese grammatical features, so as to solve the problem that it is difficult to manually label the Vietnamese dependency treebank, and the constructed Vietnamese dependency treebank is relatively scarce. The Vietnamese dependency tree bank invented and constructed can provide strong support for upper-level applications such as syntactic analysis, machine translation, and information acquisition of Vietnamese

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phrase tree to dependency tree transformation method capable of combining Vietnamese grammatical features
  • Phrase tree to dependency tree transformation method capable of combining Vietnamese grammatical features
  • Phrase tree to dependency tree transformation method capable of combining Vietnamese grammatical features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] Embodiment 1: as Figure 1-3 Shown, a kind of conversion method that fuses the phrase tree of Vietnamese grammatical feature to dependency tree, the concrete steps of the conversion method of the phrase tree of described fusion Vietnamese grammatical feature to dependency tree are as follows:

[0032] Step1, first construct the Vietnamese phrase tree library;

[0033] Step2, using the central sub-node filter table and the dependency tagger that incorporates Vietnamese grammatical features to complete the conversion from the phrase tree in the Vietnamese phrase tree bank to the dependency tree, and obtain the first-level Vietnamese dependency tree bank;

[0034] Step3. Obtain the MSTParser model according to the corpus training of the first-level Vietnamese dependency tree bank after manual annotation, and use the MSTParser model to expand the first-level Vietnamese dependency tree bank to obtain the expanded second-level Vietnamese dependency tree bank;

[0035] Step4....

Embodiment 2

[0036] Embodiment 2: as Figure 1-3 Shown, a kind of conversion method that fuses the phrase tree of Vietnamese grammatical feature to dependency tree, the concrete steps of the conversion method of the phrase tree of described fusion Vietnamese grammatical feature to dependency tree are as follows:

[0037] Step1, first construct the Vietnamese phrase tree library;

[0038] Step2, using the central sub-node filter table and the dependency tagger that incorporates Vietnamese grammatical features to complete the conversion from the phrase tree in the Vietnamese phrase tree bank to the dependency tree, and obtain the first-level Vietnamese dependency tree bank;

[0039] Step3. Obtain the MSTParser model according to the corpus training of the first-level Vietnamese dependency tree bank after manual annotation, and use the MSTParser model to expand the first-level Vietnamese dependency tree bank to obtain the expanded second-level Vietnamese dependency tree bank;

[0040] Step4....

Embodiment 3

[0045] Embodiment 3: as Figure 1-3 Shown, a kind of conversion method that fuses the phrase tree of Vietnamese grammatical feature to dependency tree, the concrete steps of the conversion method of the phrase tree of described fusion Vietnamese grammatical feature to dependency tree are as follows:

[0046] Step1, first construct the Vietnamese phrase tree library;

[0047] Step2, using the central sub-node filter table and the dependency tagger that incorporates Vietnamese grammatical features to complete the conversion from the phrase tree in the Vietnamese phrase tree bank to the dependency tree, and obtain the first-level Vietnamese dependency tree bank;

[0048] Step3. Obtain the MSTParser model according to the corpus training of the first-level Vietnamese dependency tree bank after manual annotation, and use the MSTParser model to expand the first-level Vietnamese dependency tree bank to obtain the expanded second-level Vietnamese dependency tree bank;

[0049] Step4....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a phrase tree to dependency tree transformation method capable of combining Vietnamese grammatical features, and belongs to the technical field of natural language processing. The phrase tree to dependency tree transformation method comprises the following steps: firstly, constructing a Vietnamese phrase tree library; utilizing a center subnode filter table which combines the Vietnamese grammatical features and a dependency relationship annotator to finish the phrase tree to dependency tree transformation in the Vietnamese phrase tree library to obtain a first-level Vietnamese dependency tree library; according to the corpus of the manually annotated first-level Vietnamese dependency tree library, training to obtain a MSTParser model, utilizing the MSTParser model to carry out the expansion of the first-level Vietnamese dependency tree library to obtain an expanded second-level Vietnamese dependency tree library; and utilizing a dependency relationship corrector to correct the corpus of the expanded second-level Vietnamese dependency tree library to obtain a final three-level Vietnamese dependency tree library. The method avoids a process that the Vietnamese dependency tree library is manually collected and annotated, saves manpower and time for constructing the tree library, and obviously improves accuracy.

Description

technical field [0001] The invention relates to a conversion method from a phrase tree integrating Vietnamese grammatical features to a dependency tree, and belongs to the technical field of natural language processing. Background technique [0002] Vietnam and Yunnan are connected by mountains and rivers, and the exchanges between the two peoples have a long history. Language communication has played a very important role in the friendly exchanges, getting along and learning from each other between the two peoples. Therefore, research on Chinese-Vietnamese bilingualism has important practical significance. In the process of mutual translation between Vietnamese and Chinese, syntactic analysis of Vietnamese is a very important basic work. Syntactic analysis refers to analyzing the grammatical structure of a sentence according to a given grammar, which plays a vital role in the research of natural language processing, information extraction, and machine translation. There a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/2246G06F40/205
Inventor 郭剑毅李英余正涛线岩团毛存礼陈玮
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products