Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

MST algorithm based Vietnamese dependency tree library construction method

A construction method and treebank technology, which is applied in the field of Vietnamese dependency treebank construction, can solve the time-consuming and labor-intensive problems of manually labeling Vietnamese dependency treebanks, and achieve the effects of saving manpower, time for building treebanks, and improving accuracy

Inactive Publication Date: 2016-07-06
KUNMING UNIV OF SCI & TECH
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented technology allows us to create an artificially labeled veneer that supports both English languages (BML) and Japanese languages (JL). It simplifies manual labelings while still providing accurate results compared to existing methods like JBL or BLM.

Problems solved by technology

This patented technical solution describes how understanding English speaking patterns helps us better communicate more easily through conversations involving different languages or culturally related words like Japanese yen (English) compared against spoken word pairs used during traditional speeches such as Mandarin Chinese (Chinese).

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • MST algorithm based Vietnamese dependency tree library construction method
  • MST algorithm based Vietnamese dependency tree library construction method
  • MST algorithm based Vietnamese dependency tree library construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] Embodiment 1: as Figure 1-3 Shown, a kind of Vietnamese language dependency tree bank construction method based on MST algorithm, the concrete steps of described method are:

[0027] Step1, first construct the Vietnamese language dependency tree bank training corpus;

[0028] Step2. Use the training corpus of the Vietnamese dependency tree bank to train the MST model, and then use the MST model to train Vietnamese sentences to get the Vietnamese dependency tree bank;

[0029] Step3. Correct the obtained Vietnamese dependency treebank corpus.

Embodiment 2

[0030] Embodiment 2: as Figure 1-3 Shown, a kind of Vietnamese language dependency tree bank construction method based on MST algorithm, the concrete steps of described method are:

[0031] Step1, first construct the Vietnamese language dependency tree bank training corpus;

[0032] Step2. Use the training corpus of the Vietnamese dependency tree bank to train the MST model, and then use the MST model to train Vietnamese sentences to get the Vietnamese dependency tree bank;

[0033] Step3. Correct the obtained Vietnamese dependency treebank corpus.

[0034] In the step Step1, the specific steps of constructing the Vietnamese language dependency tree bank training corpus are:

[0035] Step1.1. First, perform word alignment processing on the Chinese-Vietnamese parallel sentence pairs, and obtain the Chinese-Vietnamese word-aligned parallel sentence pair library;

[0036] Step1.2. Perform Chinese sentence dependency syntactic analysis on the Chinese-Vietnamese word alignment ...

Embodiment 3

[0038] Embodiment 3: as Figure 1-3 Shown, a kind of Vietnamese language dependency tree bank construction method based on MST algorithm, the concrete steps of described method are:

[0039] Step1, first construct the Vietnamese language dependency tree bank training corpus;

[0040] Step2. Use the training corpus of the Vietnamese dependency tree bank to train the MST model, and then use the MST model to train Vietnamese sentences to get the Vietnamese dependency tree bank;

[0041] Step3. Correct the obtained Vietnamese dependency treebank corpus.

[0042] In the step Step1, the specific steps of constructing the Vietnamese language dependency tree bank training corpus are:

[0043] Step1.1. First, perform word alignment processing on the Chinese-Vietnamese parallel sentence pairs, and obtain the Chinese-Vietnamese word-aligned parallel sentence pair library;

[0044] Step1.2. Perform Chinese sentence dependency syntactic analysis on the Chinese-Vietnamese word alignment ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an MST algorithm based Vietnamese dependency tree library construction method and belongs to the technical field of natural language processing. The method comprises the steps of firstly constructing a Vietnamese dependency tree library training corpus base; secondly performing training by utilizing corpora of the Vietnamese dependency tree library training corpus base to obtain an MST model and then training Vietnamese sentences by utilizing the MST model to obtain a Vietnamese dependency tree library; and correcting the obtained Vietnamese dependency tree library corpus base. The Vietnamese dependency tree library constructed with the method can provide powerful support for upper-layer applications such as syntactic analysis, machine translation, information acquisition and the like of Vietnamese language; the Vietnamese dependency tree library with one hundred thousand Vietnamese sentences can be constructed; the method avoids the processes of manually collecting and marking the Vietnamese dependency tree library, reduces the labor and shortens the time for constructing the tree library; and compared with a method for constructing a Vietnamese dependency tree library by adopting a CRFParser and Chinese-Vietnamese bilingual word-alignment corpora, the method provided by the invention has the advantage that the accuracy is remarkably improved.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products