Dependency graph network-based Han-Vietnamese neural machine translation method

A machine translation and syntax-dependent technology, applied in neural learning methods, neural architecture, natural language translation, etc., can solve problems such as out-of-order translation, inability to model syntactic structure information, insufficient learning of syntactic differences, etc., and achieve translation effect improvement Effect

Active Publication Date: 2021-03-16
KUNMING UNIV OF SCI & TECH
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a Chinese-Vietnamese neural machine translation method based on a dependency graph network to solve the problem that in low-resource scenarios, due to the lack of large-scale parallel co

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dependency graph network-based Han-Vietnamese neural machine translation method
  • Dependency graph network-based Han-Vietnamese neural machine translation method
  • Dependency graph network-based Han-Vietnamese neural machine translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0039] Embodiment 1: as Figure 1-3 As shown, the Chinese-Vietnamese neural machine translation method based on the dependency graph network first crawls the Chinese-Vietnamese parallel corpus from the website, and uses the dependency syntax analysis tool to analyze the dependency syntax of the obtained Chinese-Vietnamese bilingual corpus to obtain the dependency syntax analysis of the source language. secondly, convert the acquired dependency syntax analysis tree of the source language into a dependency graph with a graph encoder, and encode it to obtain the dependency graph structure information; then send the dependency graph structure information to the coding end to fuse with the source language sequence information, and use This fusion information is sent to the decoder to guide model translation.

[0040] Specific steps are as follows:

[0041] Step1. Obtain data and perform data preprocessing: Crawl the Chinese-Vietnamese parallel corpus of the website through crawler...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a dependency graph network-based Han-Vietnamese neural machine translation method, and belongs to the technical field of natural language processing. The method comprises thefollowing steps: firstly, crawling Chinese-Vietnamese parallel corpora ar from a website, and performing dependency syntax analysis on the obtained Chinese-Vietnamese bilingual corpora through a dependency syntax analysis tool; secondly, converting the obtained source language dependency syntax parsing tree into a dependency graph by using a graph encoder, encoding the dependency graph to obtain dependency graph structure information, sending the dependency graph structure information to an encoding end to be fused with source language sequence information, and sending the fused information toa decoding end to guide a model to translate. According to the invention, the dependency relationship is converted into the dependency graph by fusing the dependency syntax information, and the graphneural network is utilized to realize global structural coding of the dependency graph, so that richer global dependency information is provided for the translation model, and certain help is provided for improving the translation effect of the Han-Van language pairs with syntax differences.

Description

technical field [0001] The invention relates to a Chinese-Vietnamese neural machine translation method based on a dependency graph network, and belongs to the technical field of natural language processing. Background technique [0002] Chinese-Vietnamese neural machine translation is a low-resource machine translation task. Due to the lack of large-scale parallel sentence pairs, the translation performance is not good. Chinese and Vietnamese belong to different language families, and their syntax is relatively different. There is a difference between the preposition and the postposition of attributives. In the comparison of Chinese and Vietnamese bilingual word alignment, Chinese attributives are generally located in front of the modified terms, such as "beautiful" as The attributive modifies "singer", and "beautiful" is used as an attributive to modify "singing voice". In the syntactic structure of Vietnamese, the attributive is generally located after the modified word, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/58G06F40/289G06F40/211G06N3/04G06N3/08
CPCG06F40/58G06F40/289G06F40/211G06N3/049G06N3/08G06N3/045Y02D10/00
Inventor 余正涛杨威亚高盛祥文永华朱俊国吴霖
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products