Translation method, device and equipment based on machine learning and storage medium

A machine learning technology, applied in natural language translation, instruments, and special data processing applications, that addresses word segmentation errors which give a sentence wrong or ambiguous semantics, damage the sentence semantics irreparably, and lead to poor translation results. It thereby improves translation accuracy.

Active Publication Date: 2020-02-18
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0005] The embodiments of the present application provide a machine learning-based translation method, device, equipment, and storage medium. They address the problem that word segmentation can divide a sentence incorrectly, which gives the sentence wrong or ambiguous semantics, causes irreparable damage to the sentence semantics, and results in poor translation.

Examples

Embodiment Construction

[0039] To make the purpose, technical solutions, and advantages of the present application clearer, the implementations of the present application are described in further detail below with reference to the accompanying drawings.

[0040] First, the terms used in the embodiments of this application are briefly introduced:

[0041] Machine translation: translating sentences in one natural language into sentences in another natural language by computer. Machine translation is usually performed by a trained machine translation model. Illustratively, the model is trained on a large number of translation corpus samples. The samples include multiple sets of correspondences between a corpus of the first natural language and a corpus of the second natural language. Each corpus of the first natural language corresponds to a corpus of the second natural language as the t...

Abstract

The invention discloses a machine learning-based translation method, device, equipment, and storage medium, and relates to the field of artificial intelligence. The method comprises: obtaining a sentence in a first language; dividing the sentence into at least two word segmentation sequences using different word segmenters; generating a word graph structure of the sentence from the at least two word segmentation sequences; calling an encoder to convert the word graph structure into an intermediate vector representation of the sentence; and calling a decoder to convert the intermediate vector representation into a sentence in a second language. Because the word graph covers the multiple possible word segmentations of the sentence, it mitigates the problem that a wrong division produced by word segmentation gives the sentence wrong or ambiguous semantics and causes irreparable damage to the sentence semantics, and so improves the translation accuracy of the machine translation model.
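The word graph step in the abstract can be illustrated with a minimal sketch. It assumes the graph is a lattice whose nodes are character boundary positions and whose edges are candidate words; the function name `build_word_lattice`, the example sentence, and its two segmentations are hypothetical and do not come from the patent text, which may construct and prune the graph differently.

```python
from collections import defaultdict


def build_word_lattice(sentence, segmentations):
    """Merge several segmentations of one sentence into a word lattice.

    Nodes are character boundary positions (0 .. len(sentence)).
    Each edge (start, end) stands for the candidate word sentence[start:end]
    and records which segmenters proposed it.
    """
    edges = defaultdict(set)  # (start, end) -> indices of proposing segmenters
    for seg_id, words in enumerate(segmentations):
        if "".join(words) != sentence:
            raise ValueError(f"segmentation {seg_id} does not spell the sentence")
        pos = 0
        for word in words:
            edges[(pos, pos + len(word))].add(seg_id)
            pos += len(word)
    # Only boundaries touched by at least one edge become nodes.
    nodes = sorted({p for edge in edges for p in edge})
    return nodes, dict(edges)


# Illustrative input: two segmentations that disagree on the first four characters.
sentence = "研究生命起源"
segmentations = [
    ["研究生", "命", "起源"],  # hypothetical output of segmenter 0
    ["研究", "生命", "起源"],  # hypothetical output of segmenter 1
]

nodes, edges = build_word_lattice(sentence, segmentations)
for (start, end), sources in sorted(edges.items()):
    print(start, end, sentence[start:end], sorted(sources))
```

Words on which the segmenters agree collapse into a single shared edge, while disagreements remain as parallel paths, so an encoder that reads the lattice sees every candidate segmentation rather than one possibly wrong choice.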

Description

Technical field

[0001] The embodiments of the present application relate to the field of artificial intelligence, and in particular to a machine learning-based translation method, device, equipment, and storage medium.

Background

[0002] Translation systems based on machine learning are currently the mainstream translation systems. A typical neural network model used in such a system includes an encoder and a decoder.

[0003] When the neural network model is used for translation, the user inputs a sentence in a first language, and word segmentation turns the sentence into a word sequence. The encoder converts the word sequence into an intermediate vector, and the decoder converts the intermediate vector into a sentence in a second language.

[0004] However, word segmentation may produce wrong divisions, resulting in wrong semantics or ambiguity in the sentence, which will cause irreparable damage to the sema...
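To make the segmentation problem in the background concrete, the sketch below runs two toy dictionary segmenters on the same sentence. Forward and backward maximum matching are used here only as stand-ins for "different word segmenters"; the vocabulary `VOCAB`, the function names, and the example sentence are assumptions for illustration and are not taken from the patent.

```python
def forward_max_match(sentence, vocab, max_len=4):
    """Greedy left-to-right segmentation: take the longest vocabulary word."""
    words, i = [], 0
    while i < len(sentence):
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            cand = sentence[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or cand in vocab:
                words.append(cand)
                i += size
                break
    return words


def backward_max_match(sentence, vocab, max_len=4):
    """Greedy right-to-left segmentation: take the longest vocabulary word."""
    words, j = [], len(sentence)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            cand = sentence[j - size:j]
            if size == 1 or cand in vocab:
                words.append(cand)
                j -= size
                break
    return words[::-1]  # words were collected from the end of the sentence


# Hypothetical toy vocabulary.
VOCAB = {"研究", "研究生", "生命", "起源"}
sentence = "研究生命起源"

print(forward_max_match(sentence, VOCAB))   # ['研究生', '命', '起源']
print(backward_max_match(sentence, VOCAB))  # ['研究', '生命', '起源']
```

The two segmenters disagree on the first four characters, and a model that receives only one of the two word sequences has no way to recover the other reading.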

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/58, G06F40/289, G06F16/36
CPC: G06F16/367
Inventors: 张祥文, 谢军
Owner: TENCENT TECH (SHENZHEN) CO LTD