Statistics-based machine translation method and apparatus, and electronic device

A machine translation technology based on translation probability, applied in the field of machine translation, that addresses problems such as semantic inconsistency between the original text and the translated text

Active Publication Date: 2017-03-08
阿里巴巴(中国)网络技术有限公司

AI Technical Summary

Problems solved by technology

[0017] This application provides a statistics-based machine translation method, device and electronic device



Embodiment Construction

[0285] In the following description, numerous specific details are set forth to provide a thorough understanding of the application. However, the present application can be implemented in many ways other than those described here, and those skilled in the art can make similar generalizations without departing from the spirit of the present application. Therefore, the present application is not limited to the specific implementations disclosed below.

[0286] This application provides a statistics-based machine translation method, device and electronic device; a method, device and electronic device for calculating semantic similarity; and a word quantization method, device and electronic device. Each is described in detail in the following examples.

[0287] The core idea of the statistics-based machine translation method provided by this application is: when building a machine translation model, go deep into the semantic level of natural la...


Abstract

The present invention discloses a statistics-based machine translation method, apparatus, and electronic device; a semantic similarity calculation method, apparatus, and electronic device; and a word quantization method, apparatus, and electronic device. The statistics-based machine translation method comprises: generating, from the features of each candidate translation that affect its translation probability and a pre-generated translation probability prediction model, the translation probability of the sentence to be translated into each candidate translation, wherein the features that affect the translation probability comprise at least the semantic similarity between the sentence to be translated and the candidate translation; and selecting a preset number of top-ranked candidate translations by translation probability as the translation of the sentence to be translated. Because the method takes the semantic level of natural language into account when the machine translation model is constructed, it avoids semantic deviation between the translation and the source text and thereby improves translation quality.
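The two steps named in the abstract (score each candidate with a prediction model over features that include semantic similarity, then keep the top-ranked candidates) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent text: the `Candidate` class, the use of cosine similarity over sentence vectors as the semantic similarity feature, the linear scoring with a softmax standing in for the "translation probability prediction model", and all function and parameter names.

```python
import math
from dataclasses import dataclass


@dataclass
class Candidate:
    text: str
    vector: list[float]          # hypothetical sentence vector of the candidate
    other_features: list[float]  # any further features (assumed, e.g. model scores)


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def translation_probabilities(source_vector, candidates, weights):
    """Assumed stand-in for the prediction model: weighted feature sum + softmax."""
    if not candidates:
        return []
    scores = []
    for c in candidates:
        # The semantic similarity feature comes first, then the other features.
        features = [cosine_similarity(source_vector, c.vector)] + c.other_features
        scores.append(sum(w * f for w, f in zip(weights, features)))
    # Numerically stable softmax so the scores form a probability distribution.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_top(source_vector, candidates, weights, n=1):
    """Return the n (candidate, probability) pairs with the highest probability."""
    probs = translation_probabilities(source_vector, candidates, weights)
    ranked = sorted(zip(candidates, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:n]
```

In this sketch, two candidates with identical non-semantic features are ranked purely by how close their vectors are to the source sentence vector, which is the effect the abstract attributes to the semantic similarity feature. How the vectors and the weights are obtained is outside the scope of the sketch.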

Description

Technical field

[0001] The present application relates to the technical field of machine translation, in particular to a statistics-based machine translation method, device and electronic device. The present application also relates to a method, device and electronic device for calculating semantic similarity, and a word quantization method, device and electronic device.

Background

[0002] Machine translation refers to the use of computers to automatically translate text in one natural language (the source language) into text in another natural language (the target language). The software that carries out this process is called a machine translation system. With the development and spread of computers and the Internet, cultural exchange between peoples has become more frequent, and the problem of language barriers has once again become prominent in the new era. People are in more urgent need of machine translation than ever be...


Application Information

IPC(8): G06F17/28, G06N3/02
CPC: G06F40/44, G06F40/55, G06F40/205, G06F40/30
Inventor: 曾晓东, 骆卫华, 林锋
Owner: 阿里巴巴(中国)网络技术有限公司