Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text translation model training method and device and storage medium

A translation model and text technology, applied in natural language translation, instrumentation, computing, etc., can solve the problems of poor semantic representation ability, smaller target word representation boundary, and sample representation collapse, etc., to improve performance and enhance semantic expression. Ability, the effect of optimizing semantic space representation

Pending Publication Date: 2022-06-03
ALIBABA (CHINA) CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, due to the existence of low-frequency words in the machine translation corpus, these words rarely appear at the output of the model during model training, and their representations will be optimized to the opposite direction of most high-frequency words, which will lead to representation space The medium sample representation collapses into a narrower cone, making the boundary between different target word representations smaller and the semantic representation ability worse
The problem of semantic collapse will seriously affect the representation ability of the semantic space of the Transformer model, thereby affecting the effect of machine translation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text translation model training method and device and storage medium
  • Text translation model training method and device and storage medium
  • Text translation model training method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of this application.

[0061] The terms "first", "second" and the like in the description, claims and the above-mentioned drawings of the embodiments of the present application are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that data so used may be interchanged under appropriat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a training method and device for a text translation model and a storage medium, and the training method comprises the steps: receiving a text training sample set containing multiple pairs of natural language texts from a client, carrying out the comparison learning based on the word level, and combining the word frequency information of the natural language texts, the model parameters of the text translation model are optimized, a final text translation model is obtained through multiple rounds of training until the loss function of the text translation model converges, and the text translation model is used for translating one natural language text into another natural language text. Since the training process can optimize the model parameters of the text translation model based on the word frequency information, semantic space representation of words with different word frequencies is optimized, the semantic expression ability of the model to the input text is enhanced, and the performance of the machine translation model is improved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular, to a training method, device and storage medium for a text translation model. Background technique [0002] In machine translation tasks, the Transformer model is a model that uses an attention mechanism to improve the speed of model training, and is currently the most commonly used deep learning model. Usually, the Transformer model includes an encoding module and a decoding module. The input text is first passed through the encoding module to encode the text, and then the encoded data is sent to the decoding module for decoding, and the translated text is obtained after decoding. [0003] However, due to the existence of low-frequency words in the machine translation corpus, these words rarely appear at the output of the model during model training, and their representation will be optimized to push the opposite direction of the representation of mos...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/58G06F40/284G06F40/30
CPCG06F40/58G06F40/284G06F40/30
Inventor 张通杨宝嵩任星彰刘大一恒张海波谢军
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products