Unlock instant, AI-driven research and patent intelligence for your innovation.

Model training and data processing method and device, electronic equipment and storage medium

A technology for training data and model training, applied in the computer field, can solve problems such as large labor costs and consumption, and achieve easy-to-obtain results

Pending Publication Date: 2021-12-07
ALIBABA GRP HLDG LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In order to obtain better translation results based on machine translation, relevant personnel usually post-edit the machine-translated translation, but this method will consume a lot of labor costs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training and data processing method and device, electronic equipment and storage medium
  • Model training and data processing method and device, electronic equipment and storage medium
  • Model training and data processing method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] Hereinafter, exemplary embodiments of the present disclosure will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily implement them. Also, for clarity, parts not related to describing the exemplary embodiments are omitted in the drawings.

[0069] In the present disclosure, it should be understood that terms such as "comprising" or "having" are intended to indicate the presence of features, numbers, steps, acts, components, parts or combinations thereof disclosed in the specification, and are not intended to exclude one or a plurality of other features, numbers, steps, acts, parts, parts or combinations thereof exist or are added.

[0070] In addition, it should be noted that, in the case of no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other. The present disclosure will be described in detail below with reference to the accompanying drawin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a model training and data processing method and device, electronic equipment and a storage medium, and the method comprises the steps: obtaining training data which comprises a first original text, a first translation and a first post-translation reference translation; training an edit-after-translation model by using the training data; using a pre-trained language model as an encoder of the edit-after-translation model, wherein the initial parameters of the encoder are parameters of the pre-trained language model; enabling the training data to enter a decoder of the edit-after-translation model through the encoder; and adjusting the parameters of the edit-after-translation model according to the output of the decoder. The semantic knowledge between the original text and the translated text corresponding to the original text is learned by using a large amount of pre-training data, and the semantic knowledge is migrated to the training process of the editing-after-translation model, so that the editing-after-translation model has higher robustness, and meanwhile, the problem that the acquisition cost of triples such as training data is relatively high is solved.

Description

technical field [0001] The present disclosure relates to the field of computer technology, and in particular to a model training and data processing method, device, electronic equipment, and storage medium. Background technique [0002] Machine translation refers to the technology of using computer programs to translate sentences from one natural language (source language) to another natural language (target language). The currently more commonly used neural network architecture Transformer is an attention-based encoder-decoder model. The main idea is to encode the sentence to be translated (hereinafter collectively referred to as the original text) into a vector representation through an encoder, and then use a decoder to decode the vector representation of the original text and translate it into its corresponding translation ( Hereinafter collectively referred to as translations. [0003] In order to obtain better translation results based on machine translation, relevan...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/166G06F40/289G06K9/62G06N3/04G06N3/08
CPCG06F40/58G06F40/166G06F40/289G06N3/08G06N3/045G06F18/214
Inventor 汪嘉怿赵宇张昱琪骆卫华施杨斌
Owner ALIBABA GRP HLDG LTD