Language modeling system structure searching method for translation tasks

A structure search method applied in the field of language modeling. It addresses the inconsistency between training and decoding in structure search, which leaves the decoded structure with non-optimal performance, by increasing the consistency between the search process and decoding. This raises the likelihood that the optimal structure lies within the search space and improves the effectiveness of the search.

Inactive Publication Date: 2021-07-13
沈阳雅译网络技术有限公司
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0006] In prior-art structure search for natural language processing tasks, training and decoding are inconsistent, so the decoded structure does not achieve optimal performance. The technical problem to be solved by the present invention is to provide a language modeling system structure search method for translation tasks that increases the consistency between the search process and decoding.

Embodiment Construction

[0038] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0039] The invention improves on conventional recurrent neural network structure search for natural language processing tasks. By adding topology learning, it improves the accuracy of the search algorithm and alleviates the inconsistency between training and decoding that the variable inputs of the differentiable structure search method cause. The method models the search over the topology and the search over the operations in the meta-structure separately, and keeps the two coupled through joint learning. This improves search accuracy while preserving search efficiency, and improves the performance of the searched structure without increasing the number of model parameters.
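The separate modeling of topology and operations described above can be illustrated with a small sketch. It is not the patent's implementation: the candidate operation set `OPS`, the scalar hidden states, and the function names `softmax` and `mixed_node` are all assumptions made for illustration. Each incoming edge of a node receives one topology weight, and each edge additionally carries its own set of operation weights; both sets are normalized with a softmax and combined into the relaxed output of the node.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidate operations (assumed here, not listed in the source).
OPS = {
    "identity": lambda x: x,
    "tanh": math.tanh,
    "relu": lambda x: max(0.0, x),
    "sigmoid": lambda x: 1.0 / (1.0 + math.exp(-x)),
}

def mixed_node(prev_states, topo_logits, op_logits):
    """Relaxed output of one node in the meta-structure.

    prev_states: scalar states of the predecessor nodes
    topo_logits: one structure parameter per incoming edge (topology)
    op_logits:   per edge, one structure parameter per operation in OPS
    """
    edge_weights = softmax(topo_logits)          # normalize over edges
    out = 0.0
    for h, w_edge, logits in zip(prev_states, edge_weights, op_logits):
        op_weights = softmax(logits)             # normalize over operations
        edge_out = sum(w * op(h) for w, op in zip(op_weights, OPS.values()))
        out += w_edge * edge_out
    return out
```

Because the topology weights and the operation weights are separate parameters that appear in the same forward computation, a gradient-based optimizer can update both jointly, which is one way to read the "joint learning" mentioned in the text.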

[0040] In order to solve the problems of the technologies described above, the technical solution adopted in the present invention is:

[00...

Abstract

The invention discloses a language modeling system structure search method for translation tasks. The method comprises the following steps: obtaining training data from the Internet and processing it, then modeling and training a network structure representation space; normalizing the structure parameter values of the meta-structure topology and operations during training; optimizing the structure parameters and model parameters of the model, tuning the network structure and target parameters; deriving a discretized final structure from the weight differences among the topologies and operations obtained after tuning, where the search result comprises the topology of the meta-structure and the operations used between its nodes; and unrolling the searched meta-structure recurrently, using the connection mode between meta-structures, to obtain a complete model, then tuning the parameters of that model again on the training data and training until convergence. The method greatly increases the likelihood that the optimal model structure falls within the representation space of the search, thereby improving the effectiveness of the network structure search method.
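The discretization step in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the rule "keep the `k` incoming edges with the largest topology weight, then keep the highest-weighted operation on each" is a common convention in differentiable structure search and is assumed here, as are the function name `discretize` and its parameters.

```python
def discretize(topo_logits, op_logits, op_names, k=1):
    """Turn tuned structure parameters for one node into a discrete choice.

    topo_logits: one tuned structure parameter per incoming edge
    op_logits:   per edge, one tuned structure parameter per operation
    op_names:    operation names, in the same order as each op_logits row
    k:           number of incoming edges to keep (assumed hyperparameter)

    Softmax is monotonic, so ranking the raw parameters gives the same
    result as ranking the normalized weights.
    """
    ranked = sorted(range(len(topo_logits)),
                    key=lambda i: topo_logits[i], reverse=True)
    kept = sorted(ranked[:k])                    # topology: surviving edges
    result = []
    for i in kept:
        best = max(range(len(op_names)), key=lambda j: op_logits[i][j])
        result.append((i, op_names[best]))       # operation on that edge
    return result

# Example with made-up parameter values for a node with three predecessors.
choice = discretize(
    topo_logits=[0.1, 2.0, -1.0],
    op_logits=[[0.0, 1.0, 0.0], [3.0, 0.0, 0.0], [0.0, 0.0, 5.0]],
    op_names=["identity", "tanh", "relu"],
    k=2,
)
```

The returned list pairs each retained predecessor index with the operation chosen for that edge, matching the abstract's description of a search result that comprises both the topology and the operations used between nodes.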

Description

Technical field

[0001] The invention relates to language modeling technology, in particular to a language modeling system structure search method for translation tasks.

Background

[0002] As with many deep learning-based systems, one of the core issues in neural network-based natural language processing is designing the structure of the neural network. This is especially true for translation, a difficult natural language processing task: the network structure of a neural machine translation system is often very complex, and designing it requires considerable skill and engineering experience. Although researchers continue to propose new network structures to improve model performance, there is still no complete solution for exploring network structures more systematically. The traditional approach finds a better-performing network structure by repeatedly trying new networks. There are two problems with this approach: one is ...

Application Information

Patent Type & Authority: Applications (China)
IPC (8): G06F40/58, G06F16/335, G06F30/27, G06F40/205, G06N20/00
CPC: G06F40/58, G06F40/205, G06F16/335, G06F30/27, G06N20/00
Inventor: 杜权
Owner: 沈阳雅译网络技术有限公司