Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Model parameter importance degree evaluation method and device and electronic equipment

A technology of model parameters and importance, applied in the computer field, can solve problems such as inability to understand model parameters, unfavorably improve model performance, increase difficulty in understanding model training mechanism, etc., to achieve the effect of reducing calculation amount and improving model performance

Inactive Publication Date: 2020-08-21
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current NMT model is trained as a black box, and it is impossible to understand the role of each model parameter in the loss function during the training process. This increases the difficulty of understanding the model training mechanism, which is not conducive to further targeted Improve model performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model parameter importance degree evaluation method and device and electronic equipment
  • Model parameter importance degree evaluation method and device and electronic equipment
  • Model parameter importance degree evaluation method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present application, and are not construed as limiting the present application.

[0034] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be under...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a model parameter importance degree evaluation method and device and electronic equipment, and relates to the technical field of computers. The method comprises the steps of training a neural machine translation NMT model based on a training data set, and obtaining parameter value variation of each model parameter before and after each training; sampling the training data set to obtain a sampling data set; determining the gradient of the loss function of the NMT model corresponding to each time of training relative to each model parameter based on the sampling data set; and determining the importance degree of each model parameter based on the gradient and parameter value variation corresponding to each training of each model parameter. According tothe technical scheme, the gradient of each model parameter is determined based on the sampling data set, so that the data calculation amount is reduced; based on the gradient of each model parameterand the parameter value variation before and after training, the importance degree of each model parameter is determined, and the contribution of each model parameter in the loss function convergenceprocess can be clarified.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular, the present application relates to a method, device and electronic equipment for evaluating the importance of model parameters. Background technique [0002] Since the neural machine translation (Neural Machine Translation, NMT) model was proposed, it has rapidly become the focus of translation research. Not only can the NMT model produce impressive translation results, but it is also inherently advantageous in its translation model structure. Compared with traditional statistical machine translation, its model can model language model, translation model and alignment model in a unified way instead of a pipelined way, which can reduce the side effects of error accumulation. [0003] As a complex neural network, the NMT model can have up to 108 million model parameters in the entire network, and all model parameters must be iteratively trained until convergence. Ho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/58G06N3/04
CPCG06F40/58G06N3/045
Inventor 朱聪慧刘乐茂李冠林史树明
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products