Unlock instant, AI-driven research and patent intelligence for your innovation.

Model training method

A model training and model technology, applied in the field of model training, can solve problems such as huge computing and storage resources, and achieve the effect of reducing data computing and storage

Pending Publication Date: 2022-04-29
NANJING UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This application provides a model training method to solve the problem of requiring huge computing and storage resources in the model training process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method
  • Model training method
  • Model training method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the purpose, technical scheme and advantages of the application clearer, the technical scheme of the application will be clearly and completely described below in combination with the specific embodiments of the application and the corresponding accompanying drawings. Obviously, the described embodiments are only part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the application, all other embodiments obtained by ordinary technicians in the art without creative work belong to the scope of protection of the application. The technical solutions provided by the embodiments of the application are described in detail below in combination with the accompanying drawings.

[0042] It should be noted that the brief description of terms in this application is only for the convenience of understanding the embodiments described below, rather than intended to limit the embodiments of this application. Unless other...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a model training method. The method comprises the following steps: defining a new quantitative linear layer; all elements in the multi-dimensional input tensor of the quantization linear layer are quantized into a PINT format, all elements in the to-be-calculated tensor of the quantization linear layer are quantized into a PINT data format, and matrix multiplication calculation is performed on the quantized multi-dimensional input tensor and the to-be-calculated tensor to obtain a fixed point result; performing inverse quantization on a fixed point result into a floating point number and spreading the floating point number to a subsequent network layer; and replacing an original linear layer in the model with a quantitative linear layer, and training the model based on a floating-point number and a PINT data format. According to the method, the quantization linear layer based on the PINT data format is developed, the PINT data format with low bit and high representation capability is applied to model training, and the quantization linear layer is used for replacing the linear layer used in the model, so that the requirements in the aspects of data calculation, storage and the like are effectively reduced under the condition that the accuracy change of the trained model is very small.

Description

technical field [0001] The application relates to the technical field of natural language processing, in particular to a model training method. Background technology [0002] In recent years, Bert and other models based on transformer network have performed well in natural language processing and other fields. Transformer is a classic model applied to NLP (natural language processing) proposed by Google's team in 2017. Transformer model uses self attention mechanism and does not adopt the sequential structure of RNN (recurrent neural network), so that the model can be trained in parallel and have the global information of samples. The popular models such as Bert (bidirectional encoder representations from transformers) are also based on the transformer implementation. [0003] Take the Bert model in the model as an example. The full name of Bert is bidirectional encoder representations from transformers, which is a pre trained language representation model. Bert model uses the en...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04G06K9/62G06F16/332G06F40/295G06F16/35
CPCG06N3/084G06F16/3329G06F40/295G06F16/35G06N3/045G06F18/2415
Inventor 王中风邵海阔鲁金铭魏敬和
Owner NANJING UNIV