Deep learning model training method, device and system based on mixed precision

A technology of deep learning and model training, applied in the field of model training, to achieve high data processing capabilities, reduce training costs, and ensure accuracy and effectiveness

Pending Publication Date: 2019-08-23
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a mixed precision-based deep learning model training method, device, electronic equipment, system, and storage medium to solve the problem of how to improve the efficiency of searching for scaling coefficients in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning model training method, device and system based on mixed precision
  • Deep learning model training method, device and system based on mixed precision
  • Deep learning model training method, device and system based on mixed precision

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0071] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention. In the case of no conflict, the embodiments and the features in the embodiments of the present invention can be combined arbitrarily with each other. Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0072] The t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep learning model training method, device and system based on mixed precision, and the method comprises the steps: carrying out the data processing of sample data in a sample data set based on a deep learning model obtained through the last training, and obtaining a set number of first weight gradient data with the data precision being the first data precision; determining the data precision as a scaling coefficient of the second data precision according to the set number of first weight gradient data and the second data precision, the first data precision being higher than the second data precision; based on the sample data set and the scaling coefficient, training the deep learning model to update the weight parameters of the deep learning model, obtaining thedeep learning model trained this time, and the scaling coefficient is used for amplifying the loss value with the data precision being the second data precision in the process of training the deep learning model, so that the training efficiency and the training precision are improved.

Description

technical field [0001] The present invention relates to the technical field of model training, in particular to a mixed precision-based deep learning model training method, device and system. Background technique [0002] Deep learning models have been widely used in various fields, such as robotics, speech recognition, image recognition, and natural language processing. In practical applications, before the deep learning model is put into application, it is necessary to use a large amount of sample data to train the deep learning model. How to improve the training efficiency of the deep learning model and reduce the training cost is particularly important. At present, major hardware chip manufacturers have launched AI acceleration chips with super computing power in the range of low-precision values. Taking the V100 chip as an example, the computing power of its half-precision data processing unit is 10 times that of a single-precision data processing unit. In order to tak...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/08G06N3/045
Inventor 裴建国侯金龙刘伟黄俊洲
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products