Convergence method of high-dimensional depth learning model and device

A model and model error technology, which is applied in the field of convergence methods and devices for high-dimensional deep learning models, and can solve problems such as blanks

Active Publication Date: 2018-11-23
HUAWEI TECH CO LTD
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In large-scale machine learning problems, the method of optimizing the accuracy growth efficiency of the model's solution and escaping the saddle point by adjusting the gradient and the amount of random estimation noise in the iterative process is relatively blank.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convergence method of high-dimensional depth learning model and device
  • Convergence method of high-dimensional depth learning model and device
  • Convergence method of high-dimensional depth learning model and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions of the embodiments of the present invention will be further described in detail below through the accompanying drawings and embodiments.

[0043] Specific embodiments of the present invention provide a method and device for convergence of a high-dimensional deep learning model. The model is trained by adopting the method of stochastic gradient descent, and the batch number of the next unit iteration is adjusted according to the accuracy growth efficiency and the convergence state of the solution of the model in the process of model training. Therefore, the efficiency of the model training is improved, and the escape of the saddle point is accelerated.

[0044] The following describes the convergence method of the high-dimensional deep learning model in the specific embodiment of the present invention by using a specific method.

[0045] figure 1 It is a flow chart of a method for convergence of a high-dimensional deep learning model provided by a speci...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a convergence method of a high-dimensional depth learning model and a device. The method includes the steps of performing one unit iteration on a model according to a first position of the error surface to determine a solution of the model at a second position of the error surface; determining a gradient and a curvature of the second position relative to the error surface according to one unit iteration, and determining an accuracy increasing efficiency and a model error of the solution of the model according to the first position and the second position; determining whether the second position is a saddle point or a high noise point of the error surface based on the gradient, the curvature, the accuracy increasing efficiency and the model error; and when the second position of the error surface is a saddle point or a high noise point, adjusting the batch number of the next unit iteration. Embodiments of the present invention determine the batch number of the model at the next batch iteration based on the saddle point or high noise. Therefore, it is possible to optimize the accuracy increasing efficiency and the escape saddle point of the model by adjusting the gradient and stochastic noise estimation in the iteration process.

Description

Technical field [0001] The present invention relates to the technical field, in particular to a method and device for convergence of a high-dimensional deep learning model. Background technique [0002] With the vigorous development of big data in all walks of life, many applications in the field of artificial intelligence appear in our lives through deep learning methods. Deep learning simulates the working principle of the human brain by constructing deep neural networks. This kind of deep neural network mechanism has made breakthrough progress in speech recognition, image recognition, natural language processing and other fields in recent years. [0003] The amount of parameters of deep neural networks is very large, which can reach tens of millions to hundreds of millions. For deep learning model training, including the use of small batch stochastic gradient descent (MBGD, Mini-batch Gradient Descent) method, so as to find the optimal solution of the model. [0004] The advant...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08
Inventor 庄雨铮郑荣福魏建生
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products