A Calculation Method of Adaptive Learning Rate

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of adaptive learning rate and calculation method, applied in the field of deep learning, can solve problems such as fluctuation, discount of chess power, failure to reach the optimal value, etc., and achieve the effect of reducing training time and increasing chess power

Active Publication Date: 2022-06-07

TONGJI UNIV

View PDF14 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Generally, in the initial stage of network training, it is often better to set a larger learning rate, because the weight of the training model is far away from the optimal value, and a larger learning rate can quickly approach the optimal value; and in the later stage of training, because it is already close to the optimal value value, at this time, it is better to use a smaller learning rate, and a larger learning rate will easily lead to fluctuations around the real optimal value, that is, it cannot reach the optimal value

The inferior training model weights generated by the training data will cause Go AI to have many blind spots when playing the game, and its chess strength will be greatly reduced, and the non-adaptive learning rate will consume a lot of training time for the training model weights, and cause GPU and other hardware. super energy loss

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0042] This embodiment provides an adaptive learning rate calculation method, such as figure 1 shown, including the following steps:

[0043] Step S1: Based on the initial batch number and the initial optimal network parameter combination, the second batch number is obtained;

[0044] Step S2: Obtain the current optimal network parameter combination based on the number of secondary batches;

[0045] Step S3: Obtain the current number of batches based on the current optimal network parameter combination, the current loss function and the amount of training data;

[0046] Step S4: Obtain the current learning rate based on the current batch number;

[0047] Step S5: The current batch number replaces the second batch number, and steps S2-S5 are repeated until the training model weights converge.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention relates to an adaptive learning rate calculation method, comprising the following steps: step S1: based on the initial batch number and the combination of initial optimal network parameters, obtain the number of secondary batches; step S2: based on the number of secondary batches, obtain The current optimal network parameter combination; Step S3: Based on the current optimal network parameter combination, the current loss function and the amount of training data, obtain the current batch number; Step S4: Based on the current batch number, obtain the current learning rate; step S5: the current batch number replaces the secondary batch number, and repeats steps S2-step S5 until the training model weights converge. Compared with the existing technology, it avoids generating low-quality training model weights, ensures that the generated training model weights are optimal, increases chess power when playing Go, and reduces training time for training model weights and superpower consumption of GPU and other hardware.

Description

technical field [0001] The invention relates to the field of deep learning, in particular to an adaptive learning rate calculation method. Background technique [0002] Artificial Intelligence (AI) has been vigorously developed along with many applications in human reality scenarios, and the progress of artificial intelligence in Go has also achieved great results. The weight of the training model is the key basis for the performance of the Go AI in the game, and it is the chess information generated by the Go AI through the residual network training. [0003] Learning rate is a classic hyperparameter of neural network, and it is also one of the problems that plague neural network training, because the parameters cannot be learned by conventional methods. The learning rate of training model weight training in the residual network of many Go AIs today is a fixed value, and the network training cannot be based on the environmental needs of Go AI (weight initialization, networ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06N20/00G06N3/08

CPCG06N20/00G06N3/08

Inventor 杨恺张春炯

Owner TONGJI UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A Calculation Method of Adaptive Learning Rate

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology