Image recognition model rapid training method and system based on many-core processor

A many-core processor, image recognition technology, applied in neural learning methods, character and pattern recognition, biological neural network models, etc., can solve the problems of synchronization waiting for data parallel acceleration ratio is not ideal, gradient aging, etc., to solve the problem of gradient aging , the effect of large acceleration, the effect of reducing training costs

Pending Publication Date: 2021-06-04
UNIV OF SCI & TECH OF CHINA
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

SSGD uses the parameter information of all nodes, so the synchronous waiting brought by slow nodes makes the acceleration ratio of data parallelism unsatisfactory
ASGD achieves greater parallelism by reducing the dependencies between nodes, but due to its inherent randomness, it causes a large amount of gradient aging in the network during training, resulting in it taking a long time to reach the same convergence point

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image recognition model rapid training method and system based on many-core processor
  • Image recognition model rapid training method and system based on many-core processor
  • Image recognition model rapid training method and system based on many-core processor

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0049] The preferred embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0050] like figure 1 Shown, a kind of fast training method of image recognition model based on many-core processors, including many-core processors, the fast training method of image recognition model comprises the following steps:

[0051] S01: Build an image recognition model;

[0052] S02: Obtain image data for training, use an improved asynchronous stochastic gradient descent algorithm to train image recognition model parameters, and obtain a trained image recognition model;

[0053] The method for training image recognition model parameters using the improved asynchronous stochastic gradient descent algorithm includes:

[0054] S21: Perform calculation and learning through computing nodes, perform training on the allocated image data, and update corresponding parameters;

[0055] S22: The parameter server receives the image recognit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an image recognition model rapid training method based on a many-core processor, and the method employs the many-core processor, and the image recognition model rapid training method comprises: constructing an image recognition model, and acquiring image data for training, and training the image recognition model parameters by using an improved asynchronous stochastic gradient descent algorithm to obtain a trained image recognition model. The method for training the image recognition model parameters by using the improved asynchronous stochastic gradient descent algorithm comprises: carrying out calculation and learning through calculation nodes, carrying out training on distributed image data, and updating corresponding parameters; receiving, by the parameter server, image recognition model parameters sent by the computing node; and tracking and calculating the updating delay of each calculation node, and if the updating delay is smaller than a threshold value, updating the model parameters; otherwise, discarding the model parameters. A better acceleration effect can be obtained on a many-core processor platform, so that the training speed of the image recognition model is higher, and the training cost is greatly reduced.

Description

technical field [0001] The invention belongs to the technical field of image recognition model training, and in particular relates to a fast training method and system for an image recognition model based on many-core processors. Background technique [0002] In recent years, deep neural network (DNN) has been widely used in many fields due to its excellent algorithm performance, especially in image recognition. DNNs have made great progress because it can perform feature extraction and data fitting by increasing the depth of the model. However, the excellent performance of DNNs comes with a major obstacle—huge computational cost. With the increase of training data scale and model complexity, the training cost of DNNs is getting higher and higher, which will become the bottleneck of image recognition model training. [0003] To shorten the training time of image recognition models, designing a parallel DNN algorithm based on various high-performance computing platforms bec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/00G06N3/063G06N3/08
CPCG06N3/063G06N3/08G06V10/94G06F18/214
Inventor 王明贵许冬毛赛
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products