Production line parallel training node weight distribution method based on version difference
A node weight and pipeline technology, which is applied in the field of pipeline parallel training node weight distribution, can solve the problem of low precision and achieve the effect of improving model precision, ensuring effectiveness and improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0059] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.
[0060] Aiming at the staleness of node weights in pipeline parallel training and the low accuracy of existing node weight prediction methods, based on asynchronous pipeline parallel training, a more accurate weight prediction method is used to calculate the difference value of weight versions and improve the accuracy of node weight prediction , to further improve the model accuracy, achieve good node weight update, and further ensure the effectiveness of model training.
[0061] figure 1 (a) Differ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


