Distributed training method and device for deep learning model, equipment and storage medium
A deep learning and training method technology, applied in the field of model training, can solve the problem of not considering the node network communication speed, and achieve the effect of avoiding the node network communication speed from being too slow
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.
[0037] It should be noted that Kubernetes can manage large-scale distributed clusters, but the network environment of the cluster is very complex. For example, the physical distance between the nodes in the cluster is very far, or there are multiple gateways between adjacent nodes, or the access bandwidth of the nodes is inconsistent, etc. Practical problems, these problems will lead to inconsistencies in the network communication speed between nodes, and the networ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


