A model training method and apparatus
By constructing a directed acyclic graph to optimize the deployment of sub-models and sub-data in the cluster, the problem of data interaction time in distributed training is solved, thereby improving the efficiency and speed of model training.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HUAWEI TECH CO LTD
- Filing Date
- 2024-12-31
- Publication Date
- 2026-06-30
AI Technical Summary
In distributed model training, the amount of data interaction between different domains affects the model training speed. How to optimize the allocation of sub-models and sub-data to reduce the data interaction time has become an urgent problem to be solved.
By constructing a directed acyclic graph through control nodes, and based on data transmission bandwidth and model dependencies, the deployment locations of sub-models and sub-data in multiple clusters are determined, and data transmission paths are optimized to reduce the amount of interactive data.
It improves the efficiency of model training, shortens data transmission time and computation latency, and optimizes the model training process.
Smart Images

Figure CN122310101A_ABST