Data processing method and device, electronic equipment, storage medium and program product
By dividing the input matrix into submatrices and setting up computation and communication thread blocks in the thread block network, the problem of serial execution of computation and communication in multi-GPU parallel inference is solved, and the synchronization of computation and communication is achieved, thereby improving GPU resource utilization and inference performance.
CN121900974BActive Publication Date: 2026-06-23INSPUR (SHANDONG) COMPUTER TECH CO LTD
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- INSPUR (SHANDONG) COMPUTER TECH CO LTD
- Filing Date
- 2026-03-17
- Publication Date
- 2026-06-23
Smart Images

Figure CN121900974B_ABST
Abstract
The present disclosure provides a data processing method and device, electronic equipment, storage medium and program product, relating to the technical field of large model, the method comprises: dividing an input matrix into a plurality of sub-matrices, determining the number and position index of each sub-matrix based on a grouping interleaving method; performing corresponding sub-matrix operation based on at least one computing thread block in a thread block network, and transmitting the operation result of the corresponding sub-matrix based on at least one communication thread block corresponding to the at least one computing thread block; storing the operation result of each sub-matrix in the cache based on the number sequence corresponding to each sub-matrix, determining the output matrix based on the number and position index corresponding to each sub-matrix and the operation result of the sub-matrix stored in the cache in sequence; in this way, the communication thread block is arranged between the computing thread blocks, so that the computing thread blocks can perform sub-matrix operation synchronously in the process of communication, improving the utilization rate of graphics processor resources.
Need to check novelty before this filing date? Find Prior Art