Array element processing method and apparatus, storage medium, and electronic device
By dividing array elements into data sub-blocks for parallel processing and utilizing GPU parallel computing and atomic operations, the problem of low efficiency in finding non-zero elements in sparse data structures is solved, thereby improving the efficiency and computation speed of sparse data processing.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- INSPUR SUZHOU INTELLIGENT TECH CO LTD
- Filing Date
- 2024-12-20
- Publication Date
- 2026-06-12
AI Technical Summary
In large-scale sparse data processing, existing technologies are inefficient at finding non-zero elements in sparse data structures, resulting in inefficient array element processing.
By dividing the target input array into multiple data sub-blocks, multiple threads are executed in parallel to find the position index of non-zero elements, and the parallel computing power and atomic operation management of the GPU are used to record the position index of non-zero elements into the target output array.
It improves the efficiency of finding non-zero elements in large-scale arrays and the management of atomic operations, thereby increasing the computation speed and resource utilization during model training.
Smart Images

Figure CN119760180B_ABST