A CSR storage format-based spmv implementation method and device and medium

By dividing the SpMV operation into three tasks and managing the ready state of the task blocks, the problem of low memory access efficiency in the prior art is solved, and more efficient GPU memory access and parallelism are achieved.

CN116431967BActive Publication Date: 2026-06-23SOUTH CHINA UNIV OF TECH

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
SOUTH CHINA UNIV OF TECH
Filing Date
2023-03-23
Publication Date
2026-06-23

Smart Images

  • Figure CN116431967B_ABST
    Figure CN116431967B_ABST
Patent Text Reader

Abstract

The application discloses a CSR storage format-based SpMV implementation method and device and a medium, wherein the method comprises the following steps: dividing the SpMV into three segment operations, the three segment operations being a value operation, a product operation and a summation operation respectively, and each segment operation corresponding to a task; dividing each task into a plurality of task blocks; dividing all threads into a plurality of vectors, one vector comprising vector_size threads, and the vector being an execution unit of the task block; designing two launch queues and one counter to record the ready state of the task block; each vector acquiring and executing the task block in the ready state, and updating the ready state of the task block after execution; and when all the task blocks are executed, the SpMV operation is completed. The application divides the value and the product into task blocks of a fixed size, and improves the data utilization rate of GPU memory access. The application can be widely applied to the technical field of high-performance numerical calculation.
Need to check novelty before this filing date? Find Prior Art