Mixing precision SpMV optimization system and method applied to computing equipment
A computing equipment and precision technology, applied in the field of mixed-precision SpMV optimization systems, can solve problems affecting memory-intensive applications, mixed-precision algorithm concerns, etc., achieve high theoretical significance and practical application value, and reduce memory access overhead.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0041] Such as figure 1 As shown, the mixed-precision SpMV optimization system applied to computing equipment includes:
[0042] An acquisition module, a first processing module, and a second processing module.
[0043] The obtaining module is used to obtain the input data of the computing device;
[0044] The first processing module is used to divide the sparse matrix into sub-matrices of different precisions based on the precision of floating-point numbers of non-zero elements in the sparse matrix;
[0045] The second processing module is used to calculate the multiplication of the sub-matrix and the vector with different precisions to obtain a mixed-precision SpMV calculation result;
[0046] Specifically, the division of the sparse matrix by the first processing module is performed only once before the mixed-precision SpMV calculation.
[0047] Specifically, the floating point number includes: a sign, an exponent and a mantissa.
[0048] Specifically, the sub-matrices ...
Embodiment 2
[0055] Such as figure 2 As shown, the mixed-precision SpMV optimization method applied to computing equipment includes the following steps:
[0056] Obtain a computing device input sparse matrix;
[0057] Divide the sparse matrix into sub-matrices of different precision based on the precision of the floating-point numbers of the non-zero elements in the sparse matrix;
[0058] The multiplication of the sub-matrix and the vector with different precisions is calculated to obtain a mixed-precision SpMV calculation result.
Embodiment 3
[0060] (1) Matrix lossless division
[0061] The key of the present invention is to represent the sparse matrix with mixed precision based on the precision of non-zero elements, so as to reduce its memory access overhead and calculation intensity in SpMV calculation. Mixed-precision representation divides a sparse matrix into up to 3 matrices (half-precision, single-precision, and double-precision submatrices) according to the actual floating-point precision of the nonzero elements. Storing more non-zero elements with low precision means less memory usage and data transfer between CPU memory and GPU global memory, as well as memory access transactions on the GPU side. In addition, low-precision calculations are expected to achieve higher performance than high-precision calculations. Therefore, different storage and computation precision may help to improve the performance of SpMV computation.
[0062] The representation of floating-point numbers in memory is divided into thr...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


