Method and system for improving parallel computing efficiency related to sparse matrix

A sparse matrix and parallel computing technology, applied in the field of data processing, can solve problems such as time-consuming sparse matrix calculations, long overall model training time, and irregular distribution of non-zero values ​​in sparse matrices

Active Publication Date: 2020-06-05
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
View PDF9 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the size of the sparse matrix is ​​large, the calculation of sparse matrix multiplication will take more time, which will cause the overall model training time to be too long
Most of the existing sparse matrix multiplication schemes are based on single thread, which cannot take full advantage of the advantages of multi-core CPUs
If the multi-threaded parallel computing scheme is adopted, due to the irregular distribution of non-zero values ​​in the sparse matrix, the conventional evenly divided data area scheme will cause overlapping parts of the data processed by each thread, which needs to be synchronized through locking operations to improve operating efficiency. will be lower

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for improving parallel computing efficiency related to sparse matrix
  • Method and system for improving parallel computing efficiency related to sparse matrix
  • Method and system for improving parallel computing efficiency related to sparse matrix

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] In order to more clearly illustrate the technical solutions of the embodiments of the present specification, the following briefly introduces the drawings that need to be used in the description of the embodiments. Apparently, the accompanying drawings in the following description are only some examples or embodiments of this specification, and those skilled in the art can also apply this specification to other similar scenarios. Unless otherwise apparent from context or otherwise indicated, like reference numerals in the figures represent like structures or operations.

[0019] It should be understood that "system", "device", "unit" and / or "module" as used herein is a method for distinguishing different components, elements, components, parts or assemblies of different levels. However, the words may be replaced by other expressions if other words can achieve the same purpose.

[0020] As indicated in the specification and claims, the terms "a", "an", "an" and / or "the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a method and system for improving parallel computing efficiency related to a sparse matrix. The method comprises the steps: acquiring a sparse matrix, whereinthe sparse matrix is composed of a plurality of non-zero values and a plurality of coordinates corresponding to the non-zero values, and the coordinates represent the positions of the non-zero valuesin the sparse matrix, the coordinates comprise row coordinates, and the row coordinates represent the row number of the non-zero value in the sparse matrix; dividing the plurality of non-zero valuesinto a plurality of first-class data regions based on the number of the plurality of non-zero values and the number of the calculation threads; traversing the plurality of first-class data regions, and dividing the non-zero values in the same row of the sparse matrix in two adjacent first-class data regions into the same data region to generate a plurality of second-class data regions; distributing the plurality of second-class data regions to a plurality of computing threads; and enabling the plurality of computing threads to execute the computing task in parallel.

Description

technical field [0001] This description relates to the field of data processing, in particular to a method and system for improving the efficiency of parallel computing involving sparse matrices. Background technique [0002] Sparse matrices are widely used in various machine learning models. For example, the core logic of the graph neural network model (GraphNeural Network, GNN) - the aggregation of information, can be realized through sparse matrix multiplication. When the size of the sparse matrix is ​​large, the calculation of sparse matrix multiplication will take more time, which will cause the overall model training time to be too long. Most of the existing sparse matrix multiplication schemes are based on single thread, which cannot take full advantage of the advantages of multi-core CPUs. If the multi-threaded parallel computing scheme is adopted, due to the irregular distribution of non-zero values ​​in the sparse matrix, the conventional evenly divided data area ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/30G06F9/38
CPCG06F9/3885G06F9/30007
Inventor 葛志邦张大龙胡志洋黄鑫宋宪政王琳
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products