Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method optimizing sparse matrix vector multiplication to improve incompressible pipe flow simulation efficiency

A technology of sparse matrix and compressed tubes, which is applied in the direction of machine execution devices and concurrent instruction execution, etc. It can solve problems such as the large influence of the distribution characteristics of non-zero elements in the matrix, the need to improve the simulation efficiency of incompressible tube flow, and poor spatial and temporal locality. , to achieve the effects of improving acceleration efficiency, improving data locality and cache hit rate, and reducing influencing factors

Active Publication Date: 2014-08-13
HANGZHOU DIANZI UNIV
View PDF3 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] These methods are greatly affected by the distribution characteristics of the non-zero elements of the matrix and are not general
For matrices that are generally sparse but have many dense sub-matrix characteristics locally, the sparse matrix-vector multiplication operation based on the traditional sparse matrix storage format has poor spatial and temporal locality, and the optimization for this will also introduce additional overhead, which is incompressible The simulation efficiency of pipe flow still needs to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method optimizing sparse matrix vector multiplication to improve incompressible pipe flow simulation efficiency

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention will be further described in detail below in conjunction with the accompanying drawings and implementation methods.

[0019] Such as figure 1 Shown, the inventive method comprises the following steps:

[0020] Step 1. Read the equation system and use the QCSR storage structure to store the sparse matrix. First, read in the non-zero elements of the sparse matrix sequentially according to the row-major order, then divide the matrix into rows on average according to the block method, and reorder and store the elements in each partition block according to the column-major order, within the current partition block interval , taking the column of the first non-zero element that is not in the block interval as the starting column, divide the matrix evenly by column according to the block method, and divide it recursively. Finally, when the sub-matrix block size is smaller than the set block size, the recursive matrix division ends, and the leaf node bl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method optimizing sparse matrix vector multiplication to improve incompressible pipe flow simulation efficiency. The method uses a QCST storage structure to combine with the advantages of a quadtree structure and a CSR storage structure to operate recursion decomposition and rearrangement to a sparse matrix to realize the storage of the sparse matrix, so that the sparse matrix vector multiplication operating process has the universality to the matrix form, particularly is suitable for the matrix with the whole being sparse and the local part being provided with a plurality of dense sub-matrixes. The method realizes the sparse matrix vector multiplication based on the QCSR storage structure through four strategies of thread mapping optimization, data storage optimization, data transmission optimization and data reusing optimization in a CPU / GPU (central processing unit / graphics processing unit) heterogeneous parallel system. The method has the advantages that the data locality and the cache hit rate in the sparse matrix vector multiplication value calculating process are improved, and the better calculating acceleration and the whole acceleration effect are obtained, so that the incompressible pipe flow simulation efficiency is improved.

Description

technical field [0001] The invention belongs to numerical simulation of fluid mechanics, in particular to a method for optimizing sparse matrix-vector multiplication to improve the efficiency of incompressible pipe flow simulation. Background technique [0002] The problem of incompressible pipe flow is an important research object of fluid mechanics, and the research results have been widely used in scientific research and industrial technology in related fields such as plasma physics, magnetohydrodynamics and astrophysics. With the development of computer science and technology, the incompressible pipe flow problem that can only be solved through field tests can be solved by computer numerical simulation methods. Simulation of incompressible pipe flow requires solving a large system of linear equations, namely Ax=b form( A as a matrix, x, b is a vector), while solving the matrix of the system of equations A It is often a sparse matrix, where the number of non-zero ele...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/38
Inventor 张纪林万健刘恩益毛洁任永坚任祖杰殷昱煜蒋从锋周丽
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products