Heterogeneous parallel computing method for row updating of sparse matrix LU factorization

A heterogeneous parallel computing technique for sparse matrices, applied in the field of sparse matrix LU decomposition. It addresses the low operating efficiency of the row update stage and aims to improve overall efficiency while avoiding conflicts and dependencies in the solution space.

Active Publication Date: 2019-01-04
SHANDONG COMP SCI CENTNAT SUPERCOMP CENT IN JINAN

AI Technical Summary

Problems solved by technology

In the row update stage, SuperLU performs a large number of unit lower triangular solves. When these solves call the BLAS library normally provided on TaihuLight, their efficiency is low, much lower than the operating efficiency of the plain main core, which results in a computational bottleneck.



Examples


Embodiment

[0041] A heterogeneous parallel computing method for the row update of sparse matrix LU decomposition comprises a master core part and a slave core part. As shown in Figure 2, the master core part of the row update is implemented as follows:

[0042] A1) In the row update phase, obtain the row matrix block currently to be processed, analyze each column vector of the block, record each vector's starting position in memory and its size, and generate an index array. The starting position corresponds to the storage location of the column vector's non-zero elements, and the size corresponds to the number of those non-zero elements. Each index number of the index array corresponds to one column vector of the row matrix block: index number 1 corresponds to the first column vector, index number 2 to the second column vector, and so on;
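Step A1 can be illustrated with a minimal Python sketch. It assumes the row matrix block is held in a compressed-column layout with a pointer array `col_ptr`; that layout, the function name `build_index_array`, and the sample data are hypothetical and are not taken from the patent text. The patent numbers index entries from 1, while the Python list below is 0-based.

```python
from typing import List, Tuple


def build_index_array(col_ptr: List[int]) -> List[Tuple[int, int]]:
    """Return (start, size) for every column vector of a row matrix block.

    Assumption: col_ptr is a compressed-column pointer array, so the
    non-zero elements of column j occupy positions
    col_ptr[j] .. col_ptr[j + 1] - 1 of the value array.
    """
    index_array = []
    for j in range(len(col_ptr) - 1):
        start = col_ptr[j]                   # where the vector's non-zeros begin
        size = col_ptr[j + 1] - col_ptr[j]   # number of non-zeros in the vector
        index_array.append((start, size))
    return index_array


# Hypothetical block with three column vectors holding 2, 0 and 3 non-zeros.
col_ptr = [0, 2, 2, 5]
index_array = build_index_array(col_ptr)
print(index_array)
```

An index array of this kind lets each worker locate its vectors without rescanning the block, which is presumably why the analysis is done once on the master core.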

[0043] The purpose of analyzing each column vector of the row matri...


Abstract

The invention relates to a heterogeneous parallel computing method for the row update of sparse matrix LU decomposition. Based on the SuperLU algorithm and the master-slave heterogeneous architecture of TaihuLight, the SuperLU solver offloads the large-scale computing tasks of the row update stage of matrix decomposition to the slave cores, and uses their efficient computing and data communication capability to improve the handling of very large sparse matrices and the overall performance of the solve. Tasks are divided according to the matrix data: a matrix space is first created for each slave core, and each slave core is responsible for solving several matrix rows. When solving the unit lower triangular linear systems, each row of data is independent of the others, which avoids conflicts and dependencies in the solution space and lets the systems be solved smoothly.
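The data-partitioned solve described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: Python threads stand in for the slave cores, the matrix is stored densely, and the names `solve_unit_lower`, `partition`, and `row_update` are hypothetical. It is not the patent's implementation and does not use any Sunway-specific API.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def solve_unit_lower(L: List[List[float]], b: List[float]) -> List[float]:
    """Forward substitution for L x = b, where L has a unit diagonal."""
    x = list(b)
    for i in range(len(b)):
        for k in range(i):
            x[i] -= L[i][k] * x[k]
    return x


def partition(num_vectors: int, num_workers: int) -> List[range]:
    """Split indices 0..num_vectors-1 into contiguous chunks, one per worker."""
    base, extra = divmod(num_vectors, num_workers)
    chunks, start = [], 0
    for w in range(num_workers):
        size = base + (1 if w < extra else 0)
        chunks.append(range(start, start + size))
        start += size
    return chunks


def row_update(L, vectors, num_workers=4):
    """Solve L x = v for every vector; each worker writes only its own slots."""
    result = [None] * len(vectors)

    def work(chunk):
        for j in chunk:
            result[j] = solve_unit_lower(L, vectors[j])

    # Threads are an assumption standing in for slave cores.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        list(pool.map(work, partition(len(vectors), num_workers)))
    return result


# Hypothetical 3x3 unit lower triangular matrix and three independent vectors.
L = [[1.0, 0.0, 0.0],
     [2.0, 1.0, 0.0],
     [1.0, 3.0, 1.0]]
vectors = [[1.0, 4.0, 10.0], [0.0, 1.0, 3.0], [2.0, 4.0, 2.0]]
solutions = row_update(L, vectors, num_workers=2)
print(solutions)
```

Because every vector is solved against the same triangular matrix and written to its own output slot, the workers never touch each other's data, which mirrors the conflict-free property the abstract claims.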

Description

Technical field

[0001] The invention relates to a heterogeneous parallel computing method for the row update of sparse matrix LU decomposition, and belongs to the technical field of sparse matrix LU decomposition.

Background

[0002] In recent years, China has made significant progress in fields closely related to electromagnetism. The numerical calculation of electromagnetic fields plays an increasingly important role because of its efficiency, flexibility, and convenience. Among numerical analysis methods for electromagnetic fields, the method of moments offers high theoretical precision. It converts the electromagnetic field operator equation to be solved into a matrix equation. Owing to this precision, when dealing with electromagnetic problems of complex, electrically large systems such as airborne phased array The huge complex number sparse matrix pre...


Application Information

IPC(8): G06F17/16
CPC: G06F17/16
Inventor: 张赞军, 田敏, 曾云辉, 潘景山, 杨美红
Owner: SHANDONG COMP SCI CENTNAT SUPERCOMP CENT IN JINAN