Quickening method utilizing cooperative work of CPU and GPU to solve triangular linear equation set

The method concerns solving linear equations through cooperative CPU and GPU work and falls under complex mathematical operations. It addresses two problems: CPU-only implementations cannot use the GPU as an accelerator and fail to meet performance requirements, while GPU-only implementations leave CPU resources unused. The method achieves good load balancing between the two processors.

Status: Inactive
Publication Date: 2010-06-23
NAT UNIV OF DEFENSE TECH
0 cites, 36 cited by

AI Technical Summary

Problems solved by technology

At present, there is no public literature on accelerating the solution of triangular linear equations through cooperation between the CPU and GPU.
[0004] Existing acceleration methods for solving triangular linear equations all target a single platform. They are either implemented on the CPU alone, where the GPU cannot be used as an accelerator and performance requirements are not met, or implemented only on NVIDIA GPUs, where CPU resources go unused. Neither approach is suited to acceleration on heterogeneous CPU+GPU platforms.

Examples

Embodiment Construction

[0026] Figure 1 is the general flowchart of the present invention.

[0027] Step 1), invert the matrix $A$ on the CPU to obtain $A^{-1}$;

[0028] Step 2), according to the data division ratio $k$, split the matrix $B$ by columns into two parts $B_1$ and $B_2$, to be processed on the CPU and the GPU respectively, that is, $B = [B_1, B_2]$;

[0029] Step 3), transfer $A^{-1}$ and $B_2$ from the CPU to the GPU;

[0030] Step 4), start the computing tasks on the CPU and the GPU at the same time, computing $X_1 = A^{-1} \times B_1$ and $X_2 = A^{-1} \times B_2$ respectively;

[0031] Step 5), send the result $X_2$ computed on the GPU back to the CPU;

[0032] Step 6), merge $X_1$ and $X_2$ by columns on the CPU into one matrix $X$, that is, $X = [X_1, X_2]$, and output the solution $X$ of the triangular linear equation system.
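
The six steps above can be sketched in a few lines of pure Python. This is only an illustration under stated assumptions, not the patented implementation: a second worker thread stands in for the GPU, the host-to-device transfers of steps 3 and 5 are reduced to comments, A is assumed to be lower triangular with op(A) = A and α = 1, and k is assumed to be the fraction of columns assigned to the CPU. The function names (`invert_lower_triangular`, `split_columns`, `solve_cooperative`) are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def invert_lower_triangular(A):
    """Invert a non-singular lower triangular matrix column by column."""
    n = len(A)
    inv = [[0.0] * n for _ in range(n)]
    for j in range(n):  # column j of the inverse solves A * x = e_j
        inv[j][j] = 1.0 / A[j][j]
        for i in range(j + 1, n):
            s = sum(A[i][p] * inv[p][j] for p in range(j, i))
            inv[i][j] = -s / A[i][i]
    return inv


def matmul(P, Q):
    """Plain dense matrix product P * Q on lists of rows."""
    cols = len(Q[0])
    return [[sum(P[i][p] * Q[p][j] for p in range(len(Q)))
             for j in range(cols)] for i in range(len(P))]


def split_columns(B, k):
    """Split B by columns; the first round(k * ncols) columns form B1 (assumed CPU share)."""
    n1 = round(k * len(B[0]))
    return [row[:n1] for row in B], [row[n1:] for row in B]


def solve_cooperative(A, B, k):
    A_inv = invert_lower_triangular(A)           # step 1: inversion on the CPU
    B1, B2 = split_columns(B, k)                 # step 2: column split by ratio k
    # step 3: A_inv and B2 would be copied to the GPU here (omitted in this sketch)
    with ThreadPoolExecutor(max_workers=2) as pool:
        f1 = pool.submit(matmul, A_inv, B1)      # step 4: CPU share
        f2 = pool.submit(matmul, A_inv, B2)      # step 4: stand-in for the GPU share
        X1, X2 = f1.result(), f2.result()        # step 5: X2 returned to the CPU
    return [r1 + r2 for r1, r2 in zip(X1, X2)]   # step 6: merge by columns


if __name__ == "__main__":
    A = [[2.0, 0.0], [1.0, 4.0]]
    B = [[2.0, 4.0, 6.0], [9.0, 6.0, 11.0]]
    print(solve_cooperative(A, B, 1 / 3))
```

On a real heterogeneous platform the second task would be a GPU matrix product issued asynchronously, so that the CPU product overlaps with it; the thread here only mimics that overlap structurally.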

Abstract

The invention discloses an acceleration method that uses cooperative work of a CPU and a GPU to solve a triangular linear equation set. Its aim is to let a CPU-based solver for triangular linear equation sets be accelerated on a heterogeneous CPU+GPU platform. The technical scheme is as follows: first, the CPU performs matrix inversion to obtain the inverse $A^{-1}$ of the triangular matrix $A$; second, the matrix $B$ is divided into two matrices $B_1$ and $B_2$; third, the two products $A^{-1} \times B_1$ and $A^{-1} \times B_2$ are computed in parallel on the CPU and the GPU so that the load is balanced between them, giving the results $X_1$ and $X_2$ respectively; finally, $X_2$ is returned to the CPU, and $X_1$ and $X_2$ are merged into one matrix $X$ for output. The method overlaps CPU and GPU computation, achieves good load balance, and accelerates the solution of the triangular linear equation set.
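
The column split is valid because multiplication by the inverse acts on each column of B independently. The short statement below spells this out and adds one possible load-balance condition. The symbols n_1, n_2 (column counts of the two parts) and r_c, r_g (sustained CPU and GPU rates on this product) are assumptions introduced for illustration; the condition also ignores transfer time and is not taken from the patent text.

```latex
X = A^{-1} B = A^{-1} [B_1, B_2] = [A^{-1} B_1,\; A^{-1} B_2] = [X_1, X_2]

% Hypothetical balance condition: both devices finish at about the same time
\frac{n_1}{r_c} \approx \frac{n_2}{r_g}
\quad\Longrightarrow\quad
\frac{n_1}{n_1 + n_2} \approx \frac{r_c}{r_c + r_g}
```

Under these assumptions, a faster GPU relative to the CPU means a smaller share of the columns should stay on the CPU.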

Description

Technical field

[0001] The invention relates to a method for accelerating the solution of triangular linear equations, in particular to a method that accelerates the solution through cooperation between the CPU and the GPU.

Background

[0002] Triangular linear equations are widely used in many scientific fields and take the form $\mathrm{op}(A) \times X = \alpha \times B$ or $X \times \mathrm{op}(A) = \alpha \times B$, where $A$ is an upper or lower triangular matrix, $\mathrm{op}(A)$ is either $A$ or $A^T$, $X$ and $B$ are matrices, and $\alpha$ is a constant. Solving a triangular linear equation system means finding the matrix $X$ given the matrices $A$ and $B$ and the coefficient $\alpha$. The existing CPU-based implementation solves for $X$ with a triple loop: the outermost loop runs over the columns of $X$, the middle loop runs over the rows of $X$, and each element of $X$ is computed inside these two loops. In order to calculate the i-th row and the j-th column element of ...
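
A minimal sketch of such a triple-loop CPU baseline follows, for the left-sided case op(A) × X = α × B with op(A) = A and A lower triangular. The source text is truncated before it describes the innermost loop, so the forward-substitution body here is an assumption based on the standard algorithm, and the function name `trsm_lower_left` is hypothetical.

```python
def trsm_lower_left(A, B, alpha=1.0):
    """Solve A * X = alpha * B for X, with A lower triangular and non-singular."""
    n = len(A)       # A is n x n
    m = len(B[0])    # B and X are n x m
    X = [[0.0] * m for _ in range(n)]
    for j in range(m):              # outermost loop: columns of X
        for i in range(n):          # middle loop: rows of X
            s = alpha * B[i][j]
            for p in range(i):      # innermost loop (assumed): rows already solved
                s -= A[i][p] * X[p][j]
            X[i][j] = s / A[i][i]
    return X


if __name__ == "__main__":
    A = [[2.0, 0.0], [1.0, 4.0]]
    B = [[2.0, 4.0, 6.0], [9.0, 6.0, 11.0]]
    print(trsm_lower_left(A, B))
```

Each column of X depends only on the matching column of B, which is why the columns can be distributed across devices without any communication between them during the solve.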

Application Information

IPC(8): G06F17/16
Inventor: 杨灿群, 廖湘科, 陈娟, 王锋, 刘杰, 黄春, 易会战, 杜云飞
Owner: NAT UNIV OF DEFENSE TECH