
Implementation method of systolic array structure for LU decomposition

A systolic array technology and implementation method, applied in the field of matrix computation, that addresses two problems: matrix decomposition takes up much of an algorithm's running time, and the execution speed of software-based algorithms cannot meet demand.

Active Publication Date: 2019-11-15
JINAN UNIVERSITY

AI Technical Summary

Problems solved by technology

[0003] At present, most image processing, signal processing, and similar algorithms still run in software on PCs. They contain a large number of matrix decompositions, which take up most of the overall computing time, and as algorithm complexity and matrix dimension increase, the execution speed of software-based computation falls far short of demand.



Examples


Embodiment

[0056] As shown in Figure 1, this embodiment discloses a method for implementing a systolic array structure for LU decomposition. The systolic array structure includes an array 1 module, an array 2 module, and a state controller module, wherein:

[0057] The array 1 module is used to calculate the upper triangular matrix U. The matrix to be decomposed is fed in parallel to the input interface of the array 1 module, with no specific ordering of the data required. One column of data is read every clock cycle, and the upper triangular matrix U is available after 2N clock cycles.

[0058] The array 2 module is used to calculate the lower triangular matrix L. The intermediate data generated by the array 1 module is sent to the array 2 module through the state controller module, and the lower triangular matrix L is calculated in step with the array 1 module.

[0059] The state controller module is used to control the intermediate data exchange between the array 1 module and the array 2 modu...
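The division of work among the three modules can be sketched as a software model. This is only an illustration under stated assumptions, not the patented hardware: the function names `array1_step`, `array2_step`, and `lu_model` are hypothetical, no pivoting is performed, L is assumed to have a unit diagonal, and the clock-level timing (such as the 2N-cycle figure) is not modeled. The loop in `lu_model` plays the role of the state controller, handing the intermediate column from the array 1 model to the array 2 model and using one reciprocal per step.

```python
from fractions import Fraction


def array1_step(work, k):
    """Array 1 (model): return row k of U and the raw column below the pivot."""
    u_row = work[k][:]  # after k elimination steps, row k of the working matrix is row k of U
    raw_col = [work[i][k] for i in range(k + 1, len(work))]
    return u_row, raw_col


def array2_step(raw_col, pivot_reciprocal):
    """Array 2 (model): scale the raw column by one shared reciprocal to get column k of L."""
    return [x * pivot_reciprocal for x in raw_col]


def lu_model(a):
    """Assumed controller loop: route intermediate data from array 1 to array 2."""
    n = len(a)
    work = [[Fraction(x) for x in row] for row in a]
    L = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    U = []
    for k in range(n):
        u_row, raw_col = array1_step(work, k)
        if u_row[k] == 0:
            raise ZeroDivisionError("zero pivot; this sketch does no pivoting")
        recip = 1 / u_row[k]  # the only division in step k
        multipliers = array2_step(raw_col, recip)
        U.append(u_row)
        for offset, m in enumerate(multipliers):
            i = k + 1 + offset
            L[i][k] = m
            for j in range(k, n):
                work[i][j] -= m * u_row[j]  # eliminate column k from row i
    return L, U


L, U = lu_model([[2, 1, 1], [4, -6, 0], [-2, 7, 2]])
```

Exact `Fraction` arithmetic is used so the factors can be compared without rounding concerns; a hardware implementation would use fixed-point or floating-point units instead.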


Abstract

The invention discloses an implementation method of a systolic array structure for LU decomposition. The systolic array structure comprises an arithmetic unit array for calculating an upper triangular matrix, an arithmetic unit array for calculating a lower triangular matrix, and a state controller module for controlling data input. Compared with existing operation structures, the implementation method has the following innovations: (1) matrix data is input in parallel, line by line, and the input sequence does not need to be planned; (2) the dividers are extracted from the array processing units and implemented independently, which reduces the number of dividers and improves resource utilization; (3) the systolic array is partially parallelized, which reduces the number of clock cycles required while maintaining a high degree of pipelining; and (4) the high-dimensional systolic array structure designed by the invention is compatible with LU decomposition of lower-dimensional matrices, so that matrices of different dimensions can share the same structure and the structure does not need to be redesigned. The implementation method can be applied to hardware implementation of matrix decomposition problems in fields such as signal processing, numerical analysis, and artificial intelligence.
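Point (4) can be illustrated with a small numerical check. The excerpt does not say how the hardware handles a lower-dimensional input, so the identity padding below is purely an assumption chosen for illustration: if a small matrix is placed in the top-left corner of a larger identity matrix, the LU factors of the padded matrix are the small factors padded the same way. The helper names `embed_in_identity` and `matmul` are hypothetical.

```python
def embed_in_identity(m, size):
    """Place matrix m in the top-left corner of a size x size identity matrix."""
    n = len(m)
    out = [[1 if i == j else 0 for j in range(size)] for i in range(size)]
    for i in range(n):
        for j in range(n):
            out[i][j] = m[i][j]
    return out


def matmul(x, y):
    """Plain matrix product of two lists of rows."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


# A known 2 x 2 factorization A = L * U (values chosen to be exact in binary floating point).
A = [[4, 3], [6, 3]]
L = [[1, 0], [1.5, 1]]
U = [[4, 3], [0, -1.5]]

# Padding all three to 4 x 4 preserves the factorization.
padded_product = matmul(embed_in_identity(L, 4), embed_in_identity(U, 4))
padding_preserves_lu = padded_product == embed_in_identity(A, 4)
```

Under this assumption, a fixed-size array would produce the small matrix's factors in its top-left block while the padded rows and columns pass through unchanged.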

Description

Technical field

[0001] The invention relates to the technical field of matrix calculation, in particular to a method for implementing a systolic array structure for LU decomposition.

Background

[0002] LU decomposition is a method of matrix decomposition. Its principle is to decompose a square matrix into the product of a lower triangular matrix and an upper triangular matrix. Matrix decomposition is commonly used in many scientific fields, such as image processing and signal processing.

[0003] At present, most image processing, signal processing, and similar algorithms still run in software on PCs. They contain a large number of matrix decompositions, which take up most of the overall computing time. As algorithm complexity and matrix dimension increase, the execution speed of software-based computation falls far short of demand.

[0004] At present, it is urgent to carry out research on the matrix operation...
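For reference, the decomposition described in [0002] can be written out explicitly. The form below is the standard Doolittle convention, which assumes a unit diagonal for L and no pivoting; the excerpt does not state which convention the patent uses, so treat this as an assumption. For an n x n matrix A with entries a_{ij}:

```latex
\begin{align}
A &= LU, \qquad l_{ii} = 1, \\
u_{kj} &= a_{kj} - \sum_{m=1}^{k-1} l_{km}\, u_{mj}, \qquad j = k, \dots, n, \\
l_{ik} &= \frac{1}{u_{kk}} \Bigl( a_{ik} - \sum_{m=1}^{k-1} l_{im}\, u_{mk} \Bigr), \qquad i = k+1, \dots, n.
\end{align}
```

The recurrences are evaluated for k = 1, ..., n in order. The only division in step k is by the pivot u_{kk}, so a single reciprocal per step can serve every entry in column k of L.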


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/16
CPC: G06F17/16
Inventor: 易清明, 陈嘉文, 符清杆
Owner JINAN UNIVERSITY