Triangular matrix multiplication vectorization method of vector processor

A vector processor and triangular matrix technology, applied in machine execution devices, concurrent instruction execution, etc., can solve the problems of reducing bandwidth requirements, inability to support, difficult data alignment, etc., to achieve efficient vectorized calculations, improve computing performance, and save storage effect of space

Active Publication Date: 2013-12-11
NAT UNIV OF DEFENSE TECH
View PDF6 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with ordinary matrices, half of the matrix elements of triangular matrices are 0, so applying ordinary matrix multiplication in computing and storage cannot effectively improve the computing performance of triangular matrices
Moreover, the calculation on the vector processor-oriented platform, on the one hand, because the triangular matrix is ​​not a square matrix like an ordinary matrix, it is difficult to achieve regular data alignment and cannot balance the calculation load of each processing unit; on the other hand, the vector data access unit The supported vector data Load/Store generally only suppo...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Triangular matrix multiplication vectorization method of vector processor
  • Triangular matrix multiplication vectorization method of vector processor
  • Triangular matrix multiplication vectorization method of vector processor

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0038] like figure 2 As shown, the present invention is oriented to the vector processor-oriented triangular matrix multiplication vectorization method, and the specific steps are:

[0039] Step 1: Store the triangular matrix elements in the triangular matrix T of the multiplicand continuously by row; set the triangular matrix T of the multiplicand as a matrix of order m*m, and the matrix B of the multiplier as a matrix of order m*k, calculate the triangular matrix T and The multiplication of the multiplier matrix B, the result matrix C=T*B, C is a matrix of order m*k;

[0040] Step 2: According to the number of vector processing units of the vector processor and the number of MAC components of the vector processing unit, the multiplier matrix B is divided into several sub-matrices B by columns i ; Suppose the number of vector proc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a triangular matrix multiplication vectorization method of a vector processor. The triangular matrix multiplication vectorization method of the vector processor comprises the steps that (1) triangular matrix elements in a multiplicand triangular matrix T are stored continuously by row; (2) a multiplier matrix B is divided into a plurality of sub-matrixes Bi by row according to the number of vector processing units of the vector processor and the number of MAC parts of the vector processing units; (3) the sub-matrixes Bi are multiplied by the multiplicand triangular matrix T in sequence and then the results are stored on storage positions of the original sub-matrixes Bi; (4) the sub-matrixes Bi of the multiplier matrix are traversed and then the fact that whether sub-matrixes Bi which are not multiplied by the multiplicand triangular matrix exist is judged, the I is updated according to the formula i=i+1 and the steps are repeated from the step (3) if sub-matrixes Bi which are not multiplied by the multiplicand triangular matrix exist, and step (5) is executed if sub-matrixes Bi which are not multiplied by the multiplicand triangular matrix do not exist; (5) triangular matrix multiplication is accomplished. The triangular matrix multiplication vectorization method of the vector processor has the advantages that the principle is simple, operation is easy and convenient, and the calculation efficiency of the vector processor can be fully performed.

Description

technical field [0001] The invention mainly relates to the field of triangular matrix multiplication calculation, in particular to a vector processor-oriented triangular matrix multiplication vectorization method. Background technique [0002] Basic Linear Algebra Subprograms (BLAS) is a core mathematical library widely referenced in various scientific calculations and a core component of many mathematical software. At present, BLAS is widely used in scientific and engineering calculations such as physics, electronic engineering, biology, economics, and computing science. Among them, the two function libraries of triangular matrix-matrix multiplication and triangular matrix solve are one of the most frequently called function libraries. Since the transpose and inverse of a triangular matrix are still triangular matrices, these two types of function libraries can be attributed to the triangular matrix multiplication calculation: C=TB, where T is the upper triangular matrix o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/38
Inventor 刘仲窦强郭阳邓让钰田希龚国辉唐遇星王永文彭元喜陈海燕胡封林陈磊
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products