Symmetric matrix and vector multiplication parallel computing method and system

A parallel computing technique for symmetric matrices, applicable to complex mathematical operations, multiprogramming arrangements, resource allocation, and similar fields. It addresses problems such as wasted processor thread resources and unbalanced thread load, and achieves a reasonable distribution of thread resources, avoidance of write conflicts, and high computational efficiency.

Pending Publication Date: 2022-07-22
SHANGHAI LINCTEX DIGITAL TECH CO LTD

AI Technical Summary

Problems solved by technology

During parallel computation, the thread load is extremely unbalanced, which seriously wastes the processor's thread resources.
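To make the imbalance concrete, the sketch below counts how many strictly upper-triangular elements each thread would process if the rows of an n×n symmetric matrix were simply split into equal-sized contiguous blocks. This naive split and the function name `naive_work_per_thread` are assumptions made for illustration; the source does not state which baseline partition it has in mind.

```python
def naive_work_per_thread(n, num_threads):
    """Count strictly upper-triangular elements per thread when the n rows
    are split into contiguous blocks of (nearly) equal row count."""
    rows_per_thread = -(-n // num_threads)  # ceiling division
    work = []
    for t in range(num_threads):
        start = min(t * rows_per_thread, n)
        end = min(start + rows_per_thread, n)
        # Row i has (n - 1 - i) elements to the right of the diagonal.
        work.append(sum(n - 1 - i for i in range(start, end)))
    return work


print(naive_work_per_thread(8, 4))  # [13, 9, 5, 1]
```

With 8 rows and 4 threads, the first thread handles 13 elements while the last handles only 1, even though every thread owns the same number of rows.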

Examples

Embodiment 1

[0083] Embodiment 1 of the present invention discloses a parallel computing method for multiplying a symmetric matrix by a vector, which can avoid write conflicts, realize multi-threaded parallel computation, balance the load across threads, and distribute thread resources reasonably. The flow chart of the method is shown in Figure 1 of the accompanying drawings, and the specific scheme is as follows:

[0084] A parallel computing method for multiplying a symmetric matrix by a vector, comprising the following steps:

[0085] 101. Obtain the symmetric matrix and the vector to be multiplied, and determine the number of threads available for parallel computation;

[0086] 102. With the symmetry axis as the boundary, divide the symmetric matrix into a symmetry axis area, an upper triangle area and a lower triangle area, and divide the upper triangle area or the lower triangle area into multiple sub-areas in units of rows according to the number of threads; in each s...
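The row-wise division in step 102 can be sketched as follows. Because the remainder of the step is truncated in this excerpt, the code does not attempt to reproduce the patent's first-axis and second-axis scheme; it is only a generic greedy partition, under the assumption that the goal is to give each sub-area a similar number of upper-triangle elements. The function name `balanced_row_ranges` is hypothetical.

```python
def balanced_row_ranges(n, num_threads):
    """Split rows 0..n-1 of the strictly upper triangle into contiguous
    sub-areas holding roughly equal numbers of elements.

    Returns a list of (start, end) half-open row ranges, one per thread.
    This is an illustrative heuristic, not the patented scheme.
    """
    row_work = [n - 1 - i for i in range(n)]  # elements right of the diagonal
    total = sum(row_work)
    ranges, start, done = [], 0, 0
    for t in range(num_threads):
        end = start
        if t == num_threads - 1:
            end = n  # the last thread takes whatever rows remain
        else:
            target = total * (t + 1) / num_threads
            # Add a row only while it moves the running total closer to the target.
            while end < n and abs(done + row_work[end] - target) <= abs(done - target):
                done += row_work[end]
                end += 1
        ranges.append((start, end))
        start = end
    return ranges


print(balanced_row_ranges(8, 4))  # [(0, 1), (1, 2), (2, 4), (4, 8)]
```

For an 8×8 matrix and 4 threads, the resulting sub-areas hold 7, 6, 9 and 6 elements respectively. Sub-areas near the top of the triangle contain fewer rows because those rows are longer.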

Embodiment 2

[0119] Embodiment 2 of the present invention discloses a parallel computing system for multiplying a symmetric matrix by a vector. It implements the method of Embodiment 1 as a system. The specific structure is shown in Figure 3 of the accompanying drawings, and the specific scheme is as follows:

[0120] A parallel computing system for multiplying a symmetric matrix by a vector, comprising:

[0121] An input unit 1, used to obtain the symmetric matrix and the vector to be multiplied, and to determine the number of threads available for parallel computation;

[0122] An area division unit 2, used to divide the symmetric matrix, with the symmetry axis as the boundary, into the symmetry axis area, the upper triangle area and the lower triangle area, and to divide the upper triangle area or the lower triangle area into a plurality of sub-areas in units of rows according to the number of threads; in each sub-region, a first axis and / or a second a...

Abstract

The invention provides a parallel computing method and system for multiplying a symmetric matrix by a vector. The method comprises the following steps: obtaining a symmetric matrix and a vector, and determining the thread count; dividing the matrix into sub-regions; in each sub-region, defining a first axis and/or a second axis parallel to the symmetry axis; multiplying the symmetry axis region by the vector and writing the result into a register; performing two different first multiplication calculations on the first axis to obtain two different groups of data, and writing each group into a register; and performing two different second multiplication calculations on the second axis to obtain two or four different groups of data, and writing each group into the register, thereby obtaining the result of multiplying the symmetric matrix by the vector. With this scheme, the multiplication of a symmetric matrix by a vector can be computed accurately and quickly. The processing order of the matrix elements is planned so that resources are allocated reasonably to all threads and write conflicts are effectively avoided. Multi-threaded parallel computation is achieved with high computational efficiency, balanced thread loads, and a reasonable distribution of thread resources.

Description

Technical field

[0001] The invention relates to the field of matrix data processing, in particular to a parallel computing method and system for multiplying a symmetric matrix by a vector.

Background

[0002] A real matrix A is a block of data composed of M×N real numbers, and a vector x can be regarded as a matrix composed of N×1 real numbers. The product y of the matrix A and the vector x is an M×1 vector. Matrix-vector multiplication is defined as follows:

[0003] $y_i = \sum_{j=1}^{N} A_{ij} x_j, \quad i = 1, \dots, M$

[0004] Parallel computation of matrix multiplication on a processor can effectively shorten data processing time. When developing parallel algorithms, an important issue is how to avoid write conflicts. Simply put, a write conflict occurs when multiple concurrently running threads need to write to the same memory address at the same time. If nothing is done about it, an earlier written result may be overwritten by a later one, resulting in an incorre...
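The write-conflict problem described above can be illustrated with a small sketch. When only the diagonal and the upper triangle of a symmetric matrix are read, each off-diagonal element A[i][j] contributes to both y[i] and y[j], so threads that own different rows may need to update the same output entry. One common way around this, shown here, is to give each thread a private output buffer and sum the buffers afterwards. This is a generic technique chosen for illustration, not the register-based scheme of the patent; the names `partial_symv` and `symv_private_buffers` and the cyclic assignment of rows to threads are assumptions.

```python
from concurrent.futures import ThreadPoolExecutor


def partial_symv(A, x, rows):
    """Accumulate the contribution of the given rows into a private buffer,
    reading only the diagonal and the upper triangle of A."""
    n = len(x)
    y_local = [0.0] * n
    for i in rows:
        y_local[i] += A[i][i] * x[i]       # symmetry axis (diagonal) term
        for j in range(i + 1, n):
            a = A[i][j]
            y_local[i] += a * x[j]         # upper-triangle element A[i][j]
            y_local[j] += a * x[i]         # its mirror image A[j][i]
    return y_local


def symv_private_buffers(A, x, num_threads=2):
    """Compute y = A x for symmetric A without shared writes:
    each worker fills its own buffer, and the buffers are summed at the end."""
    n = len(x)
    # Cyclic row assignment (an assumption) spreads long and short rows evenly.
    chunks = [range(t, n, num_threads) for t in range(num_threads)]
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        partials = list(pool.map(lambda rows: partial_symv(A, x, rows), chunks))
    return [sum(p[i] for p in partials) for i in range(n)]


A = [[2.0, 1.0, 0.0],
     [1.0, 3.0, 4.0],
     [0.0, 4.0, 5.0]]
x = [1.0, 2.0, 3.0]
print(symv_private_buffers(A, x))  # [4.0, 19.0, 23.0]
```

The private buffers cost extra memory and a final reduction pass, which is the kind of overhead a conflict-free processing order aims to avoid. Python threads are used here only to show the structure; they do not provide a real speedup for this pure-Python arithmetic.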

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/16, G06F9/50
CPC: G06F17/16, G06F9/5038
Inventors: 王华明, 刘郴
Owner: SHANGHAI LINCTEX DIGITAL TECH CO LTD