
Complex matrix optimizing method

A complex matrix optimization method, applied in the field of computer program optimization, that addresses the high operating overhead and low overall efficiency of existing approaches.

Inactive Publication Date: 2012-10-10
UNIV OF SCI & TECH OF CHINA
2 Cites, 7 Cited by

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to propose an optimization method for complex matrices that overcomes the low overall efficiency, and the resulting high running overhead, of the existing general blocked and fixed-block-size calculation methods when run on Godson.



Examples


Embodiment 1

[0012] The specific steps of this embodiment of the complex matrix optimization method on Godson are as follows:

[0013] 1. Loop unrolling and instruction scheduling

[0014] In Level 3 BLAS, part of the commonly used Basic Linear Algebra Subprograms (BLAS) mathematical library, loop unrolling generally refers to the unrolling granularity used when multiplying block matrices after the matrix has been divided into blocks, with the focus on the relationship between loop unrolling and instruction scheduling. Godson is a four-issue design: one memory access instruction, two floating-point instructions, and one fixed-point instruction are issued in each cycle. To keep the number of cycles the program takes as small as possible, the loop body must maintain a 2:1 ratio of floating-point operations to memory accesses. Assume that the unrolling granularities of the first matrix A and the second matrix B are KA ...
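The source text breaks off before giving the relation between the unrolling granularities and the 2:1 ratio, so the following is only an illustrative sketch under a simple counting model that is an assumption, not the patent's own derivation: one step of a KA-by-KB unrolled loop loads KA elements of A and KB elements of B and performs KA × KB multiply-add operations. The function names are hypothetical.

```python
def ops_per_fetch(ka, kb):
    """Multiply-adds per load for one step of a ka-by-kb unrolled inner loop.

    Assumed model (not taken from the source): each step loads ka elements
    of A and kb elements of B, then performs ka * kb multiply-adds.
    """
    loads = ka + kb
    multiply_adds = ka * kb
    return multiply_adds / loads


def square_granularities_meeting_ratio(target, max_k=8):
    """Return every k in 1..max_k whose k-by-k unrolling reaches the target ratio."""
    return [k for k in range(1, max_k + 1) if ops_per_fetch(k, k) >= target]


candidates = square_granularities_meeting_ratio(2.0)
print(candidates[0])  # smallest square granularity reaching 2:1 under this model
```

Under this model the smallest square granularity that reaches the 2:1 ratio is 4, which is consistent with the four-by-four unrolling named in the abstract; a complex-valued count would change the constants.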


Abstract

The invention discloses a complex matrix optimization method comprising the following steps. First, the unrolling granularity specific to the Godson architecture is calculated, four-by-four loop unrolling is applied to the complex matrix, and the maximum admissible value is selected as the matrix block size nb, giving the optimal block size on Godson; nb is smaller than 52, and the product of 24 and the square of nb is smaller than the 64-kilobyte capacity of the Godson processor's first-level data cache. Second, the matrices in a matrix multiplication are divided and combined so as to exploit the contiguity and locality of data storage, reducing the number of accesses to Godson's first-level data cache. Third, the complex additions and multiplications common in complex matrix operations are carried out using the classic algorithm for multiplying two complex numbers, reducing the operation count. As a result, the computational performance of complex matrix multiplication on Godson is enhanced by about 50%, and the running speed of BLAS (Basic Linear Algebra Subprograms) on Godson 3A is increased by more than 1.5 times.
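Two of the quantitative steps in the abstract can be sketched briefly. The sketch rests on stated assumptions: nb is restricted to multiples of 4 so that blocks tile evenly under four-by-four unrolling, and the "classic" complex multiplication is taken to be the well-known three-multiplication scheme, which the abstract does not name. The factor 24 and the 64 KB cache size come directly from the abstract; the function names are hypothetical.

```python
L1_DATA_CACHE_BYTES = 64 * 1024  # 64 KB first-level data cache (from the abstract)
BLOCK_FACTOR = 24                # factor in 24 * nb**2 (from the abstract)
UNROLL = 4                       # four-by-four loop unrolling (from the abstract)


def largest_block_size(strict_upper_bound=None):
    """Largest multiple of UNROLL with BLOCK_FACTOR * nb**2 < cache size.

    Restricting nb to multiples of UNROLL is an assumption of this sketch.
    If strict_upper_bound is given, nb must also be smaller than it.
    Returns 0 if no candidate qualifies.
    """
    nb = 0
    candidate = UNROLL
    while BLOCK_FACTOR * candidate ** 2 < L1_DATA_CACHE_BYTES:
        if strict_upper_bound is not None and candidate >= strict_upper_bound:
            break
        nb = candidate
        candidate += UNROLL
    return nb


def complex_mul_3m(a, b, c, d):
    """Compute (a + bi) * (c + di) with three real multiplications.

    Assumed technique: the three-multiplication scheme, which trades one
    multiplication for extra additions. Returns (real, imag).
    """
    k1 = c * (a + b)
    k2 = a * (d - c)
    k3 = b * (c + d)
    return k1 - k3, k1 + k2


print(largest_block_size())        # cache inequality alone
print(largest_block_size(52))      # with the abstract's "smaller than 52" bound
print(complex_mul_3m(1, 2, 3, 4))  # (1 + 2i) * (3 + 4i)
```

With the cache inequality alone, the largest multiple of 4 is 52 (24 × 52² = 64896 bytes, just under 65536); applying 52 as a strict upper bound, as the abstract's wording suggests, gives 48. The three-multiplication form yields the same result as the four-multiplication definition, here (-5, 10).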

Description

Technical field

[0001] The invention belongs to the technical field of computer program optimization methods, and in particular relates to the optimized design of complex matrix functions.

Background

[0002] With the wide application of large-scale mathematical calculations in science, engineering, finance and other fields, the Basic Linear Algebra Subprograms (BLAS), a commonly used mathematical library, has become a necessary matrix operation tool for all kinds of system software. In addition, LINPACK, the linear system software package that serves as an important benchmark for evaluating high-performance computing, is built on the BLAS library. Optimizing the BLAS library on different platforms is therefore very important. Loongson 3A is the first domestically produced quad-core high-performance processor developed by the Institute of Computing Technology, Chinese Academy of Sci...


Application Information

IPC(8): G06F17/16
Inventor: 顾乃杰, 杨阳朝, 陈强, 吴淅
Owner: UNIV OF SCI & TECH OF CHINA