Single-precision matrix multiplication optimization method based on the Loongson 3A chip
A matrix multiplication optimization method, applied in the field of electrical digital data processing, that addresses the low performance of single-precision matrix multiplication by improving operation efficiency and overcoming invalid prefetches.
Examples
Embodiment 1
[0016] The present invention is a single-precision matrix multiplication optimization method based on Loongson 3A. First, each of the two single-precision source matrices is partitioned into two sub-matrices according to block-size criteria tied to the second-level (L2) cache: a block size no larger than the L2 cache, and a block size larger than half of the L2 cache. Next, in the core computation code of the matrix multiplication, Loongson 3A's 128-bit memory access instructions are used in place of its 32-bit memory access instructions, together with single-precision floating-point multiply-add instructions, prefetch instructions, and parallel single-precision floating-point instructions. Finally, data are prefetched using an address calculation in which the prefetch offset equals twice the size of the operation data set minus the size of one operation data unit.
[0017] In this embodiment, the two single-precision source matrices of Loongson 3A are first divided into two...
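The blocking and prefetch-offset rule described in [0016] can be sketched in portable C. This is only an illustration under stated assumptions, not the patented kernel: the names `prefetch_offset_bytes`, `sgemm_blocked`, `nb`, and `pf_bytes` are hypothetical, the four-float inner step merely stands in for one 128-bit access, and `__builtin_prefetch` is a GCC/Clang builtin used as a generic hint rather than Loongson 3A's own prefetch instruction. What counts as the "operation data set" and the "operation data unit" is left to the caller, since the text above does not define them.

```c
#include <assert.h>
#include <stddef.h>

/* Generic prefetch hint; assumption: GCC/Clang builtin, no-op elsewhere. */
#if defined(__GNUC__) || defined(__clang__)
#define PREFETCH(addr) __builtin_prefetch(addr)
#else
#define PREFETCH(addr) ((void)(addr))
#endif

/* Prefetch distance in bytes: twice the data-set size minus one data unit.
 * Both sizes are caller-supplied (hypothetical interpretation). */
static size_t prefetch_offset_bytes(size_t set_bytes, size_t unit_bytes)
{
    assert(unit_bytes > 0 && set_bytes >= unit_bytes);
    return 2 * set_bytes - unit_bytes;
}

static size_t min_size(size_t x, size_t y) { return x < y ? x : y; }

/* C += A * B for row-major n x n float matrices, tiled into nb x nb blocks.
 * nb would be chosen so that the working blocks fit the cache budget. */
static void sgemm_blocked(size_t n, size_t nb, size_t pf_bytes,
                          const float *a, const float *b, float *c)
{
    assert(nb > 0);
    const size_t pf = pf_bytes / sizeof(float); /* distance in floats */
    const size_t total = n * n;

    for (size_t ii = 0; ii < n; ii += nb)
        for (size_t kk = 0; kk < n; kk += nb)
            for (size_t jj = 0; jj < n; jj += nb) {
                const size_t i_end = min_size(ii + nb, n);
                const size_t k_end = min_size(kk + nb, n);
                const size_t j_end = min_size(jj + nb, n);
                for (size_t i = ii; i < i_end; i++)
                    for (size_t k = kk; k < k_end; k++) {
                        const float aik = a[i * n + k];
                        /* Step of 4 floats = 16 bytes, mimicking a 128-bit access. */
                        for (size_t j = jj; j < j_end; j += 4) {
                            const size_t idx = k * n + j;
                            /* Only hint addresses that lie inside B. */
                            if (idx + pf < total)
                                PREFETCH(&b[idx + pf]);
                            const size_t lim = min_size(j + 4, j_end);
                            for (size_t t = j; t < lim; t++)
                                c[i * n + t] += aik * b[k * n + t];
                        }
                    }
            }
}
```

As a purely illustrative call, a 16-byte data set with a 4-byte data unit gives `prefetch_offset_bytes(16, 4) == 28`, which can be passed as `pf_bytes` to `sgemm_blocked`. On real hardware the block size and both sizes would have to be tuned to the cache parameters of the target chip.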