GEMM-based deep neural network acceleration method and system
A deep neural network, dimensional technology, applied in biological neural network models, neural architecture, climate sustainability, etc., can solve the problems of small matrix and affect the overall efficiency, and achieve the effect of reducing time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0049] like figure 1 As shown, a flow chart of a deep neural network acceleration method based on GEMM, a deep neural network acceleration method based on GEMM, includes the following steps: judging that the input matrix multiplication is regular matrix multiplication or irregular matrix multiplication; if it is a regular matrix Multiplication, traverse the pre-established regular matrix multiplication sharding strategy, and select the best sharding strategy based on Kernel Occupancy; if it is irregular matrix multiplication, generate fractions according to the matrix dimension and the preset irregular matrix multiplication sharding strategy. Sharding strategy and traversing the generated sharding strategy, select the optimal sharding strategy based on Kernel Occupancy; shard the matrix according to the selected optimal sharding strategy, and calculate all matrix shards to obtain the operation result. The working flow chart of the GEMM computing accelerator, such as figure 2...
Embodiment 2
[0108] like image 3 As shown in the figure, a structure diagram of a deep neural network acceleration system based on GEMM, the present embodiment provides a deep neural network acceleration system based on GEMM, the system includes an input judgment module, a selection strategy module and a slice operation module, each of which The specific functions of the module are as follows:
[0109] Input judgment module, used to judge whether the input matrix multiplication is regular matrix multiplication or irregular matrix multiplication;
[0110] The selection strategy module is used to traverse the preset regular matrix multiplication sharding strategy if it is regular matrix multiplication, and select the best sharding strategy based on Kernel Occupancy; if it is irregular matrix multiplication, according to the matrix dimension and preset The irregular matrix multiplication sharding strategy generates a sharding strategy and traverses the generated sharding strategy, and selec...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



