Method and device for deep neural network computing acceleration
A deep neural network, precomputing technique
Embodiment Construction
[0058] In the following, only some exemplary embodiments are briefly described. As those skilled in the art would realize, the described embodiments may be modified in various different ways, all without departing from the spirit or scope of the present invention. Accordingly, the drawings and descriptions are to be regarded as illustrative in nature and not restrictive.
[0059] An embodiment of the present invention provides a method for accelerating deep neural network computation. As shown in Figure 1, the method includes the following steps:
[0060] S100: Sample each input vector that is to be input into the matrix model to obtain a plurality of sampling vectors.
[0061] S200: Perform product quantization on each sampling vector according to a preset quantization parameter to obtain multiple quantization points.
[0062] S300: Divide the matrix model into multiple matrix blocks according to the quantization parameter.
[0063] S400: Calculate each quantization point...
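Steps S100 to S300 can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions not stated in the text above: the "quantization parameter" is taken to be a pair (number of sub-vectors `m`, codebook size `k`), sampling is taken to mean drawing a random subset of the input vectors, the quantization points are found with plain k-means, and the matrix model is a `d x n` weight matrix split into `m` row blocks. Because the description of S400 is truncated in the source, the lookup-table step is only a guess at a typical precomputing scheme. All function names are hypothetical.

```python
import random

def split(seq, m):
    """Cut a vector (or a list of matrix rows) into m equal contiguous pieces."""
    step = len(seq) // m
    return [seq[i * step:(i + 1) * step] for i in range(m)]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(sub, centroids):
    """Index of the quantization point closest to a sub-vector."""
    return min(range(len(centroids)), key=lambda j: sq_dist(sub, centroids[j]))

def kmeans(points, k, iters=10, rng=random):
    """Plain k-means; an assumed way of obtaining the quantization points."""
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for j, g in enumerate(groups):
            if g:  # keep the old centroid when a cluster is empty
                centroids[j] = [sum(col) / len(g) for col in zip(*g)]
    return centroids

def vec_mat(v, rows):
    """Row vector v (length r) times a block given as r rows of length n."""
    return [sum(v[i] * rows[i][j] for i in range(len(v)))
            for j in range(len(rows[0]))]

def build(inputs, weight, m, k, n_samples, seed=0):
    rng = random.Random(seed)
    samples = rng.sample(inputs, n_samples)                 # S100 (assumed form)
    codebooks = [kmeans([split(s, m)[b] for s in samples], k, rng=rng)
                 for b in range(m)]                         # S200
    blocks = split(weight, m)                               # S300
    # Assumed continuation (S400 is truncated in the source):
    # precompute every quantization point times its matrix block.
    tables = [[vec_mat(c, blocks[b]) for c in codebooks[b]] for b in range(m)]
    return codebooks, blocks, tables

def approx_product(x, codebooks, tables):
    """Approximate x @ weight by summing one precomputed row per sub-vector."""
    out = [0.0] * len(tables[0][0])
    for b, sub in enumerate(split(x, len(codebooks))):
        row = tables[b][nearest(sub, codebooks[b])]
        out = [o + r for o, r in zip(out, row)]
    return out
```

Under these assumptions, a call such as `build(inputs, weight, m=2, k=16, n_samples=100)` followed by `approx_product(x, codebooks, tables)` replaces a full vector-matrix product with `m` nearest-point searches and `m` table-row additions. The sketch requires the vector length to be divisible by `m`.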