Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-precision neural network model implementation method and system

A technology of neural network model and implementation method, which is applied in the field of multi-precision neural network model implementation method and system, can solve the problems of high computational complexity of neural network, increased difficulty of model deployment, and inability to support the realization of multiple precision models, etc. The difficulty of model deployment and the effect of optimizing power consumption

Active Publication Date: 2021-10-01
合肥酷芯微电子有限公司
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Due to the high computational complexity of neural networks for complex tasks, multi-precision numerical representation methods are required on specific hardware accelerators to obtain optimized power consumption and efficiency. This requirement increases the difficulty of model deployment
[0003] In the prior art, the Chinese patent application with the publication number CN110942139A discloses a "Deep Learning Neural Network Deployment System and Its Method". Although this deployment method can support the identification, analysis, and deployment model generation of various neural network frameworks , but cannot support the implementation of multiple precision models on specific hardware accelerators

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-precision neural network model implementation method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several changes and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0043] like figure 1 As shown, a kind of multi-precision neural network model implementation method provided by the present invention includes:

[0044] Step 1: Obtain specific hardware accelerator operator constraints, and generate network model calculation graph operator configuration information.

[0045] Step 1 includes:

[0046] Step 1.1: According to the numerical accuracy constraints of specific hardware accelerator operators, generate network model calculation graph operator configuration info...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-precision neural network model implementation method and system; the method comprises the steps: obtaining operator constraint conditions of a hardware accelerator, and generating calculation graph operator configuration information of a multi-precision neural network model; performing structural analysis on the multi-precision neural network model, and obtaining a corresponding relation between different numerical precision and a computational graph operator by combining configuration information of the computational graph operator; adjusting parameters of the multi-precision neural network model according to a corresponding relation between different numerical precision and computational graph operators, and obtaining performance compensation of the multi-precision neural network model after the numerical precision is reduced; and generating configuration information of each level according to the multi-precision neural network model after parameter adjustment, and deploying the multi-precision neural network model to a hardware accelerator according to the configuration information of each level. According to the method, when the complex neural network is deployed to a specific hardware accelerator, operators are ensured to adopt different numerical precision combinations, so that optimized power consumption, efficiency and storage bandwidth are obtained, and the model deployment difficulty is reduced.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a method and system for realizing a multi-precision neural network model. Background technique [0002] Due to the high computational complexity of neural networks for complex tasks, multi-precision numerical representation methods are required on specific hardware accelerators to obtain optimized power consumption and efficiency. This requirement increases the difficulty of model deployment. [0003] In the prior art, the Chinese patent application with the publication number CN110942139A discloses a "Deep Learning Neural Network Deployment System and Its Method". Although this deployment method can support the identification, analysis, and deployment model generation of various neural network frameworks , but cannot support the implementation of multiple precision models on specific hardware accelerators. Contents of the invention [0004] In view of the defects in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063
CPCG06N3/063
Inventor 鲍丹季圣洁沈沙
Owner 合肥酷芯微电子有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products