Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep neural network reasoning acceleration method and system based on multi-operator fusion

A deep neural network and neural network technology, applied in inference methods, neural learning methods, biological neural network models, etc., can solve the problems of lack of automatic inference tools, incomplete and in-depth operator fusion, etc., and achieve the effect of realizing inference speed.

Pending Publication Date: 2021-09-21
ZHEJIANG LAB
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the above-mentioned technical problems existing in the prior art, the present invention proposes a deep neural network reasoning acceleration method based on multi-operator fusion to solve the problem that the current operator fusion technology lacks automatic reasoning tools, and the operator fusion is not thorough. In-depth questions to improve the speed of model reasoning, the specific technical solutions are as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network reasoning acceleration method and system based on multi-operator fusion
  • Deep neural network reasoning acceleration method and system based on multi-operator fusion
  • Deep neural network reasoning acceleration method and system based on multi-operator fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the purpose, technical solution and technical effect of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0037] like figure 1 As shown, a deep neural network reasoning acceleration system based on multi-operator fusion, the system is divided into three layers: the underlying framework layer, including the deep learning framework and deep learning reasoning engine required for deep neural network training and reasoning, The deep learning framework includes but is not limited to TensorFlow, Pytorch, MXNet, PaddlePaddle, MindSpore, OneFlow, etc. These deep learning frameworks are used for the construction of deep neural networks, calculation graph generation and training, and for each deep learning framework, Select the corresponding reasoning engine that supports the framework for the reasoning service of the model; the interface layer in the middle includes a d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a deep neural network reasoning acceleration method and system based on multi-operator fusion, and the method specifically comprises the steps: inputting a neural network calculation diagram, obtaining a neural network calculation logic diagram, and obtaining a complete neural network forward calculation symbol expression according to the calculation relation between neural network operators; employing a fusible operator search method, and automatically simplifying the system by means of operator symbol expressions, simplifying a symbol expression of forward calculation of the neural network, obtaining the simplest symbol expression, and achieving multi-operator fusion; according to a multi-operator fusion result and the obtained simplest symbol expression, a new neural network calculation reasoning logic diagram is constructed, the simplest symbol expression is decoupled, offline calculation is performed, new model parameters are stored, and a corresponding neural network model structure is constructed; and finally, loading new model parameters to realize reasoning acceleration. According to the invention, the overhead of operator execution gaps can be reduced, the utilization rate of computing resources of equipment is improved, and the overall network reasoning speed is optimized.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and relates to a deep neural network reasoning acceleration method and system based on multi-operator fusion. Background technique [0002] In the field of artificial intelligence, a deep neural network is usually a very deep neural network that contains many network parameters. For example, ResNet-101 contains a network structure of 101 layers and more than 23 million network parameters. This makes the inference speed of the deep neural network a great challenge in the deployment inference stage. Operator fusion is a neural network inference acceleration technology. Operator fusion is to analyze and optimize the logic of the existing network computing graph, and perform operations such as splitting, reorganizing, and merging the original computing logic to reduce the overhead of operator execution gaps and improve the utilization of device computing resources, so as to realize t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08G06N5/04
CPCG06N3/04G06N3/08G06N5/04
Inventor 傅家庆杨非叶娇娇钟昊文陈岱渊单海军
Owner ZHEJIANG LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products