Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep convolutional neural network acceleration method and system suitable for NPU

A neural network and deep convolution technology, applied in the field of deep convolutional neural network acceleration methods and systems, can solve problems such as high latency, affecting system stability, and high cost, and achieve the effect of increasing computing parallelism

Active Publication Date: 2021-10-01
武汉魅瞳科技有限公司
View PDF18 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the use of high-performance GPU (Graphics Processing Unit, graphics processing unit) cluster services can meet the requirements of computing speed, it also has some disadvantages: high latency, the road video needs to be transmitted to the computing center for processing, and the delay of transmission greatly limits the speed of road traffic. Real-time monitoring capability; high cost, need to buy or rent a large number of expensive high-performance servers; low stability, network fluctuations directly affect system stability
However, the NPU embedded AI computing node must have the disadvantage of low computing power while satisfying low power consumption and low cost.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep convolutional neural network acceleration method and system suitable for NPU
  • Deep convolutional neural network acceleration method and system suitable for NPU
  • Deep convolutional neural network acceleration method and system suitable for NPU

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0045]Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The occurrences of this phrase in various places in the specification are not necessarily all referring to the same embodiment, no...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a deep convolutional neural network acceleration method and system suitable for NPU. According to the NPU calculation unit structure, the data arrangement format of the input parameters of the convolutional layer is adjusted in advance; therefore, the problem that the calculation units are not fully utilized in the convolution calculation process is solved, and the throughput of the matrix calculation units is fully utilized. and execution assembly line of the calculation instruction in the NPU is rearranged, the convolution calculation capability of the NPU is mined, and the calculation parallelism in the deep convolutional neural network is enhanced. According to the acceleration method provided by the invention, the arrangement mode of the data streams and the instructions in the deep convolutional neural network is modified, so that the efficient and rapid convolutional neural network can be realized on the NPU.

Description

technical field [0001] Embodiments of the present invention relate to the field of computer technology, and in particular to a deep convolutional neural network acceleration method and system suitable for NPU. Background technique [0002] With the deepening of research in the field of deep learning, convolutional neural networks have been applied to various fields of computer vision. Research and experiments in recent years have shown that convolutional neural networks have shown absolute dominance over traditional computer vision algorithms in many image processing tasks such as target detection, face recognition, image classification, and semantic segmentation. [0003] While the performance of convolutional neural networks in classification and detection tasks continues to improve, the amount of parameters and calculations in neural network models are also increasing significantly. When neural network algorithms are applied, they will face many problems. Although the us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06N3/04
CPCG06N3/063G06N3/045Y02D10/00
Inventor 李开邹复好郭虎
Owner 武汉魅瞳科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products