Method and hardware structure capable of realizing convolution calculation in multiple neural networks

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of neural network and hardware structure, applied in the direction of biological neural network model, neural architecture, physical realization, etc., can solve the problems of low computing efficiency of traditional CPU, long hardware iteration cycle, huge deployment cost, etc.

Inactive Publication Date: 2021-09-03

井芯微电子技术(天津)有限公司

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] 1. The hardware environment is almost customized, and the support for network upgrades and network changes is poor;

[0007] 2. After the algorithm is updated and the scene changes, the hardware iteration cycle is longer, and the cost is high, and the tape-out risk is greater;

[0008] 3. Traditional CPU has low computing efficiency;

[0009] 4. The energy efficiency ratio of GPU is low, and the deployment cost is huge.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0044] Embodiment 1, a method that can realize convolution calculation in various neural networks, by analyzing the convolution operation process in various neural networks, extracting the common features of the convolution calculation change methods involved in the convolution operation process, and The common features are parameterized, and the hardware structure is used to analyze and configure the processed parameters to realize the analysis of different convolution algorithms.

[0045] The basic operations of different CNN networks are roughly the same. The differences are mainly reflected in the different sizes of input feature maps, the number of input feature maps is too different, the dimensions of convolution kernels are different, and the positions of full connections in the network are different. But among the above differences, The process of CNN network operation is composed of convolution, activation, pooling, full connection and other calculation processes; ther...

Embodiment 2

[0069] Embodiment 2 provides a kind of hardware structure (such as figure 1 shown), including: a main control CPU, and a weight cache unit connected to the main control CPU, an input feature map cache unit, an output feature map cache unit, the weight cache unit is used to cache weight data, and the input feature map cache unit , The output feature map cache unit is used to cache feature map data;

[0070] The output end of the weight cache unit is connected to the weight multicast control unit, and the weight multicast control unit is used to broadcast weight data to the parallel computing array unit;

[0071] The output end of the input feature map cache unit is connected to the FP_rd unit, and the output end of the FP_rd unit is connected to the parallel computing array unit;

[0072] The input end of the output feature map cache unit is connected to the Fp_wr unit, and the input end of the Fp_wr unit is connected to the nonlinear computing unit;

[0073] The output end o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a method and a hardware structure capable of realizing convolution calculation in various neural networks, which are characterized in that convolution calculation processes in various neural networks are analyzed, common characteristics of convolution calculation change modes involved in the convolution calculation processes are extracted, parameterization is carried out on the common characteristics. The configured and processed parameters are analyzed by using a hardware structure to realize analysis of different convolution algorithms. According to the method and the hardware structure capable of realizing the convolution calculation in the multiple neural networks, through abstracting multiple convolution neural network calculation processes, network changes are written into a register instruction capable of being analyzed by hardware, and the calculation state of the neural network calculation unit is sensed through the main control CPU; the main control CPU configures weight data, feature map data, a configuration register and other configuration information required by the convolutional neural network calculation unit in real time, and such a hardware structure can realize effective support for various convolutional neural network calculations, and is fast and compatible, and the calculation efficiency is not affected.

Description

technical field [0001] The invention belongs to the technical field of delay performance testing of switching chips, and in particular relates to a method and a hardware structure capable of realizing convolution calculation in various neural networks. Background technique [0002] At present, in the CNN (convolutional neural network) inference acceleration network hardware design, there are mainly the following design directions in the design, lightweight hardware design, low-power hardware design, sparse network hardware design, storage and calculation integrated hardware Design, large computing power hardware design and other directions, each hardware design method has different application scenarios, due to the calculation process of convolutional neural network, the hardware structure is relatively fixed for a certain network, but I want to design a relatively general hardware Environment, to support a variety of convolutional neural network structures, has become relat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N3/04G06N3/063

CPCG06N3/063G06N3/045

Inventor 朱珂王元磊陶常勇汪欣刘长江夏云飞王永胜李晓颖李文强

Owner 井芯微电子技术(天津)有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and hardware structure capable of realizing convolution calculation in multiple neural networks

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology