Unlock instant, AI-driven research and patent intelligence for your innovation.

CNN accelerator data access method and system

A data access and accelerator technology, applied in the field of calculation and computing, can solve the problems of limited on-chip resources and memory bandwidth acceleration, and achieve the effect of reducing high transmission bandwidth pressure, reducing memory bandwidth and power consumption, and reducing local storage overhead.

Pending Publication Date: 2022-06-24
昆山市工业技术研究院有限责任公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, embedded mobile terminals are facing the challenges of limited on-chip resources, high performance and low power consumption. CNN networks have a large number of parameters, most of which are stored in external memory. Memory bandwidth has become a serious problem in accelerating CNN networks. Problem, for huge multiplication and addition operations and complex control signals, it is necessary to study the data flow and calculation mode in depth to maximize the application of on-chip resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • CNN accelerator data access method and system
  • CNN accelerator data access method and system
  • CNN accelerator data access method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0047] FPGA has powerful parallel processing capability and abundant logic resources. For example, DSP can perform multiply-accumulate operations with higher computing precision, which is very suitable as a CNN implementation platform.

[0048] like figure 1 As shown, a CNN accelerator data access method provided by an embodiment of the present invention includes steps S1 to S8 as described below.

[0049] S1. Preset training parameters, obtain an input feature map and store it in a buffer area.

[0050] It should be noted that the training parameters include the size and weight of the input feature map, the weight can be obtained by training the neural network, and the input feature maps of different channels are stored in the buffer.

[0051] S2. Perform judgment processing on the input feature map in the buffer area, obtain prefetched data through register array processing, and perform cyclic processing on the prefetched data to obtain post-sequence data

[0052] The pro...

Embodiment 2

[0082] A CNN accelerator data access system provided by an embodiment of the present invention includes:

[0083] Data cache module: used to preset training parameters, obtain input feature maps and store them in the cache area;

[0084] Data preprocessing module: used to judge and process the input feature map in the buffer area, obtain prefetched data through register array processing, and cyclically process the prefetched data to obtain the sequential data;

[0085] Convolution calculation module: used for convolution calculation processing of sequential data to obtain convolution data;

[0086] Data post-processing module: used to perform batch normalization, RELU activation function activation, quantization, pooling and FIFO buffer processing on the convolution data in turn to obtain the output results;

[0087] Judgment output module: used to judge the number of processing layers of the output result to obtain the target coordinates.

[0088] As will be appreciated by ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a CNN accelerator data access method and system, and belongs to the technical field of calculation and reckoning, and the method comprises the steps: presetting training parameters, obtaining an input feature map, and storing the input feature map in a cache region; judging and processing the input feature map in the cache region, processing the input feature map through a register array to obtain prefetched data, and circularly processing the prefetched data to obtain sequenced data; performing convolution calculation processing on the sequenced data to obtain convolution data; sequentially performing batch normalization, RELU activation function activation, quantization, pooling and FIFO cache processing on the convolution data to obtain an output result; performing judgment processing on the processing layer number of the output result so as to realize full-on-chip storage and obtain a target coordinate; and the memory bandwidth and the power consumption are effectively reduced.

Description

technical field [0001] The invention relates to a CNN accelerator data access method and system, belonging to the technical field of calculation and calculation. Background technique [0002] In recent years, deep learning has greatly promoted the development of machine learning. Convolutional Neutral Network (CNN) is proposed by the mechanism of biological receptive field. It is a deep neural network with local connection, weight sharing and convergence characteristics. Feedforward neural network is also an excellent learning model in deep learning. Because of its extremely advanced performance, it is widely used in many application fields such as processing image information and speech information. Compared with traditional methods, while achieving high-precision and real-time target recognition capabilities, it also requires more computing processing and memory resources, which must rely on large-scale servers, and current general-purpose processors can no longer meet its...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/063
CPCG06N3/063G06N3/045
Inventor 张娟梁天柱张广明疏建王汉霖
Owner 昆山市工业技术研究院有限责任公司