General convolutional neural network acceleration structure based on ZYNQ and design method

A convolutional neural network and acceleration structure technology, which is applied in the field of general convolutional neural network acceleration structure and design, can solve the problems of inability to realize the reuse of other algorithms, poor versatility, increasing computational complexity and computing power requirements, etc. Structural adaptability and mass data exchange issues, high versatility, effect of low design complexity

Active Publication Date: 2019-10-18
HARBIN UNIV OF SCI & TECH
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The currently proposed FPGA design methods mainly focus on specific neural network acceleration methods, and only realize the acceleration of the circuit structure for specific algorithms. The versatility is poor, and the reuse of other algorithms cannot be realized.
[0004] With the increase of the number of deep convolutional neural network layers and the explosive growth of the number of parameters, its computational complexity and computing power requirements have also increased.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • General convolutional neural network acceleration structure based on ZYNQ and design method
  • General convolutional neural network acceleration structure based on ZYNQ and design method
  • General convolutional neural network acceleration structure based on ZYNQ and design method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The present invention will be described in detail below in conjunction with the accompanying drawings and specific implementation cases.

[0028] Such as figure 1 , a general-purpose convolutional neural network acceleration structure based on ZYNQ is mainly composed of ARM processor, bus interconnection module, register, convolution operation path, auxiliary operation path, pooling operation path, memory access module, DDR4 controller and memory stick . The ARM processor is used to configure and schedule the hardware circuit designed in the FPGA, including feature map size, feature map channel number, convolution kernel size, convolution kernel channel number, output result size, output result channel number, convolution Stride and convolution mode, the register receives the start signal to start each sub-module, internally calculates the index value of each convolution operation cycle according to the configuration information, and the register value according to the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for accelerating multichannel convolution operation in a convolutional neural network. the accelerator can accelerate neural networks of any structure, can achieve programming and online configuration can be realized; the supported feature map size, feature map channel number, convolution kernel size, convolution kernel channel number and convolution stride are flexible and variable, the control logic is simple, the convolution operation parallelism degree is high, the accelerator can be applied to any ZYNQ architecture platform, and a user can cut an acceleration circuit according to dsp resources in a chip of the user; 128 dsp (Digital Signal Projection) resources can be supported at least. The universal convolutional neural network acceleration structurebased on ZYNQ comprises an ARM processor, bus interconnection, a DDR4 controller, a memory bank, a register, a convolution operation path, an auxiliary operation path, a pooling operation path and a memory access module.

Description

technical field [0001] The invention relates to the technical field of convolutional neural network hardware acceleration, in particular to a general convolutional neural network acceleration structure and design method based on ZYNQ. Background technique [0002] Convolution Neural Network (CNN) is widely used in the field of computer vision, especially in object detection and image recognition, showing good application prospects. Edge computing is a brand-new computing model whose concept is to process data directly at the edge near the data center without sending it back to the server for processing. The use of edge computing in target detection can bring a series of benefits: directly process the image on the hardware device at the acquisition end without sending it back to the host computer, saving data transmission time and reducing data transmission overhead. It is of great practical significance to realize efficient processing on hardware devices by optimizing and a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045
Inventor 刘杰马力强
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products