FPGA parallel acceleration method based on convolution neural network (CNN)

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional neural network and network technology, applied in the field of FPGA parallel acceleration of convolutional neural networks, can solve the problems of not fully exerting the FPGA computing potential and poor scalability.

Inactive Publication Date: 2017-12-12

NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT

View PDF2 Cites 40 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Previously, some scholars have implemented CNN with different structures on FPGA to do simple real-time image recognition or classification, but the computing potential of FPGA has not been fully utilized, and it has poor scalability.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0012] Below, in conjunction with accompanying drawing, the present invention is described in detail as follows:

[0013] The FPGA parallel acceleration method of a convolutional neural network of the present invention comprises the following points:

[0014] One is the CNN model structure. The CNN model structure adopted in the present invention is made up of 1 input layer input, 1 output layer output, 2 convolution layers, 2 pooling and a fully connected network Softmax, such as figure 1 shown. In this experiment, the input image set is the handwritten digital image set MNIST. The size of each image is 28×28 pixels. The specific network structure is as follows:

[0015] Input layer: 28×28;

[0016] C1Conv layer: 3kernels, each with size 5×5, stride=1;

[0017] S1Max-pooling layer: each with size 2×2, stride=2, β=1.0 b=0.0;

[0018] C2Conv layer: 6kernels, each with size 5×5, stride=1;

[0019] S2Max-pooling layer: each with size 2×2, stride=2, β=1.0 b=0.0;

[0020] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an FPGA parallel acceleration method based on a convolution neural network (CNN), and the method comprises the following steps: (1), building a CNN model; (2), configuring a hardware architecture; (3), configuring a convolution operation unit. The beneficial effects of the invention lies in that the method employs the FPGA for implementing the CNN, increases the speed of CNN calculation, compared with a GPU, relative to a handwriting image dataset MNIST by nearly five times through designing an optimization hardware acceleration scheme, and achieves the 10-time acceleration when compared with a 12-core CPU; the power consumption is one third of the power consumption of the CPU.

Description

technical field [0001] The invention relates to the field of computer computing, in particular to a FPGA parallel acceleration method of a convolutional neural network. Background technique [0002] Convolutional Neural Network (Convolutional Neutral Network) is a kind of artificial neural network. CNN is the first learning algorithm to truly successfully train a multi-layer network structure. It uses the spatial relationship and adopts the weight sharing network structure to make it more similar to the biological neural network, which reduces the complexity of the network model and reduces the number of weights to improve the training performance of the general forward BP algorithm. This advantage is more obvious when the input of the network is a multi-dimensional image. On the other hand, in CNN, the image can be directly used as the bottom layer input of the network, and the information is then transmitted to different layers in turn. Each layer passes a digital filter...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/063

CPCG06N3/065

Inventor 徐杰包秀国陈训逊王博王东安

Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

FPGA parallel acceleration method based on convolution neural network (CNN)

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology