Double-FPGA cooperative work method for deep neural network

A deep neural network and collaborative work technology, applied in the field of airborne intelligent computing, can solve problems such as limited hardware resources, and achieve the effect of improving computing speed and parallelism

Inactive Publication Date: 2018-06-29
XIAN AVIATION COMPUTING TECH RES INST OF AVIATION IND CORP OF CHINA
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention proposes a dual-FPGA cooperative working method for deep neural networks. By evaluating the computational complexity of each computing unit of the deep neural network, the network model is divided to realize the cooperative work of dual FPGA chips to solve the hardware resources of a single FPGA chip. Limited problems, better use of the parallel computing characteristics of the deep neural network, improve the computing speed, so as to be better applied to the airborne embedded environment with high real-time requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Double-FPGA cooperative work method for deep neural network
  • Double-FPGA cooperative work method for deep neural network
  • Double-FPGA cooperative work method for deep neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The present invention is described in further detail.

[0019] When the present invention is implemented, it mainly considers the requirements of the neural network model for hardware resources and data interaction. Compared with a single FPGA, it can support parallel computing of a larger-scale neural network without dynamic reconfiguration. . Taking LeNet5 as an example, the network includes an input layer, a convolutional layer C1, a downsampling layer P1, a convolutional layer C2, a downsampling layer P2, a fully connected layer F1, a fully connected layer F2, and an output layer.

[0020] Serial division method: According to the calculation amount of the network, the convolutional layer C1 and the downsampling layer P1 are implemented on the first chip, and the convolutional layer C2, the downsampling layer P2 and two fully connected layers are implemented on the second chip , the number of network layers implemented on the second chip is more, but the calculation...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of onboard intelligent computation and provides a double-FPGA cooperative work method for a deep neural network. The method comprises the steps that a deep neural network model is analyzed, according to the characteristic that the deep neural network model is composed of multiple sub layers, a calculation task is divided, and double-FPGA-chip cooperative work isachieved; the method of dividing the calculation task comprises serial division and parallel division. The limit of hardware resources can be effectively broken, time is replaced by space, the parallel degree of the neural network in an embedded calculation environment is greatly increased, and then the calculation speed of the network is increased. Similarly, the method can be further expanded toa multi-FPGA cooperative work method, and the neural network with a larger scale is achieved.

Description

technical field [0001] The invention belongs to the field of airborne intelligent computing, and proposes a double FPGA cooperative working method oriented to a deep neural network. Background technique [0002] Deep neural networks have shown better and better application effects in intelligent computing tasks in many fields, and they also have excellent application prospects in the aviation field. However, the current deep neural network with excellent performance is very large in scale and runs on workstations, supercomputers and even computer clusters. For example, the earliest Google Brain used 1000 16-core CPUs to train the deep neural network to recognize cats, and defeated the world champion last year. AlphaGo uses 1920 CPUs and 280 GPUs, so it is difficult to implement a deep neural network in an embedded environment with limited hardware resources. At present, one solution is based on FPGA's rich programmable logic and wiring resources. However, even if the curren...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/50
CPCG06F30/331
Inventor 程陶然白林亭文鹏程郭锋李亚晖刘作龙
Owner XIAN AVIATION COMPUTING TECH RES INST OF AVIATION IND CORP OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products