A multi-core implementation method of inception structure

An implementation method, multi-core technology, applied in neural architecture, complex mathematical operations, biological neural network models, etc., can solve problems such as overfitting, increase the amount of calculation, and increase the total number of parameters in the network, and achieve simple implementation and operation. The effect of convenient, high-performance computing power

Active Publication Date: 2021-05-14
NAT UNIV OF DEFENSE TECH
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the AlexNet model regression has added a fully connected layer, the number of parameters reaches 60M. If the hidden layer in the network is increased and the number of layers of the neural network is deepened to further improve the recognition success rate of the network, it will not only increase the total number of parameters in the network, but also increase the number of parameters in the network. Increasing the amount of calculation is also extremely prone to overfitting

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-core implementation method of inception structure
  • A multi-core implementation method of inception structure
  • A multi-core implementation method of inception structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0024] Such as Figure 5 Shown, the multi-core implementation method of a kind of Inception structure of the present invention, its steps are:

[0025] S1: According to the number of cores M of the multi-core vector processor, the number P of the single-core vector processor VPE, the storage capacity V of the in-core storage space AM of the single-core vector processor, and the number N of all convolution cores in a single layer in the Inception structure , to evenly distribute the calculation amount that each core can complete simultaneously among the M cores.

[0026] S2: Transfer N / M convolution kernels from DDR to the in-core vector memory AM of M cores in turn, put the output results of the upper level into DDR, and use the scalar LOAD instruction to fetch data from DDR, And broadcast the data to the in-core AM of M cores.

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A multi-core implementation method of the Inception structure, the steps are: S1: According to the parameters such as the number of cores M of the multi-core vector processor and the number N of all convolution cores in a single layer in the Inception structure, to evenly allocate M cores. Each core can be The amount of calculation completed at the same time; S2: N / M convolution kernels are sequentially transferred from DDR to the in-core vector memory bank AM of M kernels, and the output results of the previous level are placed in DDR; S3: M kernels At the same time, the calculation of N convolution kernels is completed, and N output feature maps are obtained at the same time; S4: The output results are transferred from the M kernels to the DDR through DMA, and the M kernels simultaneously load the convolution kernel data of the next level; S5: Broadcast data from DDR to M cores, and complete the calculation of all output feature maps of this stage; S6: Repeat the steps until the calculation of all stages in the Inception structure is completed. The invention has the advantages of simple implementation, convenient operation, can improve the parallelism of the multi-core processor, accelerate the calculation speed of the GoogleNet network model, and improve the calculation efficiency of the multi-core processor.

Description

technical field [0001] The present invention mainly relates to the fields of artificial intelligence, machine learning, and convolutional neural network, and in particular refers to a multi-core implementation method of an Inception structure, which is a multi-core implementation method applied to accelerating GoogLeNet networks. Background technique [0002] The AlexNet model is an important network model in the field of convolutional neural networks. It won the first place in the ILSVRC competition in 2012. Since the AlexNet model regression has added a fully connected layer, the number of parameters reaches 60M. If the hidden layer in the network is increased and the number of layers of the neural network is deepened to further improve the recognition success rate of the network, it will not only increase the total number of parameters in the network, but also increase the number of parameters in the network. Increasing the amount of calculation is also extremely prone to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/15G06N3/04
CPCG06F17/15G06N3/045
Inventor 郭阳张军阳杨超田希扈啸李斌
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products