Vectorization implementation method for pooling of multi-sample multi-channel convolutional neural network

A technology of convolutional neural network and implementation method, which is applied in the field of vectorized implementation of multi-sample and multi-channel convolutional neural network pooling, and can solve problems such as the mismatch of the number of processing units, the uncertain size of the third dimension, and the impact of loading data efficiency. , to save power consumption and area, avoid data shuffling, and improve overall computing efficiency

Active Publication Date: 2020-02-14
NAT UNIV OF DEFENSE TECH
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the size of the third dimension in this type of method is uncertain, which does not match the number of processing units of the vector processor, and the size of the third dimension of dif

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vectorization implementation method for pooling of multi-sample multi-channel convolutional neural network
  • Vectorization implementation method for pooling of multi-sample multi-channel convolutional neural network
  • Vectorization implementation method for pooling of multi-sample multi-channel convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.

[0046] Assuming that the number of cores of the target vector processor is q, the number of VPEs of each core is p, the two-dimensional image input data of the convolutional neural network pooling layer currently calculated is preH*preW, and the number of channels is preC, The filter size is kernelH*kernelW, the step size is stepLen; the total number of samples in the data set is M, and the Mini-batch size is MB, where MB=q*p, M=num*MB, and num is a positive integer. like figure 2 As shown, the specific steps of the vectorized implementation method of multi-sample multi-channel convolutional neural network pooling in this embodiment include:

[0047] Step 1: Store the input feature data set data of the convolutional neural network pooling layer according to th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a vectorization implementation method for pooling of a multi-sample multi-channel convolutional neural network. The method comprises the following steps: step 1, storing inputfeature data set data of a convolutional neural network pooling layer according to a sample dimension priority mode; 2, dividing an input feature data set data matrix into a plurality of matrix blocksby the vector processor according to columns; 3, sequentially extracting matrix blocks with specified sizes by the vector processor according to rows, and transmitting the matrix blocks to a data buffer area of an array memory of the vector processor; 4, each core of the vector processor performs pooling vectorization calculation on the matrix blocks in the respective data buffer area in parallel, and calculation results are transmitted to an off-chip memory in sequence; and step 5, repeating the step 3 to the step 4 until all pooling layer calculation is completed. The method can give full play to the calculation performance of the vector processor, and has the advantages of simple implementation method, high implementation efficiency, low power consumption, good effect and the like.

Description

technical field [0001] The invention relates to the technical field of vector processors, in particular to a method for realizing vectorization of multi-sample multi-channel convolutional neural network pooling. Background technique [0002] In recent years, deep learning models based on deep convolutional neural networks have made remarkable achievements in image recognition and classification, target detection, video analysis, etc. The rapid development of related technologies such as data processing and processors. Convolutional Neural Networks (CNN) is a type of Feedforward Neural Networks (Feedforward Neural Networks) that includes convolution calculations and has a deep structure. It is one of the representative algorithms for deep learning. The input layer of the convolutional neural network can process multi-dimensional data. Since the convolutional neural network is the most widely used in the field of computer vision, when designing the convolutional neural networ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/063G06N3/08
CPCG06N3/063G06N3/08G06N3/045
Inventor 刘仲鲁建壮雷元武田希陈海燕刘胜吴虎成李勇王耀华李程
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products