Convolutional neural network calculation method for many-core processor based on double coordinate systems

A convolutional neural network and many-core processor technology, applied in the field of Shenwei heterogeneous many-core processors and deep learning, can solve problems such as high pressure on computing and communication, reduce redundancy, reduce space and time complexity The effect of reducing the space complexity

Pending Publication Date: 2022-07-12
UNIVERSITY OF CHINESE ACADEMY OF SCIENCES
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to provide a convolutional neural network calculation method for many-core processors based on a dual coordinate system, which solves the problem of high communication pressure between calculation and communication under existing communication constraints

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network calculation method for many-core processor based on double coordinate systems
  • Convolutional neural network calculation method for many-core processor based on double coordinate systems
  • Convolutional neural network calculation method for many-core processor based on double coordinate systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] see Figure 1-5 As shown, it is a block diagram of the DCS-CNN of the present invention. Specifically, the DCS-CNN of the present invention includes the following steps:

[0032] S1, perform logical padding (LP) on the input data instead of traditional physical padding (PP).

[0033]

[0034] Logical Fill (LP) is essentially a one-way implementation of DSCM that transforms the input image from a Physical Coordinate System (PCS) to a Logical Coordinate System (LCS) using Equation (1). Specifically, given a kernel size of K and a stride of S, LP first computes its actual values ​​in four directions (i.e., up, down, left, and right) for 64 slave kernels (CPEs). Size of the padding region to ensure efficient computation in convolutional layers (conv) or pooling layers (pool):

[0035]

[0036] where P 1 and P 2 (like figure 2 shown) correspond to the padding width and height of all CPEs, respectively. The above formula (2) is a flexible concept because or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-core processor convolutional neural network calculation method based on a dual coordinate system, relates to the field of SW heterogeneous multi-core processors and deep learning, solves the problem that the communication pressure between calculation and communication is relatively large under the existing communication constraint, and provides the following scheme. The method comprises the following steps: S1, carrying out logic filling (LP) instead of traditional physical filling (PP) on input data; s2, segmenting the logically filled data so as to distribute the data to a slave core array (CPEs) on a super computer chip; and S3, performing general matrix multiplication (iGEMM) calculation suitable for a dual coordinate system on the distributed data by the slave core array, and outputting a result. The device has the characteristics that the communication pressure between calculation and communication under the communication constraint is reduced, and the space and time complexity is reduced, so that faster matrix multiplication becomes possible.

Description

technical field [0001] The invention relates to the field of Shenwei heterogeneous many-core processors and deep learning, in particular to a convolutional neural network calculation method for many-core processors based on a dual coordinate system. Background technique [0002] Convolutional layers in convolutional neural networks extract features from the input image. Feature maps are sensitive to the location of features in the input image. A downsampling operation on the feature map is used to reduce this sensitivity. Pooling is a common downsampling operation. [0003] Pooling makes the model more robust to changes in feature locations in the input image. Pooling works similarly to convolution, sliding a kernel window over the input image and applying a pooling function to the data in the window. Two common pooling functions are: max pooling and mean pooling. The specific operations are as follows: max pooling summarizes the most active regions in the feature, ie: s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06T1/60G06T1/20G06N3/04G06F13/28G06F15/16G06T7/10
CPCG06T1/60G06T1/20G06F13/28G06F15/16G06T7/10G06N3/045Y02D10/00
Inventor 徐涵左哲铭李杰
Owner UNIVERSITY OF CHINESE ACADEMY OF SCIENCES
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products