Convolution operation method based on expansion access on heterogeneous many-core architecture
A convolution operation and heterogeneous technology, applied in the field of deep learning, can solve problems such as inability to use processor computing resources, poor optimization effect, system bandwidth pressure, etc., to save memory bandwidth resources, reduce memory access requirements, and improve performance effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0019] Embodiment: The present invention provides a convolution operation method based on dilation fetching on a heterogeneous many-core architecture, which specifically includes the following steps:
[0020] S1. Input input, weight, and stride, where input is Hi*Wi, weight is K*K, calculate the shape of output output according to the shape of input and weight, and obtain Ho*Wo;
[0021] S2. According to the shape of the output, in the Ho and Wo dimensions, according to the logic number of each core, the convolution calculation tasks are evenly distributed to the cores, and each core processes a calculation task whose size is Ho_BLOCK*Wo_BLOCK;
[0022] S3. Each core calculates the required input size Hi_BLOCK* Wo_BLOCK according to its own task size, Hi_BLOCK=Ho_BLOCK*stride+K-1, Wi_BLOCK= Wo_BLOCK*stride+K-1;
[0023] S4. Each core performs convolution calculation through the obtained input (Hi_BLOCK* Wo_BLOCK) and weight;
[0024] S5. Steps S3 and S4 are repeated until the...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


