Method, computing device, and computer-readable storage medium for convolution computation

A convolution computation technology, applied in the field of machine learning, that addresses the problems of wasted bandwidth and high power consumption.

Active Publication Date: 2021-09-21
SHANGHAI BIREN TECH CO LTD
Cites: 14; Cited by: 0

AI Technical Summary

Problems solved by technology

Because the pixel data to be loaded in adjacent cycles overlaps, reloading the complete 2x2 pixel data in every cycle wastes bandwidth and power.
[0003] In summary, the traditional convolution calculation scheme must reload the complete set of pixels corresponding to each weight in every cycle, which occupies more bandwidth and consumes more power.



Embodiment Construction

[0022] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0023] In the description of the embodiments of the present disclosure, the term "comprising" and its similar expressions should be interpreted as an open inclusion, that is, "including but not limited to". The term "based on" should be understood as "based at least in part on". The term "one embodiment" or "the embodiment" should be read as "at least one embodiment...


Abstract

The present disclosure relates to a method for convolution calculation, a computing device, and a computer-readable storage medium. The method includes: if it is determined that the current cycle is the initial cycle, loading data of n*n pixels from the input buffer to the systolic array, where the systolic array includes processing units arranged in a first predetermined number of rows by a second predetermined number of columns, the initial cycle is associated with the initial weight, and n is a natural number; if it is determined that the current cycle is another cycle after the initial cycle, loading, based on the sequence number of the current cycle, the data of a column of pixels, the data of a row of pixels, or the data of one pixel to the systolic array; and if it is determined that the data loading corresponding to the current cycle is completed, selecting, via the output multiplexing unit, one group of n*n pixel data from the one or more groups of n*n pixel data in the systolic array, to be provided to the general matrix multiplication unit. According to the embodiments of the present disclosure, bandwidth occupation and power consumption can be effectively reduced.
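The loading pattern in the abstract can be illustrated with a small counting model. This is only a sketch under assumptions that the text does not state: weights are visited in raster order, each weight at kernel offset (row, col) needs the n*n pixel window shifted by that offset, and every pixel loaded earlier stays available in the array. The names `window` and `simulate_loads` are hypothetical and do not come from the patent.

```python
def window(offset_row, offset_col, n):
    """Coordinates of the n*n input pixels paired with one weight."""
    return {(offset_row + r, offset_col + c) for r in range(n) for c in range(n)}


def simulate_loads(n=2, k=3):
    """Count pixels loaded per cycle: full reload vs. loading only new pixels."""
    held = set()  # assumption: pixels loaded in earlier cycles remain resident
    full, incremental = [], []
    for cycle in range(k * k):
        weight_row, weight_col = divmod(cycle, k)  # assumption: raster weight order
        needed = window(weight_row, weight_col, n)
        full.append(len(needed))          # traditional scheme reloads everything
        new_pixels = needed - held
        incremental.append(len(new_pixels))
        held |= new_pixels
    return full, incremental


full, incremental = simulate_loads()
print(full, sum(full))                # [4, 4, 4, 4, 4, 4, 4, 4, 4] 36
print(incremental, sum(incremental))  # [4, 2, 2, 2, 1, 1, 2, 1, 1] 16
```

Under these assumptions, the initial cycle loads the full 2x2 block, and each later cycle loads two pixels (a column or a row) or a single pixel, for 16 pixel loads in total instead of 36.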

Description

Technical field

[0001] Embodiments of the present disclosure generally relate to the field of machine learning, and more specifically to a method for convolution calculation, a computing device, and a computer-readable storage medium.

Background

[0002] A typical operation in machine learning is convolution calculation. Convolution calculation multiplies the weights in the convolution kernel by the corresponding input pixel block and accumulates the products to generate the value of the corresponding output feature. Block convolution requires the img2col (image-to-column) function to convert pixel blocks into matrices. Traditional convolution calculation methods, such as block convolution based on the img2col function, need to load the complete pixels for each weight in every cycle. For example, in a 3x3 block convolution (9 weights, from weight 0 to weight 8, corresponding to the initial cycle through the eighth cycle, namely cycle 0 to cycle 8), each cycle needs to load ...
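As a rough illustration of the img2col-based convolution mentioned in the background, the following pure-Python sketch unfolds each kernel-sized patch into a matrix row and then takes a dot product with the flattened kernel. It assumes stride 1, no padding, and the multiply-accumulate form without kernel flipping that is common in machine learning; the function names and the sample data are hypothetical, not taken from the patent.

```python
def img2col(image, k):
    """Unfold every k*k patch of a 2-D image into one row of a matrix."""
    out_h = len(image) - k + 1
    out_w = len(image[0]) - k + 1
    rows = []
    for i in range(out_h):
        for j in range(out_w):
            rows.append([image[i + di][j + dj] for di in range(k) for dj in range(k)])
    return rows, out_h, out_w


def conv2d_img2col(image, kernel):
    """Convolution as a matrix-vector product over the unfolded patches."""
    k = len(kernel)
    flat_kernel = [w for row in kernel for w in row]
    patches, out_h, out_w = img2col(image, k)
    flat_out = [sum(p * w for p, w in zip(patch, flat_kernel)) for patch in patches]
    # Reshape the flat result back into an out_h x out_w feature map.
    return [flat_out[r * out_w:(r + 1) * out_w] for r in range(out_h)]


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
kernel = [[1, 0, 0],
          [0, 1, 0],
          [0, 0, 1]]
result = conv2d_img2col(image, kernel)
print(result)  # [[18, 21], [30, 33]]
```

In this example a 4x4 input and a 3x3 kernel give a 2x2 output, so each of the 9 weights is paired with a 2x2 block of input pixels, and the blocks for neighbouring weights share most of their pixels.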


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06T1/20, G06F17/15, G06N3/063
CPC: G06F17/153, G06N3/063, G06T1/20
Inventor: Not disclosed (inventor name withheld from publication)
Owner: SHANGHAI BIREN TECH CO LTD