Convolution calculation acceleration method and device

A convolution and convolution-kernel technology applied in the computer field. It addresses low computational efficiency and wasted energy and area, and it avoids energy waste, improves hardware utilization, and improves the computational efficiency of depth-first convolution.

Active Publication Date: 2019-08-23
湃方科技(天津)有限责任公司
Cites: 6; Cited by: 1

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a convolution calculation acceleration method and device. They address the problem that, when an existing convolution calculation accelerator computes depth-first convolution, only one line of multiplier-adders is operating, which wastes energy and area and results in low calculation efficiency.

Examples

Embodiment Construction

[0047] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0048] When an existing convolution calculation accelerator computes depth-first convolution, only one line of multiply-adders along the vertical direction is operating, and the remaining lines of multiply-adders are idle, which not only wastes energy and area, but also affects To solve the problem o...

Abstract

An embodiment of the invention provides a convolution calculation acceleration method and device. The method comprises the following steps: obtaining the current depth time sequence i when the convolution type is depth-first convolution; inputting the column pixels of the convolution kernel corresponding to the current depth time sequence i, together with the row pixels of the feature map, into a convolution calculation unit, where the convolution calculation unit is an array composed of multipliers and adders; if i mod K equals 0, translating the first row of pixels of the convolution kernel to the last row, where K is the number of rows and columns of the convolution kernel; and if i mod K equals 0 and i is greater than or equal to K^2, taking the feature in row ((i - K^2)/K + 1) mod K of the convolution calculation unit as the output feature of row (i - K^2)/K + 1 of the depth-first convolution. The method and device effectively improve the hardware utilization of the convolution calculation unit, avoid the energy waste caused by a large number of idle multipliers, and improve the efficiency of depth-first convolution calculation.
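The control conditions in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the function names `schedule_step` and `rotate_kernel_rows` are hypothetical, the expression is read as ((i - K^2)/K + 1) mod K with integer division, and the starting value of i is not stated in the abstract, so the behavior for small i is an assumption.

```python
from collections import deque


def schedule_step(i, K):
    """Evaluate the abstract's two conditions for depth time sequence i.

    Returns (rotate, unit_row, output_row):
      rotate     - True when i mod K == 0 (first kernel row moves to the last row)
      unit_row   - row of the calculation unit to read, or None if no output yet
      output_row - row of the depth-first convolution output, or None
    Assumption: the division by K is exact here because i mod K == 0.
    """
    rotate = (i % K == 0)
    if rotate and i >= K * K:
        output_row = (i - K * K) // K + 1
        unit_row = output_row % K
        return rotate, unit_row, output_row
    return rotate, None, None


def rotate_kernel_rows(kernel_rows):
    """Translate the first row of the kernel to the last row."""
    rows = deque(kernel_rows)
    rows.rotate(-1)  # shift every row up by one; the first row wraps to the end
    return list(rows)


if __name__ == "__main__":
    K = 3
    for i in range(8, 16):
        print(i, schedule_step(i, K))
    print(rotate_kernel_rows([[1, 2, 3], [4, 5, 6], [7, 8, 9]]))
```

With K = 3, the first output condition is met at i = 9, and further outputs follow every K steps; between those steps the function reports neither a rotation nor an output.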

Description

Technical field

[0001] The invention relates to the field of computer technology, in particular to a convolution calculation acceleration method and device.

Background

[0002] With the development of artificial intelligence technology, the Convolutional Neural Network (CNN) has come into wide use. Current mainstream convolutional neural network models have complex structures and involve large amounts of calculation data. Conventional CPUs (Central Processing Units) and GPUs (Graphics Processing Units) struggle to meet the power-consumption limits and calculation-speed requirements of convolution calculation.

[0003] A common solution is to introduce heterogeneous computing technology, which combines the general-purpose computing power of the CPU with the high performance of a convolution calculation accelerator to achieve efficient, low-power convolutional neural network computing. Existing convolution compu...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06T1/20, G06N3/063
CPC: G06T1/20, G06N3/063, Y02D10/00
Inventor: 苏放, 刘勇攀, 田辉, 武通达, 李金阳, 马君
Owner 湃方科技(天津)有限责任公司