Neural network acceleration method and device based on pulsation array, computer equipment and storage medium

A neural network and systolic array technology, applied in the field of neural networks, can solve the problem of wasting systolic array computing and scheduling resources, etc.

Pending Publication Date: 2019-08-16
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present application provides a systolic array-based neural network acceleration method, device, computer equipment, and storage medium

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network acceleration method and device based on pulsation array, computer equipment and storage medium
  • Neural network acceleration method and device based on pulsation array, computer equipment and storage medium
  • Neural network acceleration method and device based on pulsation array, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0043] The flow charts shown in the drawings are just illustrations, and do not necessarily include all contents and operations / steps, nor must they be performed in the order described. For example, some operations / steps can be decomposed, combined or partially combined, so the actual order of execution may be changed according to the actual situation. In addition, although the division of functional blocks is made in the schematic diagram of the device,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of model construction, and a filter and a feature map are segmented according to a preset rule when a convolution step length is not 1, and the convolution is enabled to be equivalent to a convolution with a step length of 1 so as to be adaptive to a pulsation array. The invention particularly discloses a neural network acceleration method and device based on a pulse array, computer equipment and a storage medium. The method comprises the steps of acquiring convolution parameters of a convolution filter; if the convolution step length is not 1 and the size ofthe convolution filter is greater than 1 * 1, segmenting a plurality of sub-filters from the convolution filter according to a preset filter segmentation rule; obtaining a to-be-convolutional featuremap and segmenting a plurality of feature sub-maps from the to-be-convolutional feature map according to a preset feature map segmentation rule; based on the pulsation array, carrying out convolutioncalculation on the characteristic sub-graphs corresponding to the sub-filters according to the sub-filters, wherein the step length of convolution calculation is 1; and superposing convolution calculation results corresponding to the sub-filters, and outputting the result of superposition as a result of convolution calculation of the to-be-convolutional feature map by using the convolution filter.

Description

technical field [0001] The present application relates to the technical field of neural networks, and in particular to a neural network acceleration method, device, computer equipment and storage medium based on a systolic array. Background technique [0002] The most important part of the commonly used neural network is the calculation of convolution. In the convolution calculation, the convolution filter is often not equal to 1. In this case, some mainstream neural network computing libraries, such as CUDNN (NVIDIA The deep network computing library) will be significantly slower when computing this convolution. Some deep learning accelerators such as Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA), dedicated network processors (network processunits, NPU), etc. are usually implemented with a systolic array structure in the convolution part. The case where the convolution filter of the filter is not equal to 1 is very unfriendly. [0003] The existing t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/063
CPCG06N3/063G06N3/045
Inventor 郭跃超高鹏谢国彤唐义君张萌
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products