Unlock instant, AI-driven research and patent intelligence for your innovation.

Device and method for performing pooling operation

A computing module and gradient vector technology, applied in the field of artificial neural network, can solve the problem of off-chip bandwidth performance bottleneck, high power consumption overhead, and no multi-layer artificial neural network operation.

Active Publication Date: 2017-11-07
CAMBRICON TECH CO LTD
View PDF5 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the GPU is a device specially used to perform graphics and image calculations and scientific calculations, without special support for multi-layer artificial neural network operations, it still requires a lot of front-end decoding work to perform multi-layer artificial neural network operations, which brings a lot of problems. additional cost
In addition, the GPU only has a small on-chip cache, and the model data (weights) of the multi-layer artificial neural network need to be repeatedly moved from off-chip, and the off-chip bandwidth has become the main performance bottleneck.
In addition, the GPU has only a small on-chip cache, and the model data (weights) of the multi-layer artificial neural network need to be repeatedly moved from off-chip. The off-chip bandwidth has become the main performance bottleneck, and it has brought huge power consumption overhead.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Device and method for performing pooling operation
  • Device and method for performing pooling operation
  • Device and method for performing pooling operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The artificial neural network based on the pooling computing device of the present invention includes multiple neurons in two or more layers. For maxpooling, in the forward operation, compare each input vector in turn in the pooling kernel, take the maximum value, and get the output vector. If reverse training is required, save the corresponding index vector index at the same time; slide the pooling kernel, and do the above in a loop Computational operations until the end of the pooling operation of this layer. During reverse training, the input gradient vector is output to the corresponding storage location according to the index vector index saved during the forward operation, and the output gradient vector is obtained; for avgpooling, each input vector is accumulated in the pooling kernel during the forward operation ; Then multiply by 1 / kernel_size to get the output vector, kernel_size represents the size of the pooling core; slide the pooling core, and cycle throug...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a device for performing pooling operation. The device comprises an instruction storage unit, a controller unit, a data access unit and an operation module. The instruction storage unit reads in instructions through the data access unit and buffers the read instructions; the controller unit reads the instructions from the instruction storage unit and decodes the instructions into control signals for controlling the behavior of an operation module and then distributes the control signals to the operation module; the data access unit is used for accessing external address space and completing data loading and storing; the operation module is used for completing maximum value seeking operation of the maxpooling operation or accumulating and multiplying operation of the avgpooling operation. For the maxpooling, in the forward operation, the operation module circularly reads input vectors of the poling kernel and performs comparison of size to obtain new output vectors of the kernel and save an index vector corresponding to each output vector at the same time until the pooling operation of the layer is completed. The invention can solve the problems of insufficient computing performance of CPU and GPU and high front-end decoding cost.

Description

technical field [0001] The present invention relates to an artificial neural network, in particular to a device and method for performing pooling operations. Background technique [0002] Multi-layer artificial neural networks are widely used in the fields of pattern recognition, image processing, function approximation, and optimization calculations. In recent years, multi-layer artificial networks have been favored by academic circles and researchers due to their high recognition accuracy and good parallelism. industry is getting more and more attention. [0003] The pooling operation refers to the downsampling operation of local features in the feature layer of the neural network to reduce the dimension of the feature layer. There are two types of pooling operations: maxpooling refers to taking the maximum value as the result in the kernel; avgpooling refers to taking the average value as the result in the kernel. The kernel here is the pooling core, the size is specifi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/30G06N3/08
CPCG06N3/063G06N3/084G06N3/045
Inventor 刘少礼宋琎陈云霁陈天石
Owner CAMBRICON TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More