Unlock instant, AI-driven research and patent intelligence for your innovation.

Convolutional neural network operation method and device

A technology of convolutional neural network and operation method, which is applied in the field of convolutional neural network operation method and device, can solve problems such as low reuse rate of intermediate activation values, waste of resources, occupying video memory, etc., to reduce video memory occupancy rate and improve resources Effects of Utilization and Operating Speed

Active Publication Date: 2020-12-04
INSPUR SUZHOU INTELLIGENT TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

ResNet50 is composed of multiple modules, and the activation value calculated in the middle will be saved in its forward calculation, which takes up a lot of video memory, and most of the activation value will not be used in the subsequent calculation process, thus causing a waste of resources
[0004] There is currently no effective solution to the problems of low reuse rate of intermediate activation values ​​of convolutional neural networks, occupying video memory, and slowing down the running speed in the prior art.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network operation method and device
  • Convolutional neural network operation method and device
  • Convolutional neural network operation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0042] It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are to distinguish two entities with the same name but different parameters or parameters that are not the same, see "first" and "second" It is only for the convenience of expression, and should not be construed as a limitation on the embodiments of the present invention, which will not be described one by one in the subsequent embodiments.

[0043] Based on the above purpose, the first aspect of the embodiments of the present invention proposes an embodiment of a method for operating a convolutional neural network that reduces the video memory usage of the convolutional neural network du...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a convolutional neural network operation method and device, and the method comprises the steps: sequentially reading a first activation value, a second activation value and a third activation value of a convolution module from an activation value storage, using the first convolution kernel to convolve the first activation value in the temporary value memory to obtain a second activation value, using the second convolution kernel to convolve the second activation value to obtain a third activation value, using the third convolution kernel to convolve the third activationvalue to obtain a fourth activation value, and sequentially covering and writing the fourth activation value in the activation value memory; reading the first activation value and the fourth activation value from the activation value memory, superposing the first activation value and the fourth activation value in the temporary value memory based on linear correction to obtain a fifth activationvalue with a residual error, and overwriting the fifth activation value into the activation value memory. According to the invention, the video memory occupancy rate of the convolutional neural network in operation can be reduced, and the resource utilization rate and the operation speed are improved.

Description

technical field [0001] The present invention relates to the field of neural networks, and more specifically, to a method and device for operating a convolutional neural network. Background technique [0002] In recent years, deep learning networks have developed in a deeper and larger direction. The ResNet series of networks have proved that deeper networks can fit more data, but deep networks impose requirements on GPU memory. The maximum video memory supported by a modern GPU (graphics processing unit) is only 32G. Due to the high price of the GPU, expanding the number of GPUs will put a high pressure on the cost of the enterprise. Therefore, it is particularly important to save the GPU video memory occupied during the calculation process. [0003] ResNet50 is a 50-layer convolutional neural network that is widely used in image classification, object detection and other tasks in computer vision. ResNet50 is composed of multiple modules. In its forward calculation, the act...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 王萌
Owner INSPUR SUZHOU INTELLIGENT TECH CO LTD