Unlock instant, AI-driven research and patent intelligence for your innovation.

Convolutional neural network calculation optimization method and device, computer device and medium

A convolutional neural network and optimization method technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problem of poor acceleration performance, increase the computational load of convolutional neural networks, and reduce the efficiency of convolutional neural networks. problems, to achieve the effect of improving computing speed, improving compatibility and performance

Active Publication Date: 2019-11-12
上海燧原智能科技有限公司
View PDF15 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, based on the consideration of reducing chip area and power consumption and reducing the complexity of hardware design, the design of Tensor Core will only provide extreme acceleration for some specific convolution shapes (rows and columns of convolution kernels), and for some that do not meet Convolution of a specific shape, the acceleration performance of Tensor Core is poor
If the trained parameters of the convolutional neural network model are changed to adapt to the Tensor Core, the convolutional neural network will need to be retrained, which will increase the computational load of the convolutional neural network and reduce the efficiency of the convolutional neural network.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network calculation optimization method and device, computer device and medium
  • Convolutional neural network calculation optimization method and device, computer device and medium
  • Convolutional neural network calculation optimization method and device, computer device and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] figure 1It is a flow chart of a convolutional neural network calculation and optimization method in Embodiment 1 of the present invention. This embodiment is applicable to the case of convolution calculation for a size-optimized convolutional neural network. This method can be provided by the embodiment of the present invention The convolutional neural network calculation and optimization device can be implemented by means of software and / or hardware, and can generally be integrated into electronic devices, such as terminal devices or servers. Such as figure 1 As shown, the method of this embodiment specifically includes:

[0045] S110. Obtain the feature map of the optimized convolutional neural network to be input; wherein, the optimized convolutional neural network is based on the optimal size of the feature map and the optimal size of the convolution kernel of the local device. The convolutional neural network is adjusted, and the size of the input feature map in ...

Embodiment 2

[0067] Figure 2a It is a flow chart of a convolutional neural network calculation and optimization method in Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above-mentioned embodiments, and the steps are based on the feature map corresponding to the optimized convolutional neural network. The relationship between the optimal size and the size of the feature map to be input, determine the matching input adjustment method, specifically: if the size of the feature map to be input is greater than the optimal size of the feature map, determine the input adjustment method is input segmentation processing; the input segmentation processing is used to divide the feature map to be input into a plurality of input feature map units having the same optimal size as the feature map, which are adjacent to each other in the feature map to be input The two input feature map units are not all the same; if the size of the feature map to be input is smaller t...

Embodiment 3

[0132] image 3 It is a flow chart of a convolutional neural network calculation optimization method in Embodiment 3 of the present invention. This embodiment is applicable to the case of convolution calculation for a size-optimized convolutional neural network. This method is applied to the local device In the adapted convolutional neural network, the method can be executed by the convolutional neural network computing optimization device provided by the embodiment of the present invention, which can be implemented in the form of software and / or hardware, and can generally be integrated into electronic equipment, such as , terminal equipment or server, etc. Such as image 3 As shown, the method of this embodiment specifically includes:

[0133] S310. Obtain the feature map to be input through the first node, and determine a matching input adjustment mode based on the relationship between the optimal size of the feature map and the size of the feature map to be input, and ad...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a convolutional neural network calculation optimization method and device, a computer device and a medium. The method comprises the steps of obtaining a to-be-input feature map of an optimized convolutional neural network; determining a matched input adjustment mode according to the relationship between the optimal size of the feature map corresponding tothe optimized convolutional neural network and the size of the to-be-input feature map, and adjusting the size of the to-be-input feature map; inputting the adjusted feature map to be input into the optimized convolutional neural network to obtain an output feature map output by the optimized convolutional neural network; and determining an output adjustment mode of the output feature map according to the input adjustment mode, adjusting the output feature map, and taking the adjusted output feature map as a target output feature map result of the convolutional neural network. According to theembodiment of the invention, the convolutional neural network can adapt to the acceleration performance of hardware equipment, and the calculation speed of the convolutional neural network is improved.

Description

technical field [0001] The embodiments of the present invention relate to the field of data processing, and in particular to a convolutional neural network calculation optimization method, device, computer equipment and media. Background technique [0002] With the rapid development of big data, the demand for neural network models is increasing. Usually, the neural network model needs to be trained by sample data to form a model with preset functions, so as to perform data processing operations such as prediction of unknown samples. [0003] At present, as the amount of data increases, the computational load of the neural network model becomes larger and larger, resulting in low computational efficiency of the neural network model. In order to solve the above-mentioned problems, an existing method is to use hardware to accelerate. For example, in a convolutional neural network model, hardware circuits, such as Tensor Core, can be designed for convolution to speed up the o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 王浩杨宏璋
Owner 上海燧原智能科技有限公司