Unlock instant, AI-driven research and patent intelligence for your innovation.

Acceleration method of convolutional neural network, computer readable storage medium and application of method

A convolutional neural network and convolution technology, applied in the field of neural network convolution, can solve problems such as low computing efficiency and large CPU resource consumption, and achieve the effect of improving convolution efficiency, reducing reading time, and shortening required time.

Active Publication Date: 2020-05-19
FUJIAN STAR NET EVIDEO INFORMATION SYST CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] To this end, it is necessary to provide an acceleration method for convolutional neural networks, which is used to solve the existing technical problems of large CPU resource occupation and low computational efficiency when processing convolution calculations through CPUs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Acceleration method of convolutional neural network, computer readable storage medium and application of method
  • Acceleration method of convolutional neural network, computer readable storage medium and application of method
  • Acceleration method of convolutional neural network, computer readable storage medium and application of method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to explain in detail the technical content, structural features, achieved goals and effects of the technical solution, the following will be described in detail in conjunction with specific embodiments and accompanying drawings.

[0034] Noun description:

[0035] ROUNDUP8: It is an upward rounding operation. For example, ROUNDUP8(X) means that X is rounded up by 8;

[0036] Register: refers to the SIMD register of NEON;

[0037] Calculation: refers to 32-bit single-precision floating-point calculation; a SIMD register can store 4 single-precision floating-point numbers;

[0038] IDX / ri: means that IDX divides ri and rounds to an integer;

[0039] IDX% ri: means that IDX performs remainder operation on ri.

[0040] see figure 1 , the present embodiment provides a method for accelerating a convolutional neural network, the method for accelerating a convolutional neural network includes steps:

[0041] S101. Calculate the sorting of the input / output data of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an acceleration method of a convolutional neural network, a computer readable storage medium and an application of the method. The acceleration method of the convolutional neural network comprises the following steps: calculating the sequence of input / output data of convolution, and enabling the input data to be continuously stored in a memory according to an access sequence during convolution; sorting the convolved coefficient data to enable the coefficient data to be continuously stored in a memory according to an access sequence during convolution; dividing the matrix into more than two sub-matrixes; and distributing SIMD registers for the input data, the coefficient data and the output data of a submatrix multiplication operation to carry out convolution calculation, and enabling the number numreg of the SIMD registers required by the convolution of the sub-matrixes to meet the equation that numreg = ri + ct + ri * sc, wherein numreg is smaller than or equalto the maximum value of the number tn of the SIMD registers of a CPU processor. The input data is continuously stored in a memory, and the matrix is divided into the sub-matrixes, so that the SIMD register is supplemented and fully used for convolution of the sub-matrixes, and the convolution efficiency of the CPU is improved.

Description

technical field [0001] The present invention relates to the technical field of neural network convolution, in particular to an acceleration method for convolutional neural network, a computer-readable storage medium and an application thereof. Background technique [0002] Convolution calculations are applied in various fields of computer vision. As the complexity of deep convolutional networks continues to increase, the amount of calculations also increases, resulting in very high requirements for hardware computing capabilities in deep convolutional neural networks. However, the computing power of embedded devices is limited, and it is difficult to achieve the computing effect of general-purpose computers. [0003] With the development of embedded CPU, ARM's CPU has occupied a huge market share, among which contex-A series processors have been widely used in various intelligent devices such as smart phones, set-top boxes and smart TVs. The 128-bit SIMD (Single Instruction...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/063G06F9/38G06K9/00
CPCG06N3/063G06F9/3887G06V40/10G06N3/045Y02D10/00
Inventor 许勇刘灵辉郑维宏
Owner FUJIAN STAR NET EVIDEO INFORMATION SYST CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More