Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Neural network cutting method and device, equipment and storage medium

A neural network and cutting device technology, applied in the field of neural network cutting methods, devices, equipment and storage media, can solve problems such as damage to the structure, the hardware platform is difficult to support the network, etc.

Pending Publication Date: 2022-01-21
IFLYTEK CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, the present application provides a neural network clipping method, device, device and storage medium to solve the problem that the existing neural network clipping method destroys the structure of the network parameters of the original neural network, causing most hardware platforms to It is difficult to support the problem of the tailored network, and the technical solution is as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network cutting method and device, equipment and storage medium
  • Neural network cutting method and device, equipment and storage medium
  • Neural network cutting method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0051] see figure 1 , which shows a schematic flow chart of the neural network clipping method provided by the embodiment of the present application, the method may include:

[0052] Step S101: Determine the correlation among convolution kernel parameters of the convolution kernel to be tailored in the neural network to be tailored.

[0053] Specifically, the correlation between the convolution kernel parameters of the convolution kernels to be tailored can be determined by processing the convolution kernel parameters of the convolution kernels to be tailored in the neural network to be tailored.

[0054] Step S102: Determine a clipping mask corresponding to the convolution kernel to be clipped according to the correlation between convolution kernel parameters to be clipped.

[0055] Wherein, the clipping mask corresponding to the convolution kernel to be clipped can represent the importance of each convolution kernel parameter of the convolution kernel to be clipped.

[005...

no. 2 example

[0069] Through the neural network clipping method provided in the above-mentioned embodiments, it can be known that which convolution kernel parameters of the convolution kernel to be clipped are clipped depends on the clipping mask corresponding to the convolution kernel to be clipped. It can be seen that the clipping mask corresponding to the convolution kernel to be clipped The determination of is very important, and this embodiment focuses on the specific implementation process of determining the clipping mask corresponding to the convolution kernel to be clipped.

[0070] The above-mentioned embodiment mentioned that the clipping mask corresponding to the convolution kernel to be clipped is determined according to the correlation between the convolution kernel parameters of the convolution kernel to be clipped, and the specific method of determining the clipping mask corresponding to the convolution kernel to be clipped is introduced Before, the implementation process of d...

no. 3 example

[0110] The embodiment of the present application also provides a neural network clipping device. The following describes the neural network clipping device provided in the embodiment of the present application. The neural network clipping device described below and the neural network clipping method described above can be referred to in correspondence.

[0111] see Figure 5 , shows a schematic structural diagram of a neural network clipping device provided in an embodiment of the present application, which may include: a convolution kernel parameter correlation determination module 501 , a clipping mask determination module 502 , and a convolution kernel clipping module 503 .

[0112] The convolution kernel parameter correlation determination module 501 is used to determine the correlation between the convolution kernel parameters of the convolution kernel to be trimmed in the neural network to be trimmed.

[0113] The clipping mask determination module 502 is configured to d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a neural network cutting method and device, equipment and a storage medium, and the method comprises the steps: determining the correlation between convolution kernel parameters of a to-be-cut convolution kernel in a to-be-cut neural network; determining a clipping mask corresponding to the to-be-clipped convolution kernel according to the correlation between the convolution kernel parameters of the to-be-clipped convolution kernel; and according to the cutting mask corresponding to the to-be-cut convolution kernel, cutting the convolution kernel parameter of the to-be-cut convolution kernel. The cutting method provided by the invention does not destroy the structured information of the network parameters of the original neural network, so that the neural network cut by the cutting method provided by the invention can be deployed in most hardware platforms without being supported by additional special hardware, and in addition, the convolution kernel parameters of the to-be-clipped convolution kernel are clipped according to the clipping mask corresponding to the to-be-clipped convolution kernel instead of directly clipping the whole convolution kernel or the whole channel, so that refined clipping of the neural network is realized.

Description

technical field [0001] The present application relates to the technical field of network pruning, and in particular to a neural network pruning method, device, equipment and storage medium. Background technique [0002] In recent years, deep learning technology has achieved remarkable results in image recognition, speech recognition, natural language processing and other fields. However, due to the huge amount of parameters of the deep neural network, it is difficult to use it in many scenarios, such as embedded systems and mobile phones. In order to solve this problem, currently popular methods include network model search, network model quantization, and network model clipping. Among them, network model clipping is widely used because of its simplicity, reliability, and maturity. [0003] The current network model pruning is mainly weight pruning. Weight pruning is a refined pruning method, which prunes the independent parameters whose median value of the network paramete...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08
CPCG06N3/082
Inventor 郭智豪常欢吴嘉嘉谢名亮殷兵
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products