Neural network sparsification device and method and corresponding product

A sparsification technique applied to the sparse training of neural network models. It addresses the high input/output overhead, the many restrictions, and the hardware-unfriendly memory access of existing methods, with the effect of improving accuracy and reducing input/output overhead.

Pending Publication Date: 2022-05-06

ANHUI CAMBRICON INFORMATION TECH CO LTD

Problems solved by technology

Existing fine-grained parameter sparsification methods preserve model accuracy well, but they are unfriendly to hardware memory access: the on-chip and off-chip input/output overhead is large, and performance is low. Structured sparsification methods based on channels and convolution kernels improve hardware performance, but at a large loss of model accuracy. Finally, most existing sparsification algorithms rely on offline fine-tuning, in which a pre-trained model is sparsified and then fine-tuned. Offline fine-tuning has many restrictions and cannot deliver more substantial performance gains during model training.
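The contrast between fine-grained and channel-structured sparsity can be illustrated with a small NumPy sketch. The function names, the magnitude-based selection rule, and the 8x16 weight matrix are assumptions made for illustration; they are not taken from the patent text.

```python
import numpy as np

def fine_grained_mask(w, sparsity):
    """Keep the largest-magnitude individual weights (unstructured)."""
    k = int(round(w.size * (1.0 - sparsity)))      # number of weights kept
    keep = np.argsort(np.abs(w).ravel())[::-1][:k]
    mask = np.zeros(w.size, dtype=w.dtype)
    mask[keep] = 1
    return mask.reshape(w.shape)

def channel_mask(w, sparsity):
    """Keep whole output channels (rows) with the largest L1 norm."""
    n_keep = int(round(w.shape[0] * (1.0 - sparsity)))
    keep = np.argsort(np.abs(w).sum(axis=1))[::-1][:n_keep]
    mask = np.zeros_like(w)
    mask[keep, :] = 1
    return mask

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 16))       # hypothetical layer: 8 channels x 16 inputs
m_fine = fine_grained_mask(w, 0.5)     # scattered zeros: irregular memory access
m_chan = channel_mask(w, 0.5)          # whole rows zeroed: regular, but coarser
```

Both masks remove half of the weights. The fine-grained mask can keep any individual weight, while the channel mask must drop entire rows, which is the accuracy-versus-hardware trade-off described above.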

Examples

Embodiment 1301

[0106] Embodiment 1301 has only a mask adjustment stage. The parameter initial value W0 and the mask tensor initial value M0 are both randomly generated by the random generation module 61, or the mask tensor initial value M0 is determined from the parameter initial value W0. The parameters are trained and the mask matrix is updated at the same time, yielding the trained parameters Wf and the updated mask tensor Mf.
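A mask adjustment stage of this kind might look like the following sketch on a toy least-squares problem. The excerpt does not say how the mask is updated, so the rule used here (keep the largest-magnitude parameters after every step, with dense parameter updates so that pruned weights can regrow) is an assumption, as are the names `topk_mask` and `mask_adjustment_stage` and the toy data.

```python
import numpy as np

def topk_mask(w, keep):
    """Binary mask selecting the `keep` largest-magnitude entries (assumed rule)."""
    mask = np.zeros_like(w)
    mask[np.argsort(np.abs(w))[::-1][:keep]] = 1.0
    return mask

def mask_adjustment_stage(w, mask, X, y, keep, steps=300, lr=0.1):
    """Train the parameters and update the mask in the same loop."""
    for _ in range(steps):
        residual = X @ (w * mask) - y      # forward pass uses masked parameters
        grad = X.T @ residual / len(y)     # gradient of 0.5 * mean squared error
        w = w - lr * grad                  # dense update, so pruned weights can regrow
        mask = topk_mask(w, keep)          # mask follows the updated parameters
    return w, mask

def loss(w, mask, X, y):
    return 0.5 * np.mean((X @ (w * mask) - y) ** 2)

rng = np.random.default_rng(0)
X = rng.standard_normal((256, 16))         # hypothetical toy inputs
w_true = np.zeros(16)
w_true[[1, 5, 9, 13]] = [2.0, -3.0, 1.5, 2.5]
y = X @ w_true

W0 = 0.1 * rng.standard_normal(16)         # random initial parameters
M0 = topk_mask(W0, keep=4)                 # M0 derived from W0 (it could also be random)
Wf, Mf = mask_adjustment_stage(W0, M0, X, y, keep=4)
```

The stage returns both outputs named in the embodiment: the trained parameters `Wf` and the updated mask `Mf`.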

Embodiment 1302

[0107] Embodiment 1302 has only an unmasked stage and a mask adjustment stage. In the unmasked stage, only the parameters are trained: the parameter initial value W0 is randomly generated by the random generation module 61, and the updated parameters W1 are obtained after training. In the mask adjustment stage, the parameters are trained and the mask matrix is updated at the same time. The initial parameter value for this stage is the updated parameters W1, and the mask tensor initial value M0 is either randomly generated by the random generation module 61 or derived from the updated parameters W1. The stage yields the trained parameters Wf and the updated mask tensor Mf.
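The two ways of obtaining M0 for the mask adjustment stage can be sketched as follows. The excerpt does not state how M0 is derived from W1, so magnitude-based selection is an assumption; `random_mask`, `magnitude_mask`, and the stand-in tensor `W1` are hypothetical names and data.

```python
import numpy as np

def random_mask(shape, keep, rng):
    """Random M0: choose `keep` positions uniformly at random."""
    mask = np.zeros(int(np.prod(shape)))
    mask[rng.choice(mask.size, size=keep, replace=False)] = 1.0
    return mask.reshape(shape)

def magnitude_mask(w, keep):
    """M0 derived from W1: keep the `keep` largest-magnitude parameters (assumed rule)."""
    mask = np.zeros(w.size)
    mask[np.argsort(np.abs(w).ravel())[::-1][:keep]] = 1.0
    return mask.reshape(w.shape)

rng = np.random.default_rng(0)
W1 = rng.standard_normal((4, 8))     # stand-in for parameters after the unmasked stage
M0_random = random_mask(W1.shape, keep=8, rng=rng)
M0_from_W1 = magnitude_mask(W1, keep=8)
```

Either mask has the same shape as `W1` and the same number of kept entries; only the derived variant uses information learned during the unmasked stage.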

Embodiment 1303

[0108] Embodiment 1303 has only a mask adjustment stage and a mask fixing stage. In the mask adjustment stage, the parameter initial value W0 and the mask tensor initial value M0 are both randomly generated by the random generation module 61, or the mask tensor initial value M0 is determined from the parameter initial value W0. The parameters are trained and the mask matrix is updated at the same time, yielding the updated parameters W1 and the updated mask tensor Mf. In the mask fixing stage, training continues with the parameters masked by the updated mask tensor Mf. The initial parameter value for this stage is the updated parameters W1, and the stage yields the trained parameters Wf.
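A mask fixing stage could be sketched as below: the mask is frozen and only the parameters it keeps continue to train. Whether masked parameters still receive updates is not stated in the excerpt, so freezing them is an assumption, and `mask_fixing_stage`, the toy data, and the stand-in values for `W1` and `Mf` are hypothetical.

```python
import numpy as np

def mask_fixing_stage(w, mask, X, y, steps=200, lr=0.1):
    """Continue training with a frozen mask; only unmasked parameters change."""
    for _ in range(steps):
        residual = X @ (w * mask) - y      # forward pass uses masked parameters
        grad = X.T @ residual / len(y)     # gradient of 0.5 * mean squared error
        w = w - lr * grad * mask           # masked positions receive no update (assumption)
    return w

rng = np.random.default_rng(0)
X = rng.standard_normal((128, 8))          # hypothetical toy inputs
w_true = np.array([1.0, 0.0, -2.0, 0.0, 0.0, 3.0, 0.0, 0.0])
y = X @ w_true

W1 = 0.1 * rng.standard_normal(8)                          # stand-in for W1 from mask adjustment
Mf = np.array([1.0, 0.0, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0])   # stand-in for Mf from mask adjustment
Wf = mask_fixing_stage(W1, Mf, X, y)
```

Because `Mf` never changes in this stage, the sparsity pattern seen by the hardware stays constant while the kept parameters are refined.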

Abstract

The invention relates to a device, a board card, a method, and a readable storage medium for sparse training of a neural network model. The processing device is included in an integrated circuit device, which comprises a universal interconnection interface and a computing device. The computing device interacts with the processing device to jointly complete the computing operation specified by the user. The integrated circuit device may further comprise a storage device, which is connected to the computing device and the processing device and stores data for both.

Description

Technical field

[0001] The present disclosure relates generally to the field of neural networks. More specifically, the present disclosure relates to a device, a board card, a method, and a readable storage medium for sparse training of a neural network model.

Background

[0002] In recent years, with the rapid development of deep learning, the performance of algorithms in fields such as computer vision and natural language processing has advanced by leaps and bounds. However, deep learning algorithms are computing-intensive and storage-intensive. As information processing tasks grow more complex and the requirements on real-time performance and accuracy continue to rise, neural networks are often designed deeper and deeper, so their computing power and storage space requirements keep increasing, making it difficult for existing artificial intelligence technology based on deep learning to be directly applied to mobile phones, sat...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06N3/08, G06N3/063

CPC: G06N3/082, G06N3/063, G06N3/084, G06N3/047, G06N3/045, G06F18/285

Inventor: name withheld at the inventor's request

Owner: ANHUI CAMBRICON INFORMATION TECH CO LTD
