Unlock instant, AI-driven research and patent intelligence for your innovation.

Neural network model compression method and device, storage medium and chip

A neural network model and neural network technology, applied in the field of artificial intelligence, can solve the problems of time-consuming, degraded user experience, and the difficulty of neural network models to achieve satisfactory results, so as to reduce the impact of accuracy and improve accuracy.

Pending Publication Date: 2021-03-05
HUAWEI TECH CO LTD
View PDF0 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Neural network model compression technology needs to provide a large amount of training data to make the network converge to a better result. However, it is very time-consuming for users to upload large amounts of data to the cloud, which will cause a decline in user experience
Some neural network model compression techniques only use a small amount of training data for model compression, but the compressed neural network model is difficult to achieve satisfactory results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network model compression method and device, storage medium and chip
  • Neural network model compression method and device, storage medium and chip
  • Neural network model compression method and device, storage medium and chip

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The technical solution in this application will be described below with reference to the accompanying drawings.

[0042] Since the embodiment of the present application involves the application of a large number of neural network models, for ease of understanding, the following first introduces related terms and neural network models and other related concepts involved in the embodiment of the present application.

[0043] (1) Neural network model

[0044] The neural network model can be composed of neural units, and the neural units can be referred to as x s and the intercept b as the input operation unit, the output of the operation unit can be:

[0045]

[0046] Wherein, s=1, 2, ... n, n is a natural number greater than 1, W s for x s The weight of , b is the bias of the neuron unit. f is the activation function of the neural unit, which is used to introduce nonlinear characteristics into the neural network model to convert the input signal in the neural unit ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a neural network model compression method in the field of artificial intelligence. The method comprises the steps that a server acquires a first neural network model uploaded byuser equipment and training data of a first neural network; obtaining a PU classifier according to the training data of the first neural network and the unmarked data stored in the server; using a PUclassifier to select extended data from the unmarked data stored in the server, the extended data having attributes and distribution similar to those of the training data of the first neural networkmodel; and training a second neural network model by utilizing a knowledge distillation KD method according to the extended data, taking the first neural network model as a teacher network model, andtaking the second neural network model as a student network model. And a PU classifier is adopted to select data with attributes and distribution similar to those of the training data of the first neural network model from the unmarked data, so that the compression accuracy of the neural network model is improved, and the transmission of a large amount of positive sample data is avoided.

Description

technical field [0001] The present application relates to the field of artificial intelligence, in particular to a method and device for compressing neural network models. Background technique [0002] Artificial intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. In other words, artificial intelligence is the branch of computer science that attempts to understand the nature of intelligence and produce a new class of intelligent machines that respond in ways similar to human intelligence. Artificial intelligence is to study the design principles and implementation methods of various intelligent machines, so that the machines have the functions of perception, reasoning and decision-making [0003] Computer vision is an integral part of various i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06K9/62G06V10/82
CPCG06N3/063G06F18/241G06F18/253G06F18/214G06N3/084G06V10/82G06V10/454G06N3/047G06N3/045G06N3/08G06F18/2415
Inventor 许奕星陈汉亭韩凯王云鹤许春景
Owner HUAWEI TECH CO LTD