Supercharge Your Innovation With Domain-Expert AI Agents!

A neural network model compression method and device

A neural network model and neural network technology, applied in the field of neural network model compression methods and devices, can solve problems such as use requirements that cannot achieve accuracy

Active Publication Date: 2021-05-14
GUOXIN YOUE DATA CO LTD
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the above-mentioned methods directly perform model compression on the model to be compressed, and perform model compression on the premise of sacrificing the accuracy of the model, which often cannot meet the use requirements for accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A neural network model compression method and device
  • A neural network model compression method and device
  • A neural network model compression method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is a part of embodiments of the present invention, but not all embodiments. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations. Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a method and device for compressing a neural network model, wherein the method includes: inputting training data into a neural network model to be compressed and a target neural network model; extracting feature vectors and classifying the training data based on the neural network model to be compressed As a result, the target neural network model is trained to obtain a compressed neural network model; wherein, the number of parameters of the target neural network model is less than the number of parameters of the neural network model to be compressed. The embodiment of the present invention guides the target neural network model to train based on the feature vectors and classification results extracted from the training data by the neural network model to be compressed, and the finally obtained compressed neural network model and the classification result of the same training data by the neural network model to be compressed are In the same way, there will be no loss of accuracy in the process of model compression, and the size of the model can be compressed under the premise of ensuring accuracy to meet the dual requirements for accuracy and model size.

Description

technical field [0001] The present invention relates to the technical field of machine learning, in particular to a neural network model compression method and device. Background technique [0002] With the rapid development of neural networks in the fields of image, voice, and text, it has promoted the landing of a series of intelligent products. In order to allow the neural network to better learn the characteristics of the training data to improve the model effect, the parameters used to represent the neural network model increase rapidly, and the number of layers of the neural network continues to increase, resulting in a large number of parameters in the deep neural network model. Model training and application Insufficient amount of calculation in the process; this leads to the fact that most of the products based on neural network rely on the drive of server-side computing power, and are very dependent on a good operating environment and network environment, which lim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/08G06K9/62
CPCG06N3/08
Inventor 孙源良王亚松刘萌樊雨茂
Owner GUOXIN YOUE DATA CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More