Model compression method and device

A technology of compression method and compression algorithm, which is applied in the field of data processing, can solve the problems of affecting compression efficiency, difficult to support the deployment of high-performance parameter network models, and low processing capacity of processing equipment.

Active Publication Date: 2019-08-23
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Or, another part of the reason is that some processing devices that need to deploy network models do not have high processing capabilities, and it is difficult to support the deployment of network models with high-performance parameters. Therefore, it is necessary to compress such network models without losing too many performance parameters. Under the condition that it can be deployed in processing equipment with low proce...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model compression method and device
  • Model compression method and device
  • Model compression method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only It is a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.

[0040] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of this application and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It is to be understood that the data so used may be inte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a model compression method and device. The method comprises the steps of obtaining a to-be-compressed model and compression preference configuration for the to-be-compressed model; determining a compression algorithm component and a corresponding algorithm hyper-parameter value according to the model type and the compression preference configuration of a model to be compressed, and using the compression algorithm component and the algorithm hyper-parameter value to carry out first compression on the to-be-compressed model to obtain a candidate compression result corresponding to the first compression; if the coincidence degree of the performance parameter of the candidate compression result corresponding to the first compression and the compressionpreference configuration does not meet the preset condition, executing the second compression; and continuing to generate a parameter adjustment strategy to adjust the compression algorithm componentand the algorithm hyper-parameter value used by the next compression until the coincidence degree of the performance parameter of the candidate compression result corresponding to the certain compression and the compression preference configuration meets a preset condition. A compression algorithm does not need to be adjusted manually, so that the influence caused by human experience is avoided,and the compression efficiency is improved.

Description

technical field [0001] The present application relates to the field of data processing, and in particular, to a model compression method and device. Background technique [0002] The network model can be deployed on different types of processing devices, and the processing device can implement specific functions, such as image recognition, data classification, etc., through the deployed network model. [0003] In some cases, however, the network model needs to be compressed prior to deployment in processing devices. Part of the reason is that the performance parameters of some network models are not good. For example, the model occupies a large space, the computing performance is not high, and the running speed is low. Such network models are generally network models designed based on human experience, or those with insufficient development experience. The network model developed by the developer. When this kind of network model is deployed to processing equipment, it not ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/08
CPCG06N3/082
Inventor 侯金龙黄俊洲吴家祥张尧
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products