Model quantification method and device and terminal equipment

A quantification method and model technology, applied in the direction of neural learning method, biological neural network model, character and pattern recognition, etc., can solve the problems of large precision loss and increase calculation error, and achieve the effect of reducing calculation error and improving accuracy

Pending Publication Date: 2022-02-18
SHENZHEN INTELLIFUSION TECHNOLOGIES CO LTD +1
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, compared with the deep learning model before quantization, the quantization model usually introduces a large precision loss and increases the calculation error

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model quantification method and device and terminal equipment
  • Model quantification method and device and terminal equipment
  • Model quantification method and device and terminal equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In the following description, specific details such as specific system structures and technologies are presented for the purpose of illustration rather than limitation, so as to thoroughly understand the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.

[0026] Before explaining the embodiments of the present application, some terms in the embodiments of the present application are briefly introduced.

[0027] The embodiments of the present application will be described in detail below.

[0028] The model quantization method provided in the embodiment of the present application may be applied to a terminal device.

[0029] Exemplar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is suitable for the technical field of model quantification, and provides a model quantification method and device, terminal equipment and a storage medium, and the model quantification method comprises the steps: processing input data through a floating point model, and obtaining target output; for each to-be-quantized layer, performing quantization processing on the corresponding to-be-quantized layer according to the quantization function of the to-be-quantized node in the corresponding to-be-quantized layer to obtain a quantization layer corresponding to the corresponding to-be-quantized layer; processing the first input of the corresponding to-be-quantized layer through the quantization layer corresponding to the corresponding to-be-quantized layer to obtain a second output; according to the second output and the first output, making a quantization function corresponding to the corresponding to-be-quantized layer optimized, and obtaining a target quantization function corresponding to the corresponding to-be-quantized layer; and quantizing the floating point model according to the target quantization function corresponding to each to-be-quantized layer to obtain a target quantization model. Through the method, the precision of the quantization model can be improved.

Description

technical field [0001] The present application belongs to the technical field of model quantization, and in particular relates to a model quantization method, device, terminal equipment, and computer-readable storage medium. Background technique [0002] Artificial intelligence technology has developed rapidly in recent years and has continuously penetrated into various application fields represented by computer vision, natural language processing, and speech recognition. However, in practical application scenarios, the huge amount of data and computational complexity of the deep learning model pose a huge challenge to the computing power of the hardware. Therefore, quantization methods for deep learning models have also been derived. Quantization technology can reduce the memory usage of neural network models, improve data throughput, and reduce inference latency. However, compared with the deep learning model before quantization, the quantization model usually introduces...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08G06K9/62G06V10/764G06V10/82
CPCG06N3/084G06N3/045G06F18/24
Inventor 刘勇蔡万伟
Owner SHENZHEN INTELLIFUSION TECHNOLOGIES CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products