Unlock instant, AI-driven research and patent intelligence for your innovation.

Precision improving method of offline quantification tool

A technology of precision and tools, applied in the field of deep learning, can solve problems such as low precision, and achieve the effect of improving processing precision, speeding up processing speed, and reducing economic costs

Active Publication Date: 2020-07-03
开放智能机器(上海)有限公司
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Int8 network model quantization is divided into two implementation schemes: one is the perceptual quantization training that requires training framework support, directly outputs the Int8 network model, and the quantized Int8 network model The accuracy is high, but the existing Float32 network model needs to be retrained, which requires a large amount of data sets and long-term training support; the other is based on the existing Float32 network model, through model quantification tools and a small amount of calibration Image offline output Int8 network model, the operation is relatively simple, no retraining is required, but the accuracy is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Precision improving method of offline quantification tool
  • Precision improving method of offline quantification tool
  • Precision improving method of offline quantification tool

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0045] It should be noted that, in the case of no conflict, the embodiments of the present invention and the features in the embodiments can be combined with each other.

[0046] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments, but not as a limitation of the present invention.

[0047] An accuracy improvement method for offline quantization tools, which is used to improve the accuracy of d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of deep learning, in particular to a precision improvement method of an offline quantization tool. The precision improvement method comprises the steps that S1, a processing unit carries out linear processing on each quantization conversion factor; s2, the processing unit obtains first processing data output by all convolution layers; s3, the processing unit obtains second processing data output by the convolution layer; s4, determining a quantization parameter related to the current update according to the first processing data and the second processing data, and updating the corresponding quantization conversion factor by adopting the determined quantization parameter; s5, judging whether a quantization conversion factor which is not updated yet existsin the quantization conversion factor set or not, and if yes, returning to the step S3; and if not, outputting the updated quantization conversion factor set as a precision improvement result, and then ending. The technical scheme has the beneficial effects that the processing precision of the network model is further improved, the processing speed is increased, and the economic cost is reduced.

Description

technical field [0001] The invention relates to the field of deep learning, in particular to a method for improving the accuracy of an offline quantization tool. Background technique [0002] With the continuous development of AI technology, the neural network algorithm based on deep learning has become the mainstream method of AI research. Due to considerations of cost, power consumption, privacy and other issues, more and more application scenarios migrate the calculation of AI algorithms from the cloud to mobile embedded terminal devices. [0003] The current embedded terminal equipment has limited computing power and storage resources. When the neural network algorithm model is deployed on the embedded terminal, it is necessary to adopt the network model compression technology, and the most widely used in the industrial level is to quantize the floating point (Float32) network model into Integer (Int8) network model. Reduce storage requirements and improve network mode...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04
CPCG06N3/045
Inventor 唐琦黄明飞王海涛
Owner 开放智能机器(上海)有限公司