Quantization processing method and device and quantization processing chip

A quantization processing and chip technology, applied in the field of quantization processing, that addresses problems such as the increased cost of quantization processing.

Active Publication Date: 2021-08-20
BEIJING SENSETIME TECH DEV CO LTD

AI Technical Summary

Problems solved by technology

Channel quantization improves quantization accuracy, but it requires a dedicated channel quantization processing chip, which increases the cost of quantization processing.

Embodiment Construction

[0035] Exemplary embodiments will be described in detail herein, examples of which are illustrated in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as recited in the appended claims.

[0036] The terminology used in the present disclosure is for the purpose of describing particular embodiments only and is not intended to limit the present disclosure. As used in this disclosure and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It will also be understood that the term "...

Abstract

An embodiment of the invention provides a quantization processing method and device, and a quantization processing chip. In the method, an initial quantization parameter is first determined for each channel based on the distribution characteristics of that channel's network parameters, so that the initial quantization parameter of a channel is, in theory, the optimal quantization parameter for the distribution of that channel's network parameters. The initial quantization parameter, however, may not satisfy the layer quantization hardware deployment condition. An optimized quantization parameter is therefore searched for each channel within a search space determined from the initial quantization parameter. The optimized quantization parameter found in this way has quantization performance close to that of the initial quantization parameter and satisfies the layer quantization hardware deployment condition, so the method can be applied to a general-purpose quantization processing chip, whose quantization performance then approaches that of the channel quantization mode.
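The two-stage idea in the abstract (an initial per-channel parameter, then a search near it for a parameter that hardware can deploy) can be illustrated with a minimal sketch. The abstract does not say what the layer quantization hardware deployment condition is, so the sketch assumes, purely for illustration, that every channel scale must equal a shared layer scale multiplied by a power of two. The max-magnitude rule for the initial scale, the search radius, the 8-bit range, and all function names are likewise hypothetical and are not taken from the patent.

```python
import math
import random

QMAX = 127  # signed 8-bit range; an assumption for this sketch


def quant_mse(values, scale):
    """Mean squared error after quantizing with `scale` and dequantizing."""
    err = 0.0
    for v in values:
        q = max(-QMAX, min(QMAX, round(v / scale)))
        err += (v - q * scale) ** 2
    return err / len(values)


def initial_scale(values):
    """Initial per-channel scale from the channel's distribution.

    Here simply the largest magnitude mapped onto QMAX (an assumed rule).
    """
    return max(abs(v) for v in values) / QMAX


def search_scale(values, init, layer_scale, radius=2):
    """Search near `init` among scales that meet the ASSUMED condition.

    Assumed condition: scale == layer_scale * 2**k for some integer k.
    The search space is the 2 * radius + 1 exponents closest to `init`.
    """
    k0 = round(math.log2(init / layer_scale))
    candidates = [layer_scale * 2.0 ** k for k in range(k0 - radius, k0 + radius + 1)]
    return min(candidates, key=lambda s: quant_mse(values, s))


rng = random.Random(0)
# Hypothetical layer: three channels with different weight ranges.
channels = [[rng.gauss(0.0, std) for _ in range(256)] for std in (0.02, 0.1, 0.7)]
layer_scale = max(abs(v) for ch in channels for v in ch) / QMAX

results = []
for ch in channels:
    init = initial_scale(ch)
    opt = search_scale(ch, init, layer_scale)
    results.append((opt, quant_mse(ch, opt), quant_mse(ch, layer_scale)))
    print(f"init={init:.5f} optimized={opt:.5f} "
          f"mse_init={quant_mse(ch, init):.2e} mse_opt={quant_mse(ch, opt):.2e}")
```

Under this assumed condition, each optimized scale stays within a small factor of the initial scale, which is why its error can remain close to the per-channel optimum while the scale itself is still expressible relative to a single layer scale.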

Description

Technical field

[0001] The present disclosure relates to the technical field of quantization processing, and in particular, to a quantization processing method and device, and a quantization processing chip.

Background

[0002] Quantization processing plays an important role in the accelerated deployment of neural networks. In the related art, there are two quantization processing methods: layer quantization and channel quantization. In layer quantization, the network parameters of all channels of the same network layer are quantized with the same quantization parameters; in channel quantization, the network parameters of each channel of the same network layer are quantized with different quantization parameters. Channel quantization can improve quantization accuracy, but it requires a dedicated channel quantization processing chip, which leads to an increase in the cos...
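The difference between the two modes described in the background can be seen in a small numerical sketch. It assumes symmetric 8-bit quantization with a max-magnitude scale, which the text does not specify; the synthetic weights and every function name are hypothetical.

```python
import random


def symmetric_scale(values, num_bits=8):
    """Scale that maps the largest magnitude onto the signed integer range."""
    qmax = 2 ** (num_bits - 1) - 1
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0


def fake_quantize(values, scale, num_bits=8):
    """Quantize to signed integers with `scale`, then dequantize."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]


def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def layer_quant_error(channels):
    """Layer quantization: one shared scale for every channel of the layer."""
    scale = symmetric_scale([v for ch in channels for v in ch])
    return sum(mse(ch, fake_quantize(ch, scale)) for ch in channels) / len(channels)


def channel_quant_error(channels):
    """Channel quantization: a separate scale for each channel."""
    return sum(
        mse(ch, fake_quantize(ch, symmetric_scale(ch))) for ch in channels
    ) / len(channels)


rng = random.Random(0)
# Hypothetical layer: four channels whose weights have very different ranges.
channels = [[rng.gauss(0.0, std) for _ in range(256)] for std in (0.01, 0.1, 0.5, 2.0)]

layer_err = layer_quant_error(channels)
channel_err = channel_quant_error(channels)
print(f"layer quantization MSE:   {layer_err:.3e}")
print(f"channel quantization MSE: {channel_err:.3e}")
```

With a shared scale, the channels with small weights are represented by only a few integer levels, so their error dominates; per-channel scales avoid this, which is the accuracy advantage the background attributes to channel quantization.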

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F15/78, G06N3/04, G06N3/063
CPC: G06F15/7814, G06N3/063, G06N3/045
Inventor: 张行程姚超吉小洪
Owner: BEIJING SENSETIME TECH DEV CO LTD