A quantization circuit and a quantization method of a convolution neural network

A convolutional neural network and circuit technology, applied in the field of artificial intelligence data processing, can solve the problems of equipment power consumption and running speed not meeting the requirements, new networks cannot be applied and verified, hindering the progress of algorithm optimization network, etc., to achieve deployment and operational reliability, reduced storage capacity and bandwidth requirements, deployment and operational assurance effects

Active Publication Date: 2019-03-15
INSPUR GROUP CO LTD
View PDF5 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Due to the limitation of mobile terminal processing and storage level development, the artificial neural network with doubled depth and size can only be run in processing machines with large-scale computing resources, and the power consumption and running speed of the device cann...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A quantization circuit and a quantization method of a convolution neural network
  • A quantization circuit and a quantization method of a convolution neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described below in conjunction with specific examples.

[0030] A convolutional neural network quantization circuit includes an original parameter pool, a comparator array, a quantization parameter calculation unit, an arithmetic operation unit, a fine-tuning unit and an activation unit.

[0031] The original parameter pool is used to store the original parameter data required for the calculation of each layer of the convolutional neural network, including each channel data and bias data of all convolution kernels of each layer, all expressed in a signed real number data format;

[0032] The comparator array is used to perform statistical operations on the data in the original parameter pool, and iteratively compares to obtain the maximum and minimum values ​​of the parameters of each layer of the convolutional neural network;

[0033] The quantization parameter calculation unit is used to perform arithmetic operations on the maximu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A quantization circuit and a quantization method of a convolution neural network are disclosed, The invention belongs to the technical field of artificial intelligence data processing, and comprises an original parameter pool, a comparator array, a quantization parameter calculation unit and an arithmetic operation unit. The original parameter pool is used for storing the original parameter data required for calculation of each layer of a convolution neural network, and comprises data of all channels and offset data of all convolution cores of each layer. The comparator array is used for performing statistical operation on the data in the original parameter pool, and iteratively comparing to obtain the maximum value and the minimum value of the parameters of each layer of the convolution neural network; The quantization parameter calculation unit is used for performing arithmetic operation on the maximum value and the minimum value to obtain each parameter used for model quantization;The arithmetic operation unit is used for quantifying the model, and the obtained results are expressed in an integer format of an unsigned bit specified number of bits. The invention can reduce the system power consumption through quantization, so that the deployment and operation of the convolution neural network on the terminal are more reliably guaranteed.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence data processing, in particular to a convolutional neural network quantization circuit and a quantization method. Background technique [0002] As an important direction in the development trend of artificial intelligence, convolutional neural network has been in a state of intense development. Various new models and new algorithms emerge in an endless stream, continuously injecting new impetus into this field. Among them, the increase in the depth and scale of the network model is the main development direction. In the process of continuous improvement in accuracy, the deployment and implementation of the neural network is facing great challenges. [0003] Due to the limitations of the development of mobile terminal processing and storage levels, the artificial neural network whose depth and size have doubled can only be run on processing machines with large-scale computing resour...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/06G06N3/04
CPCG06N3/063G06N3/045
Inventor 王子彤姜凯于治楼
Owner INSPUR GROUP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products