Supercharge Your Innovation With Domain-Expert AI Agents!

Method for processing convolution neural network

A technology of convolutional neural network and model, applied in the field of energy-saving convolutional neural network implementation

Inactive Publication Date: 2020-06-19
KNERON TAIWAN CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] CNN requires a lot of arithmetic operations, so it cannot be implemented on low-power devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for processing convolution neural network
  • Method for processing convolution neural network
  • Method for processing convolution neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0026] The embodiment discloses a quantization method, and the activation vector is described below in a fixed-precision notation.

[0027] The scalar factor s is defined by Equation 3 when using the dynamic fixed-point format to fully represent 32-bit floating-point values ​​in the activation vector (x).

[0028]

[0029] Where p represents the quantization bit length. In Equation 3, the dynamic quantization range is [[-max v , max v ]]. For the activation vector (x) in the convolution operation and the fully connected operation, max v The statistical maximum of the usual input features for a large set of data sets. available by figure 1 The statistical maximum value in is used for analysis.

[0030] Based on formula 3, s is a scalar factor, which is used to make up the gap between the floating-point value and the fixed-point value. The scalar factor s is a mathematical real number represented in 32-bit floating-point format. Apply scalar factor s to activation vec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for processing convolution neural network. After inputting input data to a floating pre-trained convolution neural network to generate floating feature maps for each layer of the floating pre-trained CNN model, a statistical analysis on the floating feature maps is performed to generate a dynamic quantization range for each layer of the floating pre-trained CNN model. Based on the obtained quantization range for each layer, the proposed quantization methodologies quantize the floating pre-trained CNN model to generate the scalar factor of each layer and the fractional bit-width of a quantized CNN model. It enables the inference engine performs low-precision fixed-point arithmetic operations to generate a fixed-point inferred CNN model.

Description

technical field [0001] The present invention relates to image processing, in particular to an energy-saving convolutional neural network implementation. Background technique [0002] Due to its outstanding success in the ImageNet competition, Convolution Neural Network (CNN) has become the most popular structure in computer vision processing. A typical pre-trained CNN model requires millions of operations, a large amount of memory space, and several watts of power for a single inference operation. Limited computing resources and storage space have become the main obstacles to implementing CNN on Internet of things (IoT) or portable devices. [0003] There are three main challenges in developing new CNN accelerators: [0004] Spatial data transfer using limited storage memory: Due to limited memory (<320KB SRAM) in IoT devices, real-time artificial intelligence (AI) applications cannot accept off-chip memory such as The delay of large data transfers between dynamic rand...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04
CPCG06N3/045G06N3/082G06N3/063
Inventor 伍捷马云汉谢必克李湘村苏俊杰刘峻诚
Owner KNERON TAIWAN CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More