Unlock instant, AI-driven research and patent intelligence for your innovation.

Quantization and fixed-point fusion method and device of neural network

A neural network and fusion method technology, applied in biological neural network models, neural architectures, etc., can solve the problems of waste of computing resources, reduce computing power, and waste of precision, and achieve the requirements of reducing resources, reducing the amount of calculation, and reducing the bandwidth. desired effect

Active Publication Date: 2022-06-28
APOLLO INTELLIGENT DRIVING (BEIJING) TECHNOLOGY CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, in the application scenario of autonomous parking, traditional neural network calculations are based on high-bit floating-point operations, resulting in a large waste of computing resources
Or, in the traditional neural network acceleration method, even if a low-bit floating-point or integer budget is used, precision is wasted in the process of processing intermediate floating-point operations, which not only wastes precision but also reduces computing power.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quantization and fixed-point fusion method and device of neural network
  • Quantization and fixed-point fusion method and device of neural network
  • Quantization and fixed-point fusion method and device of neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] Exemplary embodiments of the present application are described below with reference to the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

[0064] figure 1 It is a flow chart of a method for quantizing and locating a neural network for fusion according to an embodiment of the present application. see figure 1 , the quantization and fixed-point fusion methods of the neural network include:

[0065] Step S110, performing quantization processing on the input data and weights of the current layer of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The application discloses a neural network quantification and fixed-point fusion method, device, electronic equipment and storage medium, and relates to the field of artificial intelligence, especially the field of automatic driving (including autonomous parking). The specific implementation plan is: quantify the input data and weights of the current layer of the neural network; in the current layer, use the quantized weights to perform calculation operations on the quantized input data to obtain calculation results; The processing parameters are fixed-point processing; the preset processing parameters after the fixed-point processing are used to perform post-processing operations on the calculation operation results to obtain the output result of the current layer. Through the integration of quantization processing and fixed-point processing, the embodiment of the present application significantly reduces the bandwidth requirements for data transmission between operators, effectively reduces the calculation amount of the acceleration unit, and fully utilizes the advantages of the fixed-point calculation of the acceleration unit. The resource requirements for computing are reduced, and the computing efficiency is improved while saving resources.

Description

technical field [0001] The present application relates to the field of information technology, in particular to the field of artificial intelligence, especially the field of automatic driving (including autonomous parking). Background technique [0002] The calculation of traditional neural network is based on high-bit (bit) floating-point operations, which causes a lot of waste of computing resources, and is prone to overfitting, which reduces the generalization ability of the model. In the traditional acceleration method of neural network, even if a low-bit floating-point or integer budget is used, precision will be wasted in the process of processing intermediate floating-point operations, resulting in the final result being truncated before the subsequent process is used. operate. This approach wastes both precision and computational power. [0003] For example, the current neural network reasoning in embedded platform has the following solutions: (1) Select a small ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04
CPCG06N3/045
Inventor 齐南
Owner APOLLO INTELLIGENT DRIVING (BEIJING) TECHNOLOGY CO LTD