Quantization and fixed-point fusion method and device of neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A neural network and fusion method technology, applied in biological neural network models, neural architectures, etc., can solve the problems of waste of computing resources, reduce computing power, and waste of precision, and achieve the requirements of reducing resources, reducing the amount of calculation, and reducing the bandwidth. desired effect

Active Publication Date: 2022-06-28

APOLLO INTELLIGENT DRIVING (BEIJING) TECHNOLOGY CO LTD

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For example, in the application scenario of autonomous parking, traditional neural network calculations are based on high-bit floating-point operations, resulting in a large waste of computing resources

Or, in the traditional neural network acceleration method, even if a low-bit floating-point or integer budget is used, precision is wasted in the process of processing intermediate floating-point operations, which not only wastes precision but also reduces computing power.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0063] Exemplary embodiments of the present application are described below with reference to the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

[0064] figure 1 It is a flow chart of a method for quantizing and locating a neural network for fusion according to an embodiment of the present application. see figure 1 , the quantization and fixed-point fusion methods of the neural network include:

[0065] Step S110, performing quantization processing on the input data and weights of the current layer of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The application discloses a neural network quantification and fixed-point fusion method, device, electronic equipment and storage medium, and relates to the field of artificial intelligence, especially the field of automatic driving (including autonomous parking). The specific implementation plan is: quantify the input data and weights of the current layer of the neural network; in the current layer, use the quantized weights to perform calculation operations on the quantized input data to obtain calculation results; The processing parameters are fixed-point processing; the preset processing parameters after the fixed-point processing are used to perform post-processing operations on the calculation operation results to obtain the output result of the current layer. Through the integration of quantization processing and fixed-point processing, the embodiment of the present application significantly reduces the bandwidth requirements for data transmission between operators, effectively reduces the calculation amount of the acceleration unit, and fully utilizes the advantages of the fixed-point calculation of the acceleration unit. The resource requirements for computing are reduced, and the computing efficiency is improved while saving resources.

Description

technical field [0001] The present application relates to the field of information technology, in particular to the field of artificial intelligence, especially the field of automatic driving (including autonomous parking). Background technique [0002] The calculation of traditional neural network is based on high-bit (bit) floating-point operations, which causes a lot of waste of computing resources, and is prone to overfitting, which reduces the generalization ability of the model. In the traditional acceleration method of neural network, even if a low-bit floating-point or integer budget is used, precision will be wasted in the process of processing intermediate floating-point operations, resulting in the final result being truncated before the subsequent process is used. operate. This approach wastes both precision and computational power. [0003] For example, the current neural network reasoning in embedded platform has the following solutions: (1) Select a small ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06N3/04

CPCG06N3/045

Inventor 齐南

Owner APOLLO INTELLIGENT DRIVING (BEIJING) TECHNOLOGY CO LTD

Quantization and fixed-point fusion method and device of neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology