Unlock instant, AI-driven research and patent intelligence for your innovation.

Operation and quantification method of neural network model, electronic equipment and medium

A technology of neural network model and operation method, which is applied in quantitative methods, electronic equipment and media, and the operation field of neural network model, and can solve problems such as large memory and occupation.

Pending Publication Date: 2022-08-09
ARM TECH CHINA CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, using the data within the entire input data range of the nonlinear function and the corresponding quantized data to construct the LUT will take up a large amount of memory.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Operation and quantification method of neural network model, electronic equipment and medium
  • Operation and quantification method of neural network model, electronic equipment and medium
  • Operation and quantification method of neural network model, electronic equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0123] Embodiments of the present application include, but are not limited to, operations for neural network models, quantification methods, electronic devices, and media.

[0124] In order to clearly understand the embodiments of the present application, the nonlinear functions mentioned in the embodiments of the present application are briefly introduced below.

[0125] It can be understood that the nonlinear function mentioned in the embodiment of the present application can be described as: when the value of the input data X is getting closer and closer to the end value of the input data range, the falling range or the rising range of the value of the output data Y gradually tends to be gentle. , and is close to the minimum or maximum value of the output data range, a class of nonlinear functions.

[0126] It can be understood that when the nonlinear function is a periodic function, the above description can be applied to one period of the periodic function.

[0127] For ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of quantification, and discloses a neural network model operation method, a neural network model quantification method, electronic equipment and a medium. The neural network model comprises nonlinear function operation; the operation method comprises the following steps: in the operation process of the neural network model, obtaining input data of a nonlinear function; quantized data corresponding to the input data are obtained from a quantization mapping table, and in the quantization mapping table, for the input data belonging to the first type of truncation range, the quantized data corresponding to the input data are quantized data corresponding to data with the same numerical value as the input data in the first type of truncation range in the quantization mapping table; for the data belonging to the second type of truncation range, the quantized data corresponding to the input data is the quantized data corresponding to the endpoint data of the first type of truncation range in the quantization mapping table. Based on the scheme, the memory occupied by the quantization mapping table can be effectively reduced, so that the memory occupied by the neural network model is further reduced.

Description

technical field [0001] The present application relates to the technical field of quantification, and in particular, to an operation, quantification method, electronic device and medium of a neural network model. Background technique [0002] At present, with the rapid application of deep learning technology in many fields, a large number of neural network models based on deep learning have appeared. However, the neural network model structure is generally more complex and occupies a large amount of memory, so it is necessary to quantify and compress the neural network model to reduce the memory occupation and speed up the inference speed of the model. [0003] The neural network model includes multiple nodes (or neurons), each node represents a specific output function, called an excitation function or an activation function, and the function type of some nodes is a linear function, that is, the input data and The relationship with the output data is linear, for example, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06F16/2453
CPCG06F16/24535G06N3/048
Inventor 章小龙黄敦博陈柏韬
Owner ARM TECH CHINA CO LTD