Network quantization method, inference method, and network quantization device
A quantitative method and network technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as poor reasoning accuracy, poor machine learning speed, and impact
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach 1
[0037] The network quantization method and network quantization device according to Embodiment 1 will be described.
[0038] [1-1. Network quantization device]
[0039] First, use figure 1 The configuration of the network quantization device according to this embodiment will be described. figure 1 It is a block diagram showing an outline of the functional configuration of the network quantization device 10 according to this embodiment.
[0040] The network quantization device 10 is a device that quantizes the neural network 14 . That is, the network quantization device 10 is a device that converts the neural network 14 with floating point precision into a neural network with fixed point precision, that is, into a quantized network. In addition, the network quantization device 10 may not quantize all the tensors used by the neural network 14, but only needs to quantize at least a part of the tensors. Here, the tensor refers to a value represented by an n-dimensional array (...
Embodiment approach 2
[0105] The network quantization method and the like according to Embodiment 2 will be described. The difference between the network quantization method involved in this embodiment and the quantization method involved in Embodiment 1 is that, according to the statistical information of the test data set, the test data set is classified into multiple types, and different types are performed according to each type. deal with. Hereinafter, an inference method using a quantized network generated by the network quantization method, network quantization device, and network quantization method according to this embodiment will be described focusing on differences from Embodiment 1.
[0106] [2-1. Network quantization device]
[0107] First, use Figure 9 The configuration of the network quantization device according to this embodiment will be described. Figure 9 It is a block diagram showing an outline of the functional configuration of the network quantization device 110 accordin...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


