The embodiment of the invention provides a quantization processing method and device, and a quantization processing chip, and the method comprises the steps: firstly, determining the initial quantization parameter of each channel based on the distribution characteristics of the network parameters of each channel, and enabling the initial quantization parameter of one channel to be the optimal quantization parameter theoretically meeting the distribution characteristics of the network parameters of the channel; however, the initial quantization parameter may not satisfy a layer quantization hardware deployment condition; therefore, the optimized quantization parameter corresponding to each channel is searched in the search space determined on the basis of the initial quantization parameter, the quantization performance of the searched optimized quantization parameter is close to that of the initial quantization parameter, and a layer quantization hardware deployment condition can be met, so that the method can be applied to a general quantization processing chip, and the quantization performance of the universal quantization processing chip is close to that of a channel quantization mode.