The invention discloses a 4-bit quantification method and system of a neural network. The method comprises the steps of loading a pre-training model of the neural network; in the pre-training model, counting an initial value of each saturation activation layer satRelu; adding pseudo quantization nodes into the neural network, and using the initial value of satRelu for retraining the neural networkto obtain a pseudo quantization model; judging whether the precision of the pseudo-quantization model converges to the set precision; if yes, carrying out reasoning pretreatment on the pseudo-quantization model, and converting the pseudo-quantization model into a 4-bit reasoning model which can be used for reasoning operation; otherwise, returning to carry out re-training of the neural network. The system mainly comprises a loading module, a statistics module, a retraining module, a judgment module and a conversion module. Through the method and the system, the training efficiency can be effectively improved on the basis of ensuring the accuracy of the training result.