Neural network regularization bit serial calculation compression method and device
A neural network and bit-serial computing technology, applied in neural learning methods, biological neural network models, computing, etc. It can solve problems such as uneven distribution of bit sparsity and unbalanced load across computing units, so as to improve computing power, reduce energy consumption, and improve energy efficiency.
Embodiment Construction
[0032] Existing bit-serial accelerators reduce computation by skipping the zero bits in the bit stream. Therefore, improving the bit sparsity of a neural network model, that is, the proportion of 0 bits in the binary representation of the weights, improves the efficiency of network operations. However, improving bit sparsity alone is not enough, because the sparsity may also be unevenly distributed. For example, when multiple groups of data are processed simultaneously on the serial operation units, the operation time of each group differs because each group contains a different number of 1 bits. To keep the groups of computing units synchronized, the accelerator forces the groups that finish first to wait for the groups that finish later, and moves on to the next batch of data only after all operations are complete. This mechanism wastes a great deal of computing resources.
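The load imbalance described in paragraph [0032] can be illustrated with a small Python sketch. It is not the patented method, only a toy cost model under stated assumptions: weights are 8-bit magnitudes, each computing unit (lane) spends one cycle per 1 bit and skips 0 bits for free, and a batch finishes only when the slowest lane finishes. The function names (`popcount`, `bit_sparsity`, `batch_stats`) and the sample weights are hypothetical.

```python
def popcount(value: int, bits: int = 8) -> int:
    """Count the 1 bits in the low `bits` bits of |value| (assumed magnitude encoding)."""
    return bin(abs(value) & ((1 << bits) - 1)).count("1")


def bit_sparsity(groups, bits: int = 8) -> float:
    """Fraction of 0 bits over all weights in all groups."""
    weights = [w for group in groups for w in group]
    ones = sum(popcount(w, bits) for w in weights)
    return 1.0 - ones / (len(weights) * bits)


def batch_stats(groups, bits: int = 8):
    """Toy model: one cycle per 1 bit, lanes synchronize at the end of the batch."""
    cycles = [sum(popcount(w, bits) for w in group) for group in groups]
    batch_time = max(cycles)                    # slowest lane sets the pace
    idle = [batch_time - c for c in cycles]     # cycles each lane spends waiting
    utilization = sum(cycles) / (batch_time * len(cycles)) if batch_time else 1.0
    return cycles, batch_time, idle, utilization


# Three lanes, two weights each (hypothetical values).
groups = [
    [0b00000001, 0b00000010],  # sparse group
    [0b01111111, 0b00111111],  # dense group
    [0b00010001, 0b00000100],  # sparse group
]

sparsity = bit_sparsity(groups)
cycles, batch_time, idle, utilization = batch_stats(groups)
print(f"bit sparsity: {sparsity:.3f}")
print(f"cycles per lane: {cycles}, batch time: {batch_time}")
print(f"idle cycles per lane: {idle}, utilization: {utilization:.3f}")
```

In this toy example the overall bit sparsity is fairly high, yet the single dense group dictates the batch time and the other two lanes sit idle for most of it. This is the sense in which the distribution of sparsity matters and not only its overall level.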
[0033] The method of the present inventi...


