Prediction method applied to neural network chip and prediction apparatus thereof

A technology of neural network and prediction method, which is applied in the fields of servers and readable storage media, neural network chip prediction methods and prediction devices, and can solve the problems of inability to realize rapid prediction of artificial neural network models, low calculation efficiency, long calculation time, etc.

Active Publication Date: 2017-10-24
上海中星微莘庄人工智能芯片有限公司
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the existence of a large number of convolution operations in the operation process, a large amount of temporary data is generated during the operation process and a large amount of d

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prediction method applied to neural network chip and prediction apparatus thereof
  • Prediction method applied to neural network chip and prediction apparatus thereof
  • Prediction method applied to neural network chip and prediction apparatus thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. According to the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0034] figure 1 It is a flowchart showing a prediction method applied to a neural network chip according to an exemplary embodiment of the present invention. Such as figure 1 As shown, the method includes:

[0035] 110: During the neural network training process, divide the M output data of the current layer to obtain the distribution interval of the M output data of the current layer.

[0036] In the embodiment of the present invention, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a prediction method applied to a neural network chip and an apparatus thereof, a server and a readable storage medium. The method comprises the following steps of dividing M output data of a current layer and acquiring M output data distribution intervals of the current layer; carrying out statistics on the output data in each output data distribution interval of the M output data distribution intervals and acquiring a ratio of a number of the output data in each output data distribution interval and a total number of the M output data; based on the ratio corresponding to the M output data distribution intervals and a preset bit width, carrying out a bit width constraint on the M output data and acquiring a first initial bit and a first termination bit of N distribution intervals of the M output data distribution intervals; and based on the first initial bit and the first termination bit, carrying out bit width constraint on the M output data of the current layer so as to realize prediction of the neural network chip. In the invention, a data bandwidth is reduced and calculating efficiency is further increased.

Description

technical field [0001] The invention relates to the technical field of artificial neural network computing, in particular to a prediction method and device, a server and a readable storage medium applied to a neural network chip. Background technique [0002] Artificial Neural Network (ANN) is composed of a large number of nodes (or neurons) connected to each other. Each node represents a specific output function, called the activation function (ActivationFunction). Each connection between every two nodes has a weight that determines its connection strength. The value of this weight determines the state of the neuron and the performance of the entire neural network system. For the same network structure with different weights, the performance behavioral characteristics are often different. [0003] Weights and biases are important parameters that affect the performance of artificial neural network models. In the prior art, in the training phase of the artificial neural ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q10/04G06N3/04G06N3/08
CPCG06N3/08G06Q10/04G06N3/045
Inventor 刘小涛艾国张韵东
Owner 上海中星微莘庄人工智能芯片有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products