Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Deep neural network reasoning method and computing device

A technology of deep neural network and reasoning method, which is applied in the field of deep neural network reasoning method and computing equipment, and can solve the problem that the acceleration effect of deep neural network cannot meet the needs of inference speed.

Inactive Publication Date: 2019-01-15
HUAWEI TECH CO LTD
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a deep neural network reasoning method and computing equipment to solve the problem that the acceleration effect of the current deep neural network is difficult to meet the reasoning speed requirements of actual scenarios such as unmanned driving

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network reasoning method and computing device
  • Deep neural network reasoning method and computing device
  • Deep neural network reasoning method and computing device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The embodiment of the present application provides a deep neural network reasoning method and related equipment, which can eliminate the problem of requiring a large amount of calculation by quantifying the input features of the model and model parameters, and pre-configuring the codebook, so as to solve the problem that the current deep neural network reasoning speed is relatively slow. slow problem.

[0056] In order to enable those skilled in the art to better understand the solutions of the present application, the following will describe the embodiments of the present application with reference to the drawings in the embodiments of the present application.

[0057] The terms "first", "second", "third", "fourth", etc. (if any) in the specification and claims of the present application and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the terms so used are...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application relates to the field of neural networks, in particular to a deep neural network reasoning method and a computing device. The method includes receiving input features inputted to an operation layer in a first depth neural network model; determining an index value corresponding to the operation layer; determining a codebook of the operation layer according to the index valueof the operation layer; quantifying the input features according to a preset first quantization rule; In the operation layer, the operation layer performs the operation on the input features accordingto the codewords corresponding to the quantized input features in the quantized input features query codebook. In accordance with an embodiment of that present application, each quantization model parameter and each quantization input characteristic are multiplied to obtain a codebook, as long as the actual input feature is quantized into the quantized input feature, the floating-point multiplication result of the quantized input feature can be obtained by directly consulting the codebook, so that the operation can be completed quickly, and the reasoning speed of the deep neural network can be greatly accelerated without the substantial floating-point multiplication operation.

Description

technical field [0001] The present application relates to the field of neural networks, in particular to a deep neural network reasoning method and computing equipment. Background technique [0002] With the rapid development and popularization of computer and information technology, industry application data is growing explosively. Industry and enterprise big data that can reach several terabytes (TB for short) or even petabytes (PB for short) often imply a lot of in-depth knowledge and value that are not available when the amount of data is small , Data analysis led by large-scale machine learning (including deep learning) is a key technology for converting big data into useful knowledge. As a key technology leading the development direction of artificial intelligence (AI), deep neural network has made remarkable achievements in many fields such as face recognition, image classification, target detection, video analysis, speech recognition, machine translation, etc. As a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N5/04G06N3/08
CPCG06N5/04G06N3/08
Inventor 张长征陈晓仕涂丹丹
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products