Inference acceleration method, inference acceleration device and storage medium

An acceleration device and acceleration system technology, applied in the field of information processing, that addresses problems such as high power consumption, large neural network model depth, and reduced processing efficiency, and achieves low power consumption, fast operation speed, and improved general applicability.

Pending Publication Date: 2022-07-22
BEIJING XIAOMI MOBILE SOFTWARE CO LTD

AI Technical Summary

Problems solved by technology

[0002] With the development of computer technology, neural network models have been successfully applied in many fields, such as image recognition and autonomous driving. As application requirements grow, neural network models contain more and more network layers. As a result, model depth increases, the amount of computation rises significantly, and processing efficiency drops sharply.
[0003] In the related art, inference of neural network models is usually accelerated by hardware such as a central processing unit (CPU), a graphics processing unit (GPU), a neural network processing unit (NPU), and a digital signal processor (DSP). However, running neural network models on the CPU and GPU of mobile edge devices such as mobile phones consumes substantial terminal resources and power, so the models cannot run for long periods (for example, 24 hours).



Examples


Embodiment Construction

[0072] Exemplary embodiments will be described in detail herein, examples of which are illustrated in the accompanying drawings. Where the following description refers to the drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with some aspects of the invention as recited in the appended claims.

[0073] In the related art, the neural network model is usually compiled and optimized on the NPU, CPU, and GPU. Figure 1 is a schematic diagram of a CPU-based compilation co-optimization algorithm provided by the related art. Pruning the algorithm reduces the number of numerical operations in it, and then the number of instructions r...
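The effect of pruning on the number of numerical operations can be illustrated with a minimal sketch. The excerpt does not say which pruning scheme the related art uses, so magnitude pruning is assumed here, and the names `magnitude_prune` and `sparse_matvec` are hypothetical helpers introduced only for illustration.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the given fraction of weights with the smallest magnitude.

    Assumption: magnitude pruning stands in for the unspecified pruning step.
    """
    cells = [(abs(w), r, c)
             for r, row in enumerate(weights)
             for c, w in enumerate(row)]
    cells.sort()
    drop = {(r, c) for _, r, c in cells[: int(len(cells) * sparsity)]}
    return [[0.0 if (r, c) in drop else w for c, w in enumerate(row)]
            for r, row in enumerate(weights)]


def sparse_matvec(weights, x):
    """Matrix-vector product that skips zero weights and counts multiplications."""
    out, mults = [], 0
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            if w != 0.0:
                acc += w * xi
                mults += 1
        out.append(acc)
    return out, mults


weights = [[0.9, -0.05, 0.4],
           [0.01, -0.7, 0.02]]
x = [1.0, 2.0, 3.0]

_, dense_mults = sparse_matvec(weights, x)
pruned = magnitude_prune(weights, sparsity=0.5)
_, pruned_mults = sparse_matvec(pruned, x)
print(dense_mults, pruned_mults)
```

Half of the weights are removed, so the pruned layer performs half as many multiplications. A real compiler would also need a sparse storage format for the skipped weights to translate into fewer issued instructions.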



Abstract

The invention relates to an inference acceleration method, an inference acceleration device and a storage medium. The method comprises: obtaining first model information of a first network model to be accelerated and second model information of a second network model corresponding to the first network model; inputting the first model information and the second model information into an inference acceleration system, performing fixed-point optimization on the first network model and the second network model using a fixed-point optimization module of the inference acceleration system, and determining a third network model and fixed-point configuration information of the third network model; accelerating the third network model based on single instruction, multiple data (SIMD) instructions using a SIMD optimization module of the inference acceleration system, to obtain SIMD configuration information of the accelerated target network model; and determining target model configuration information of the target network model according to the fixed-point configuration information and the SIMD configuration information.
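The two stages named in the abstract can be pictured with a minimal sketch. It is not the patented method: symmetric 8-bit quantization is assumed for the fixed-point step, the SIMD step is only imitated by processing integers in fixed-width lanes in plain Python, and every name (`FixedPointConfig`, `quantize`, `simd_style_dot`, `build_target_config`) is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FixedPointConfig:
    scale: float   # real value represented by one integer step
    bits: int = 8  # assumed bit width


def quantize(values, bits=8):
    """Symmetric fixed-point quantization: floats -> signed integers plus a scale."""
    qmax = 2 ** (bits - 1) - 1                  # 127 for 8 bits
    peak = max(abs(v) for v in values) or 1.0   # avoid a zero scale
    scale = peak / qmax
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, FixedPointConfig(scale=scale, bits=bits)


def simd_style_dot(a, b, lanes=4):
    """Integer dot product accumulated lane by lane, mimicking SIMD registers."""
    acc = [0] * lanes
    for start in range(0, len(a), lanes):
        chunk = zip(a[start:start + lanes], b[start:start + lanes])
        for lane, (x, y) in enumerate(chunk):
            acc[lane] += x * y
    return sum(acc)  # horizontal add across lanes


def build_target_config(w_cfg, x_cfg, lanes):
    """Merge fixed-point and SIMD settings into one configuration (assumed layout)."""
    return {
        "fixed_point": {"weight_scale": w_cfg.scale,
                        "input_scale": x_cfg.scale,
                        "bits": w_cfg.bits},
        "simd": {"lanes": lanes},
    }


weights = [0.5, -1.0, 0.25, 0.75, -0.5]
inputs = [1.0, 2.0, -1.0, 0.5, 4.0]

qw, w_cfg = quantize(weights)
qx, x_cfg = quantize(inputs)
approx = simd_style_dot(qw, qx) * w_cfg.scale * x_cfg.scale
exact = sum(w * x for w, x in zip(weights, inputs))
config = build_target_config(w_cfg, x_cfg, lanes=4)
print(exact, approx, config)
```

The integer result rescaled by both scales stays close to the floating-point dot product, which is the trade the fixed-point step makes: a small accuracy loss in exchange for integer arithmetic that SIMD units can process several elements at a time.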

Description

Technical field

[0001] The present disclosure relates to the technical field of information processing, and in particular, to an inference acceleration method, an inference acceleration device, and a storage medium.

Background

[0002] With the development of computer technology, neural network models have been successfully applied in many fields, such as image recognition and autonomous driving. As application requirements grow, neural network models contain more and more network layers. As a result, model depth increases, the amount of computation rises significantly, and processing efficiency drops sharply.

[0003] In the related art, a central processing unit (CPU), a graphics processing unit (GPU), a neural network processing unit (Neural network processing unit...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06N3/04, G06N3/063, G06N5/04
CPC: G06N5/04, G06N3/063, G06N3/045
Inventor: 罗博源, 史润宇, 王凯, 尹旭东, 刘梓城
Owner: BEIJING XIAOMI MOBILE SOFTWARE CO LTD