Model deployment method, model deployment device, terminal equipment and readable storage medium

A model and equipment technology, applied in the fields of model deployment devices, readable storage media, and model deployment methods, can solve problems such as insufficient utilization of equipment computing capabilities, waste, and difficulty in improving throughput, and achieve the effect of reducing non-computing overhead

Pending Publication Date: 2021-08-06
深圳市智芯华玺信息技术有限公司
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the above studies failed to well address the deployment of large parameter models
Specifically, most of the current work defaults to the fact that the machine running the deep learning model has enough memory and hard disk, and the model can be run directly on a single machine to obtain the output of the model; on cloud server clusters, distributed storage is generally used. Get better file read and write speed, and then get a better throughput rate, but this will waste more time on transmission, and cannot fully utilize the computing power of all devices
With the development of deep learning, researchers have found that computing resources are always limited; in addition, the equipment responsible for computing usually requires stronger computing power, and the computing power responsible for reading and writing is idle most of the time. A huge waste, on the other hand also makes it difficult to improve the throughput

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model deployment method, model deployment device, terminal equipment and readable storage medium
  • Model deployment method, model deployment device, terminal equipment and readable storage medium
  • Model deployment method, model deployment device, terminal equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the purpose, technical effects and technical solutions of the embodiments of the present invention more clear, the technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention; obviously, the described embodiments It is a part of the embodiment of the present invention. Based on the disclosed embodiments of the present invention, other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall all fall within the protection scope of the present invention.

[0056] A model deployment method in an embodiment of the present invention includes the following steps:

[0057] Obtain the operator model set of the deep neural network model to be deployed; perform operator fusion or operator segmentation on the operator models in the operator model set that meet the preset conditions, and o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a model deployment method, a model deployment device, terminal equipment and a readable storage medium. The method comprises the following steps: obtaining an operator model set of a to-be-deployed deep neural network model; performing operator fusion or operator segmentation processing on operator models meeting a preset condition in the operator model set, and obtaining a processed operator model set; obtaining the operation time of each operator model in the processed operator model set on each device in a device set for deploying the model, and obtaining an operation time set; on the basis of the running time set, combining operator models in the processed operator model set through adoption of a preset search method, and obtaining a sub-model set; and deploying a to-be-deployed deep neural network model on the device set based on the sub-model set. According to the method, equipment with different computing power can be fully compatible, and the operation efficiency and the global throughput rate can be improved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, relates to the field of model deployment, and in particular to a model deployment method, a model deployment device, terminal equipment and a readable storage medium. Background technique [0002] Machine Learning (ML) is one of the fastest-growing fields in computer science today. Typical machine learning techniques can train statistical models for specific applications on a large number of pre-collected data sets to update model parameters (also called as "weights") until convergence; use the trained model for inference, that is, predicting outcomes on new data. Deep learning based on neural networks has become the most widely used ML algorithm due to its excellent results. The neural network model is a multi-layer directed acyclic graph (DAG). The model usually consists of operations such as convolution, matrix multiplication, pooling, batch regularization, etc., which are org...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50G06N3/10G06N3/08G06N3/04
CPCG06F9/5072G06F9/505G06F9/5044G06N3/105G06N3/08G06F2209/5017G06N3/048G06N3/045
Inventor 李发兵林伟伟李想毛兴中
Owner 深圳市智芯华玺信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products