Unlock instant, AI-driven research and patent intelligence for your innovation.

Cross-platform model reasoning method and system, storage medium and equipment

A cross-platform and model technology, applied in the server field, can solve the problems of high hardware platform cost and insufficient back-end scalability, etc., to achieve the effect of solving insufficient scalability and reducing costs

Pending Publication Date: 2022-01-28
SUZHOU LANGCHAO INTELLIGENT TECH CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of this, the object of the present invention is to propose a general model reasoning method, system, storage medium and equipment that can be used on different hardware platforms, so as to solve the problem of the traditional reasoning framework switching target hardware platform in the prior art. High, insufficient back-end scalability and other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-platform model reasoning method and system, storage medium and equipment
  • Cross-platform model reasoning method and system, storage medium and equipment
  • Cross-platform model reasoning method and system, storage medium and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0046] It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used to distinguish two entities with the same name or different parameters. It can be seen that "first" and "second" " is only for the convenience of expression, and should not be understood as limiting the embodiment of the present invention. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, for example, of a process, method, system, product or other steps or elements inherent in a process, method, system, product, or device comprising a series of steps or elements.

[0047] Based on the above purpo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a cross-platform model reasoning method and system, a storage medium and equipment. The method comprises the steps od adding a deployment chip in a target back-end module of a deep learning compiling framework as a marking back end; placing a runtime library and a machine learning library of the deployed chip into the deep learning compiling framework so as to realize a performance function of the deployed chip on the deep learning compiling framework and establish an operator warehouse corresponding to the marking back end; analyzing the model file by the deep learning compiling framework to generate calculation graph representation in the deep learning compiling framework; in response to the fact that a target back end is set as the marking back end, searching the implementation of each operator in the computational graph from the operator warehouse, wherein the deep learning compiling framework generates a dynamic link library through the corresponding implementation of each operator and the corresponding performance function; and loading the dynamic link library on a deployment chip to execute model reasoning. According to the invention, the workload of switching the target hardware platform is reduced, and the expansibility of the back end is improved.

Description

technical field [0001] The present invention relates to the technical field of servers, in particular to a method, system, storage medium and equipment for cross-platform model reasoning. Background technique [0002] In the model inference task, it is a very challenging task to deploy the trained model on different target hardware platforms such as CPU, GPU, FPGA and other new artificial intelligence chips such as Cambrian MLU and ensure the efficiency of reasoning. Different types of chips may have large differences in memory hierarchy, supported instructions and data types, etc. Most of the existing model reasoning and optimization methods focus on a single type of chip device. When switching the target hardware platform, the model reasoning method often needs to be replaced accordingly, and the resulting cost will increase with the rapid iteration of artificial intelligence chips. [0003] Currently, TensorRT, the reasoning acceleration framework launched by NVIDIA, par...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F8/41G06F8/60G06N5/04
CPCG06F8/41G06F8/60G06N5/041
Inventor 王慕雪
Owner SUZHOU LANGCHAO INTELLIGENT TECH CO LTD