Model deployment method, electronic device, storage medium, and program product

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
By splitting the neural network model into operator modules and deploying them according to the capability indicators of the computing power cards, the problem of increased execution time caused by differences in computing power card capabilities is solved, and efficient execution of the model on multiple computing power cards and rational utilization of computing power resources are achieved.

CN122240133APending Publication Date: 2026-06-19INSPUR SUZHOU INTELLIGENT TECH CO LTD

View PDF 0 Cites 0 Cited by

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Applications(China)
Current Assignee / Owner: INSPUR SUZHOU INTELLIGENT TECH CO LTD
Filing Date: 2026-05-21
Publication Date: 2026-06-19

Application Information

Patent Timeline

21 May 2026

Application

19 Jun 2026

Publication

CN122240133A

IPC: G06F8/60; G06F9/445; G06F9/50; G06N3/0499

AI Tagging

Application Domain

Resource allocation Biological models

Explore More Agents

Novelty Search
Search existing technologies and assess novelty
↗
FTO
Analyze whether a product may infringe others' patents
↗
Design FTO
Check prior-design risk for exterior design
↗
Drafting
Draft patent application text based on a technical solution
↗
Find Solutions with TRIZ
Generate feasible solution to solve your technical challenge
↗

Similar Technology Patents

A gate hoist online monitoring system and method
CN122217397ABarrages/weirs Resource allocation
A scheduling method and device for encrypted card virtual resources, equipment and medium
CN122195659AImplement dynamic schedulingAchieve on-demand allocationResource allocation Biological models
Resource management method and apparatus, communication device, and storage medium
CN122228483AResource allocation
A multi-level cache method and system based on structure-enhanced prediction and reinforcement learning
CN122220262AProgram initiation/switching Digital data information retrieval
Model inference method and device, computer device, computer readable storage medium, and computer program product
CN121960793BResource allocation Inference methods

Get free access to AI patent search and analysis

Check patentability, review prior art and ask IP Agent with full patent context.

Smart Images

Figure CN122240133A_ABST

Patent Text Reader

Abstract

This application discloses a model deployment method, electronic device, storage medium, and program product, relating to the field of computer technology. The method includes: splitting the model to be deployed into multiple operator modules; obtaining the capability indicators of a computing power card; determining the operators supported by the computing power card based on the capability indicators; determining the target computing power card corresponding to the operator module based on the supported operators; and deploying the operator module to the target computing power card. This solves the problem of difficulty in distinguishing the capability differences of different computing power cards when deploying a neural network model across multiple computing power cards, leading to an increase in the overall execution time of the model. This method merges related operators into operator modules when splitting the model to be deployed, avoiding time loss due to data transmission. It determines the operators supported by the computing power card, the energy consumption cost and communication cost of the computing power card through capability indicators, and then determines the most suitable target computing power card for executing the operator module. This enables the deployment of the model on multiple computing power cards, reducing model execution time and improving computing power utilization.

Need to check novelty before this filing date? Find Prior Art