Multi-model reasoning service deployment method and device based on k8s cluster
A multi-model and model technology, applied in the field of cloud computing, can solve the problems of complex operation, does not support cluster elastic scaling, etc., to achieve the effect of simple deployment and operation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0038] It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are to distinguish two entities with the same name but different parameters or parameters that are not the same, see "first" and "second" It is only for the convenience of expression, and should not be construed as a limitation on the embodiments of the present invention, which will not be described one by one in the subsequent embodiments.
[0039] In one example, please refer to figure 1 As shown, the present invention provides a k8s cluster-based multi-model reasoning service deployment method, which specifically includes the following steps:
[0040] S100. Deploy a scheduling service in the smallest scheduling unit of the k8s cluster, and configure memory, computing resources, and scheduling policies for the scheduling service; wherein, the smallest scheduling unit is a pod;
[0041] S200. Deploy multiple model reasoning services according to the memory of...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com