A model switching method and related device
By adaptively selecting and switching AI models through the model management system, the problem of large models being unable to meet the needs of complex business applications has been solved, achieving more efficient task solving capabilities and cost optimization.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HUAWEI CLOUD COMPUTING TECHNOLOGIES CO LTD
- Filing Date
- 2024-12-17
- Publication Date
- 2026-06-19
AI Technical Summary
In existing technologies, large-scale model applications struggle to meet business needs when faced with complex operations, and the fixed use of the same model leads to reduced end-to-end accuracy, increased costs, and longer response times.
This paper provides a model switching method that adaptively selects and switches AI models through a model management system. It selects a suitable model based on feature matching of query requests and optimizes model parameters by combining integer programming and mixed integer programming to achieve adaptive model switching and parameter configuration.
It improved the ability to solve tasks ranging from simple to complex, increased end-to-end accuracy, shortened response latency, reduced costs, and met business needs.