Novel multi-algorithm access method and server
A multi-algorithm access method and server, applied in the field of algorithm deployment. It addresses problems such as wasted resources, the inability to serialize algorithms within a single request, and inadequate support for algorithm pre-processing and post-processing, and it achieves the effect of avoiding unnecessary time consumption.
Examples
Embodiment 1
[0047] A novel multi-algorithm access method, applied to a system that includes at least one GPU server, comprises the following steps:
[0048] Configuring the algorithm service: write a configuration file, and configure the algorithm service according to the algorithm service configuration information it contains;
[0049] Starting the algorithm service: start the monitoring process according to the algorithm service configuration, and start the algorithm service through the monitoring process;
[0050] Applying for an algorithm service connection: apply to connect to the algorithm service according to the configured algorithm service;
[0051] Calling the algorithm: submit an algorithm request, built from the algorithm's input, to the algorithm service so as to call the algorithm.
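The four steps above (configure, start, connect, call) can be sketched in a single process as follows. This is only an illustration under stated assumptions: the JSON configuration layout, the names Monitor, AlgorithmService and call_algorithm, and the two toy algorithms are all hypothetical, and the monitoring process is modeled as an ordinary object rather than a separate operating-system process.

```python
import json
from dataclasses import dataclass
from typing import Any, Callable, Dict

# Hypothetical configuration file content; the source does not specify a format.
CONFIG_TEXT = json.dumps({
    "services": [
        {"name": "matmul", "algorithm": "matrix_multiply"},
        {"name": "scale", "algorithm": "scale_by_two"},
    ]
})


def matrix_multiply(payload: Dict[str, Any]) -> list:
    """Plain-Python matrix product of payload['a'] and payload['b']."""
    a, b = payload["a"], payload["b"]
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def scale_by_two(payload: Dict[str, Any]) -> list:
    """Doubles every value in payload['values']."""
    return [2 * v for v in payload["values"]]


# Hypothetical lookup table from configured algorithm names to callables.
ALGORITHMS: Dict[str, Callable] = {
    "matrix_multiply": matrix_multiply,
    "scale_by_two": scale_by_two,
}


@dataclass
class AlgorithmService:
    name: str
    run: Callable
    running: bool = False


class Monitor:
    """In-process stand-in for the monitoring process."""

    def __init__(self, config_text: str) -> None:
        # Step 1: configure the services from the configuration file.
        self.config = json.loads(config_text)
        self.services: Dict[str, AlgorithmService] = {}

    def start_services(self) -> None:
        # Step 2: the monitor starts each configured algorithm service.
        for entry in self.config["services"]:
            service = AlgorithmService(entry["name"], ALGORITHMS[entry["algorithm"]])
            service.running = True
            self.services[service.name] = service

    def connect(self, name: str) -> AlgorithmService:
        # Step 3: a caller applies for a connection to a configured service.
        service = self.services.get(name)
        if service is None or not service.running:
            raise ConnectionError(f"service {name!r} is not running")
        return service


def call_algorithm(service: AlgorithmService, payload: Dict[str, Any]) -> Any:
    # Step 4: submit the request built from the algorithm's input.
    return service.run(payload)
```

A caller would create Monitor(CONFIG_TEXT), call start_services(), obtain a handle with connect("matmul"), and pass it to call_algorithm together with the input payload.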
[0052] The method can access a variety of deep learning forward-inference frameworks, such as TensorRT and Ncnn, as well as non-deep-learning algorithms, such as matrix multiplication and random forest, and provides personalized algorithm access such as detec...
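One way to place heterogeneous algorithms behind a single access point is a small registry of backends that share one interface. The sketch below is an assumption for illustration, not the patent's mechanism: AlgorithmBackend, register and create_backend are hypothetical names, and the two backends are toy non-deep-learning stand-ins (a real TensorRT or Ncnn backend would wrap that framework's own runtime behind the same infer method).

```python
from abc import ABC, abstractmethod
from collections import Counter
from typing import Any, Callable, Dict, Type


class AlgorithmBackend(ABC):
    """Hypothetical common interface that every accessed algorithm implements."""

    @abstractmethod
    def infer(self, inputs: Any) -> Any:
        ...


_REGISTRY: Dict[str, Type[AlgorithmBackend]] = {}


def register(name: str) -> Callable[[Type[AlgorithmBackend]], Type[AlgorithmBackend]]:
    """Class decorator that records a backend under a configuration name."""
    def decorator(cls: Type[AlgorithmBackend]) -> Type[AlgorithmBackend]:
        _REGISTRY[name] = cls
        return cls
    return decorator


@register("matrix_multiply")
class MatrixMultiply(AlgorithmBackend):
    def infer(self, inputs: Any) -> Any:
        a, b = inputs
        return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


@register("majority_vote")
class MajorityVote(AlgorithmBackend):
    """Toy stand-in for an ensemble: returns the most common label among votes."""

    def infer(self, inputs: Any) -> Any:
        return Counter(inputs).most_common(1)[0][0]


def create_backend(name: str) -> AlgorithmBackend:
    """Instantiate the backend registered under `name`."""
    try:
        return _REGISTRY[name]()
    except KeyError:
        raise ValueError(f"unknown backend: {name}") from None
```

With this layout, adding a new algorithm means adding one registered class; callers keep using create_backend(name).infer(inputs) unchanged.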
Embodiment 2
[0063] On the basis of Embodiment 1, a novel multi-algorithm access server is implemented using the multi-algorithm access method of Embodiment 1. The server includes at least one GPU server and comprises a main control process module, a resource allocation process module, and a monitoring process module. The main control process module reads configuration file information and sends connection requests. The resource allocation process module allocates and manages the shared memory and shared video memory used by the algorithm services. The monitoring process module monitors the status of the main control process module and the resource allocation process module, and recycles and restarts algorithm resources.
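The resource allocation role described here can be illustrated with host shared memory from the Python standard library. This is a minimal sketch under assumptions: ResourceAllocator and its per-owner bookkeeping are hypothetical, and shared video memory is not covered because it would require a GPU runtime that the source does not name. A monitoring module could call recycle for an owner once it detects that the owning algorithm process has stopped.

```python
from multiprocessing import shared_memory
from typing import Dict, List


class ResourceAllocator:
    """Tracks shared-memory blocks per owner so they can be recycled together."""

    def __init__(self) -> None:
        self._blocks: Dict[str, List[shared_memory.SharedMemory]] = {}

    def allocate(self, owner: str, size: int) -> shared_memory.SharedMemory:
        """Create a new shared-memory block of at least `size` bytes for `owner`."""
        block = shared_memory.SharedMemory(create=True, size=size)
        self._blocks.setdefault(owner, []).append(block)
        return block

    def recycle(self, owner: str) -> int:
        """Release every block held by `owner`; returns the number released."""
        blocks = self._blocks.pop(owner, [])
        for block in blocks:
            block.close()    # drop this process's mapping
            block.unlink()   # ask the OS to destroy the block
        return len(blocks)

    def owners(self) -> List[str]:
        """Names of owners that currently hold at least one block."""
        return sorted(self._blocks)
```

Another process can attach to a block by name with shared_memory.SharedMemory(name=block.name), which lets a caller and an algorithm service exchange input and output buffers without copying them through a pipe or socket.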
Embodiment 3
[0065] On the basis of the above embodiments, this embodiment also includes an algorithm process module, which schedules GPU resources, starts algorithms, and monitors algorithm tasks according to the configuration file read by the main control process module, and processes the algorithm's return according to the algorithm task result.
[0066] It also includes an external interface module, through which external callers invoke the algorithm service interface; the algorithm service interface includes a C interface, a C++ interface, and a Python interface.
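The duties listed for the algorithm process module (schedule GPU resources, start the algorithm, monitor the task, handle the return) can be sketched as below. Everything here is an assumption for illustration: AlgorithmProcess and TaskResult are hypothetical names, the scheduling policy (pick the device with the fewest active tasks) is one simple choice among many, and worker threads stand in for real GPU execution.

```python
import threading
from concurrent.futures import Future, ThreadPoolExecutor
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class TaskResult:
    device_id: int
    ok: bool
    value: Any = None
    error: str = ""


class AlgorithmProcess:
    """Assigns each task to the least-loaded device and reports its outcome."""

    def __init__(self, device_ids: List[int]) -> None:
        self._load: Dict[int, int] = {d: 0 for d in device_ids}
        self._lock = threading.Lock()
        self._pool = ThreadPoolExecutor(max_workers=len(device_ids))

    def _acquire(self) -> int:
        # Scheduling: fewest active tasks wins; ties go to the lower device id.
        with self._lock:
            device = min(self._load, key=lambda d: (self._load[d], d))
            self._load[device] += 1
            return device

    def _release(self, device: int) -> None:
        with self._lock:
            self._load[device] -= 1

    def submit(self, algorithm: Callable[..., Any], *args: Any) -> Future:
        """Start `algorithm` on a chosen device; the Future yields a TaskResult."""
        device = self._acquire()

        def run() -> TaskResult:
            try:
                return TaskResult(device, True, algorithm(*args))
            except Exception as exc:
                # Monitoring: a failed task is reported instead of raised.
                return TaskResult(device, False, error=str(exc))
            finally:
                self._release(device)

        return self._pool.submit(run)

    def shutdown(self) -> None:
        self._pool.shutdown(wait=True)
```

Returning a TaskResult rather than raising lets the caller decide how to process the algorithm's return based on the task outcome, which matches the role the paragraph assigns to this module.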