Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Novel multi-algorithm access method and server

A multi-algorithm and server technology, applied in the field of algorithm deployment, can solve problems such as waste of resources, inability to implement single request algorithm serialization, support for algorithm pre-processing and post-processing of algorithms cannot meet the needs, and achieve the effect of avoiding time consumption

Pending Publication Date: 2021-11-09
杭州英歌智达科技有限公司
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If these applications have parameters that use the same algorithm (such as the model parameters of the deep learning algorithm), resources will be wasted; at the system level, the priority of the algorithm and the competition for resources cannot be controlled, resulting in unexpected results
[0004] Current open source algorithm services (such as Nvidia's Triton) cannot realize the serialization of different algorithms in a single request, resulting in unnecessary data copy and synchronization; The service does not provide a specific optimization solution, and the resources are not fully scheduled to meet the demand
[0005] The current open source algorithm services mainly provide access to the backend of deep learning algorithm models. They are not friendly to the access support of non-deep learning algorithms, and the support for algorithm pre-processing and algorithm post-processing cannot meet the needs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel multi-algorithm access method and server
  • Novel multi-algorithm access method and server
  • Novel multi-algorithm access method and server

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0047] A new method for accessing multiple algorithms, including at least one GPU server, the method includes,

[0048] Configure the algorithm service, write the configuration file, and configure the algorithm service according to the algorithm service configuration information;

[0049] Start the algorithm service, start the algorithm service through the monitoring process, and start the monitoring process according to the configuration of the algorithm service;

[0050] Algorithm service application connection, according to the configured algorithm service, apply for connection algorithm service;

[0051] Algorithm call, according to the input of the algorithm, submit the algorithm request to the algorithm service, so as to call the algorithm.

[0052]Access a variety of deep learning algorithm forward frameworks; such as TensorRT, Ncnn; non-deep learning algorithms, such as matrix multiplication, random forest, etc., and provide personalized algorithm access such as detec...

Embodiment 2

[0063] On the basis of embodiment 1, a new type of multi-algorithm access server is implemented through the new multi-algorithm access method implemented in embodiment 1, including at least one GPU server, which is realized through a new multi-algorithm access method The server includes a main control process module, a resource allocation process module and a monitoring process module; the main control process module is used to read configuration file information and send connection requests; the resource allocation process module is used to allocate shared memory and shared video memory in management algorithm services; monitoring The process module is used to monitor the status of the main control process module and the resource allocation process module, and recycle and restart the algorithm resources.

Embodiment 3

[0065] On the basis of the above embodiments, this embodiment also includes an algorithm process module, which is used to schedule GPU resources, start the algorithm and monitor algorithm tasks, read the configuration file according to the master control process module, and process the algorithm return according to the algorithm task result.

[0066] It also includes an external interface module, which is used to call the interface of the algorithm service externally, and the interface of the algorithm service includes a c interface, a c++ interface and a python interface.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of algorithm deployment, and discloses a novel multi-algorithm access method and a server. The method comprises the following steps that: an algorithm service is configured, a configuration file is compiled, and the algorithm service is configured according to algorithm service configuration information; the algorithm service is started, the algorithm service is started through the monitoring process, and the monitoring process is started according to configuration of the algorithm service; the algorithm service applies for connection, and the algorithm service applies for connection according to the configured algorithm service; and an algorithm is called, and according to the input of the algorithm, an algorithm request is submitted to the algorithm service so as to call the algorithm. On an algorithm server with a GPU, the application of the algorithm service is used, the used resources of CPU / GPU copy bandwidth and video memory are less, the utilization rate of the GPU is higher, more algorithm requests can be accessed, and the use efficiency of the server is further improved.

Description

technical field [0001] The present invention relates to the field of algorithm deployment, in particular to a new method and server for accessing multi-algorithms based on the deployment of conventional algorithms and deep learning algorithms on servers. Background technique [0002] With the rapid development of deep learning in the past few years, the number of products in computer vision and data processing has increased day by day, and the corresponding algorithm server also needs to support the simultaneous operation of multiple algorithms and access to multi-channel front-end devices. The current way of server management algorithm has the following problems: [0003] If each user application independently accesses the algorithm backend, each application will independently apply for algorithm resources. If these applications have parameters that use the same algorithm (such as the model parameters of the deep learning algorithm), resources will be wasted; at the system...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/445G06F9/448G06F9/50G06F11/30
Inventor 罗涛徐联伯
Owner 杭州英歌智达科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products