Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Artificial intelligence reinforcement learning service platform

A technology of reinforcement learning and service platform, which is applied in the field of reinforcement learning development platform, can solve problems such as difficult to quickly develop and verify code, a large number of computing resources, and time-consuming, so as to improve resource utilization, algorithm reliability, and efficient support The effect of service and lowering the threshold

Active Publication Date: 2020-07-17
COMP NETWORK INFORMATION CENT CHINESE ACADEMY OF SCI
View PDF1 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although these environments provide a standardized research environment in specific fields to a certain extent, the development and research based on these reinforcement learning environments still face the following problems: training reinforcement learning algorithms requires a large One-stop scientific research environment; it takes a lot of time to deploy the corresponding reinforcement learning development environment, and it is difficult to reproduce the algorithm due to different software versions and hyperparameters; the server side lacks visual development tools, and cannot observe the reinforcement learning agent environment in real time Simulation, difficult to quickly develop and verify code

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Artificial intelligence reinforcement learning service platform
  • Artificial intelligence reinforcement learning service platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] figure 1 It is a schematic structural diagram of an artificial intelligence reinforcement learning service platform provided by an embodiment of the present invention. Such as figure 1 As shown, the artificial intelligence reinforcement learning service platform provided by the embodiment of the present invention is structurally divided into an infrastructure layer, an application service layer, and an interface access layer, wherein:

[0031] The infrastructure layer is used to provide the network resources, computing resources, storage resources and virtualization service resources required by the reinforcement learning service platform, and provide cloud storage and cloud processing related services through virtualization, load balancing, disaster recovery backup and elastic computing technologies. IT infrastructure services.

[0032] The infrastructure layer adopts OpenStack cloud computing management platform, and calls OpenStack services through Python language,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an artificial intelligence reinforcement learning service platform. The system is structurally divided into an infrastructure layer, an application service layer and an interface access layer, the infrastructure layer provides network resources, computing resources, storage resources and virtualization service resources required by the reinforcement learning service platform, and provides IT infrastructure services related to cloud storage and cloud processing through virtualization, load balancing, disaster recovery backup and elastic computing technologies; the application service layer comprises a Project-based packaging and management module, a cloud development and debugging environment module and a virtual development environment interface module; the three systems provide various universal or self-defined cloud research environments for field researchers from top to bottom; the platform encapsulates data, algorithms and research environments involved inreinforcement learning research in a virtualization container in a Project form, and opens up an independent test environment for each user using the platform; and the interface access layer can enable the reinforcement learning researcher to manage the cloud computing environment in a self-service manner.

Description

technical field [0001] The invention relates to a pre-reinforcement learning development platform technology, in particular to an artificial intelligence reinforcement learning service platform. Background technique [0002] Machine learning is one of the core issues of artificial intelligence, which is to study and simulate human learning behavior, and to generate new knowledge through learning after acquiring knowledge. Data-based machine learning is one of the important methods in modern intelligent technology. Research starts from observed data (samples) to find laws and acquire knowledge, and use these laws and knowledge to predict future data or unobservable data through a certain learning mode. . Machine learning can be classified into supervised learning, unsupervised learning and reinforcement learning according to the learning mode. The goal of reinforcement learning is to learn the mapping from environmental state to behavior, so that the behavior selected by th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/20G06F8/30G06F11/36G06F9/455G06N20/00G06N3/10
CPCG06F8/24G06F8/315G06F11/3624G06F11/3664G06F9/45533G06N20/00G06N3/105
Inventor 王晓光曹荣强王珏周纯葆张博尧王彦棡
Owner COMP NETWORK INFORMATION CENT CHINESE ACADEMY OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products