Containerization test method and system for reinforcement learning model

A technology that reinforces learning and testing methods, applied in software testing/debugging, instrumentation, error detection/correction, etc., can solve problems such as dependencies, poor isolation of the testing process, and high testing environment requirements, to ensure fairness, achieve visualization, The effect of ensuring security and privacy

Pending Publication Date: 2021-10-22
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The traditional reinforcement learning model testing needs to build a test environment in the test server and connect the test objects to the test server. The problem with this approach is: First, from the requirements of the test environment, the testing process of the reinforcement learning model relies heavily on The test environment and its configuration files generally have high requirements for the test environment; secondly, in terms of the isolation of the test, the traditional te

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Containerization test method and system for reinforcement learning model
  • Containerization test method and system for reinforcement learning model

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0083] The invention will be described in further detail below with reference to the accompanying drawings, and the exemplary examples are intended to be construed as not intended to limit the scope of the invention.

[0084] In the present embodiment, the driver's decision control algorithm in automatic driving is used as a test agent (strengthening learning model). The tester refers to the main body of the test and evaluation of the drone decision control algorithm. It provides a Docker mirror file containing the test environment, test agency template (drone decision control algorithm) for the drone decision control algorithm. Unmanned car decision control algorithm and test results; test platform is a management platform for unmanned vehicle decision control algorithm testing, used to create and manage test projects, display test procedures in real time, show test results, perform test and test Split to achieve the decoupling of the test environment and the survey agent; the te...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a containerized testing method and system for a reinforcement learning model. The method comprises the following steps that 1) a test party makes a test environment Docker mirror image, a proxy Docker mirror image and a corresponding connection module, a callback module and an evaluation module according to a to-be-established test task, and then packages the connection module into a proxy Docker mirror image file template; 2) the test party creates a test task on the test platform and uploads a mirror image file to a test party server; 3) the tested party downloads a mirror image file training agent of the test task, integrates the trained agent to an agent Docker mirror image and uploads the Docker mirror image to a server of the test party; 4) the test party server adds or replaces the callback module and the evaluation module in the newly uploaded proxy Docker mirror image file, repackages the newly uploaded proxy Docker mirror image file to obtain a new proxy Docker mirror image, and starts to run the test task; and 5) the test party server transmits the test process data back to the test platform.

Description

technical field [0001] The invention belongs to the technical field of computer software, and in particular relates to a containerized testing method and system for reinforcement learning models. Background technique [0002] Reinforcement learning is a learning method that is closer to life in reality. Unlike "deep learning" technology, it does not use pre-labeled data, but guides behavior through the rewards obtained by the agent interacting with the environment. The goal is to make the agent Get as many rewards from the environment as possible to learn the optimal policy. The testing of reinforcement learning needs to rely on the environment of reinforcement learning. In principle, the difference between reinforcement learning and deep learning is that the former needs to interact with the test environment online in real time, generate corresponding behaviors based on environmental feedback, and then make relevant judgments and evaluations; while the latter is not depend...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/36G06F9/455
CPCG06F11/3684G06F11/3688G06F9/45558G06F2009/45562Y02D10/00
Inventor 董乾薛云志孟令中杨光师源王鹏淇武斌
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products