Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-agent training method and system under complex conditions

A multi-agent, training method technology, applied in the direction of communication interference, electrical components, instruments, etc., can solve the problems of poor training effect, unstable training, long training period, etc., to improve quality, shorten training time, solve problems Effects on non-stationary problems

Active Publication Date: 2021-03-12
NAT INNOVATION INST OF DEFENSE TECH PLA ACAD OF MILITARY SCI +1
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In summary, in the training process of multi-agent deep reinforcement learning with complex scenarios, especially in the cooperation / competition scenario of training multi-agents, with the increase of agents, the existing training methods have training instability, Poor training effect and long training period

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent training method and system under complex conditions
  • Multi-agent training method and system under complex conditions
  • Multi-agent training method and system under complex conditions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0073] In order to better understand the present invention, the content of the present invention will be further described below in conjunction with the accompanying drawings and examples.

[0074] The present invention is different from the existing single-scenario research. The inventor proposes a multi-agent learning method for complex scenario course learning (SCL). The SCL method starts from learning a simple multi-agent scene and gradually increases the number of agents. The number and the complexity of the environment can finally achieve the purpose of learning the target task, solve the non-stationary problem of multi-agent reinforcement learning and improve the training effect.

[0075] like figure 1 As shown, a kind of training method for multi-agents under complex conditions provided by the present invention includes:

[0076] S1 builds training scenarios and agent models based on training objectives;

[0077] S2 decomposes the training scene into multiple course ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-agent training method and system under complex conditions, and the method comprises the steps: building a training scene and an agent model based on a training target; sequentially decomposing the training scene into a plurality of course tasks from simple to complex according to the complexity of the scene; sequentially selecting curriculum tasks for training according to the scene complexity by utilizing the agent model to obtain strategies of each agent; wherein the training result of the previous course task is used as the initial condition of the next coursetask in the training process. According to the invention, the training scene is sequentially decomposed into a plurality of course tasks from simple to complex according to the complexity of the scene, the problem of instability in multi-agent reinforcement learning is solved, the training effect is improved, and the training time is shortened at the same time.

Description

technical field [0001] The invention relates to the technical field of agent control, in particular to a training method and system for multi-agents under complex conditions. Background technique [0002] In the study of multi-agent systems, an intuitive research method is to predefine behavior rules for agents. In task execution, each agent implements various behaviors according to preset rules. However, this method needs to define a large number of behavioral rules to deal with various possible situations in the environment. In a complex environment, it is difficult to enumerate various situations in the environment, and the behavior of other agents will make the environment change continuously. Therefore, in a complex environment, multi-agents need to learn new behaviors through continuous interaction with the environment to ensure the execution performance of tasks. Reinforcement learning (RL), as a learning mode that does not rely on prior knowledge and data, is an e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N20/00H04K3/00
CPCG06N20/00H04K3/80Y02T10/40
Inventor 史殿习张耀文张拥军武云龙秦伟徐天齐王功举
Owner NAT INNOVATION INST OF DEFENSE TECH PLA ACAD OF MILITARY SCI
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More