Multi-agent group cooperation strategy automatic generation method
Patent Information
- Authority / Receiving Office: CN · China
- Patent Type: Applications (China)
- Current Assignee / Owner: 厦门渊亭信息科技有限公司
- Publication Date: 2021-03-12
Abstract
Description
Technical field
[0001] The invention relates to the field of artificial intelligence, in particular to a method for automatically generating multi-agent group cooperation strategies.
Background technique
[0002] MADDPG is a multi-agent reinforcement learning framework based on the deep deterministic policy gradient algorithm, which can be used for the automatic generation of multi-agent cooperative strategies.
[0003] In a multi-agent system, each agent improves its strategy by interacting with the environment and receiving reward values. The process by which the agents learn the optimal strategy in that environment in this way is called multi-agent reinforcement learning.
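The interaction loop described in [0003] can be illustrated with a minimal sketch. It is not MADDPG and not the method of the invention: it assumes two independent tabular learners in a hypothetical shared-reward game, and the names `reward`, `greedy` and `train`, the payoff rule, and all parameter values are illustrative assumptions.

```python
import random


def reward(joint_action):
    # Hypothetical shared-reward game (an assumption for illustration):
    # both agents receive 1.0 only when both choose action 1.
    return 1.0 if joint_action == (1, 1) else 0.0


def greedy(q_values):
    # Index of the largest estimate; ties resolve to the lowest index.
    return max(range(len(q_values)), key=lambda a: q_values[a])


def train(steps=2000, epsilon=0.3, alpha=0.1, seed=0):
    rng = random.Random(seed)
    # One value table per agent: q[i][a] estimates agent i's reward for action a.
    q = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(steps):
        # Each agent acts epsilon-greedily on its own estimates only.
        actions = tuple(
            rng.randrange(2) if rng.random() < epsilon else greedy(q[i])
            for i in range(2)
        )
        # The environment returns one reward for the joint action.
        r = reward(actions)
        # Each agent moves its estimate for the action it took toward the reward.
        for i, a in enumerate(actions):
            q[i][a] += alpha * (r - q[i][a])
    return q


if __name__ == "__main__":
    q = train()
    print("greedy actions:", [greedy(q_i) for q_i in q])
```

Each agent sees only its own action and the shared reward, so the reward it observes for a given action depends on what the other agent is currently doing.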
[0004] In single-agent reinforcement learning, the agent's environment is stationary. In multi-agent reinforcement learning, the environment is complex and changes dynamically as the other agents learn, which makes the learning process much more difficult.
[0005] Dimension explosion: In single-agent reinforcement learning, state-value funct...