Multi-agent group cooperation strategy automatic generation method

An automatic generation and intelligent agent technology, applied in the field of artificial intelligence, can solve problems such as slow learning speed and difficult algorithm stability, and achieve the effect of improving training efficiency, improving generation and evaluation efficiency
CN112488310APending Publication Date: 2021-03-12厦门渊亭信息科技有限公司

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
厦门渊亭信息科技有限公司
Publication Date
2021-03-12

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to the field of artificial intelligence, and discloses a multi-agent group cooperation strategy automatic generation method, which defines agents and strategy networks thereof according to a specific application environment, evaluates the networks and experiences, and realizes automatic generation of a multi-agent cooperation strategy. The adopted algorithm provides three innovations on the basis of the MADDPG algorithm: trace information, multi-agent cooperative team formation and birth and death training. The learning history of the intelligent agent in the environmentcan leave a trace amount of information in the environment, and the user can learn the experience of other people through the trace amount of information intelligent agent to avoid walking; the training efficiency can be improved through cooperative team formation of the multiple intelligent agents; finally, the agents with excellent learning ability in the environment are inherited to all information of themselves through filial generations to continue to be trained through birth and death training, the agents with poor learning ability in the environment return to the initial point to be trained again through death, and the generation and evaluation efficiency of the multi-agent cooperation strategy can be greatly improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the field of artificial intelligence, in particular to a method for automatically generating multi-agent group cooperation strategies. Background technique

[0002] MADDPG is a multi-agent reinforcement learning framework based on the deep deterministic policy gradient algorithm, which can be used for the automatic generation of multi-agent cooperative strategies.

[0003] In a multi-agent system, each agent learns to improve its strategy by interacting with the environment to obtain a reward value (reward), so that the process of obtaining the optimal strategy in the environment is multi-agent reinforcement learning.

[0004] In single-agent reinforcement learning, the environment of the agent is stable, but in multi-agent reinforcement learning, the environment is complex and dynamic, which brings great difficulties to the learning process.

[0005] Dimension Explosion: In monolithic reinforcement learning, state-value funct...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More