Reinforcement learning training optimization method and device for multi-agent confrontation
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- NAT INNOVATION INST OF DEFENSE TECH PLA ACAD OF MILITARY SCI
- Publication Date
- 2020-04-10
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to the technical field of machine learning, in particular to a multi-agent confrontation-oriented reinforcement learning training optimization method and device. Background technique
[0002] Artificial intelligence is a technical science that researches and develops theories, methods, technologies and applications for simulating and expanding human intelligence. One of the main goals of artificial intelligence research is to simulate human decision-making by intelligent agents (Agents), so as to be competent for some complex tasks that require human intelligence to complete. The limited functionality of a single agent to cope with complex tasks has driven the concept of multi-agent systems. A multi-agent system is composed of multiple agents that can make independent decisions and interact with each other. They share the same environment and have perception and execution mechanisms. At present, multi-agent systems have become a...