Multi-agent confrontation method and system based on dynamic graph neural network

A multi-agent and neural network technology, applied in the field of reinforcement learning, can solve problems such as multi-manual intervention, low efficiency, and slow model training speed, and achieve the effects of improving training efficiency, improving confrontation effects, and reducing human interference

Pending Publication Date: 2021-11-09
INST OF AUTOMATION CHINESE ACAD OF SCI +1
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to solve the above-mentioned problems in the prior art, that is, the existing graph neural network-based multi-agent model training speed is slow, the efficiency is low, and the problem of requiring more manual intervention in graph construction, the present invention provides a dynamic graph-based A multi-agent confrontation method of a neural network, the multi-agent confrontation method includes:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent confrontation method and system based on dynamic graph neural network
  • Multi-agent confrontation method and system based on dynamic graph neural network
  • Multi-agent confrontation method and system based on dynamic graph neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, not to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0055] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0056] The present invention provides a multi-agent confrontation method based on a dynamic graph neural network, which is different from the general multi-agent model, which can only model the relationship between the time series in the agent, and adopts a multi-agent based on a gra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of reinforcement learning of a multi-agent system, particularly relates to a multi-agent confrontation method and system based on a dynamic graph neural network, and aims at solving the problems that an existing multi-agent model based on the graph neural network is low in training speed and low in efficiency, and much manual intervention is needed in graph construction. The method comprises the following steps: obtaining an observation vector of each agent, and carrying out linear transformation to obtain an observation feature vector; calculating a connection relationship between adjacent agents, and constructing a graph structure between the agents; carrying out embedded representation on a graph structure between the intelligent agents in combination with the observation feature vectors; performing network space-time parallel training on the action prediction result of the action network and the evaluation of the evaluation network by using the embedded representation; and performing action prediction and action evaluation in multi-agent confrontation through the trained network. According to the method, a more real graph relationship is established through pruning, space-time parallel training is realized by utilizing the full-connection neural network and position coding, the training efficiency is high, and the effect is good.

Description

technical field [0001] The invention belongs to the field of reinforcement learning of multi-agent systems, and in particular relates to a multi-agent confrontation method and system based on a dynamic graph neural network. Background technique [0002] Reinforcement learning has many successful applications in many fields, and multi-agent technology, as an important branch of reinforcement learning, has also been studied by many scholars. An important multi-agent research direction is multi-agent cooperative confrontation. Multi-agent models are mainly divided into two categories, one is confrontation and the other is cooperation. A typical confrontation model is AlphaZero, which requires two agents to play a game to find a better strategy. The cooperative relationship requires multiple agents to cooperate to complete tasks, such as tennis doubles, football tasks, etc. [0003] For multi-agent tasks, one of the most direct ideas is to directly equip each agent with a sin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/082G06N3/045
Inventor 何赛克张连怡闫硕熊彦钧郑晓龙曾大军
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products