Wargame multi-entity asynchronous collaborative decision-making method and device based on reinforcement learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and collaborative decision-making, applied in neural learning methods, machine learning, biological models, etc., can solve problems such as difficult to effectively solve multi-entity asynchronous cooperation in war chess, inconsistent execution time of basic actions, etc. The effect of high combat efficiency and high final win rate

Active Publication Date: 2022-08-09

NAT UNIV OF DEFENSE TECH

View PDF8 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] Second, the heterogeneity of multiple entities in wargames leads to the asynchrony of multi-agent collaboration, that is, the execution time of basic actions of different agents is inconsistent.

This asynchrony makes it difficult for existing multi-agent reinforcement learning algorithms to effectively solve the problem of multi-entity asynchronous cooperation in war games

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.

[0046] like figure 1 As shown, a reinforcement learning-based multi-entity asynchronous collaborative decision-making method for war chess provided by this application, in one embodiment, includes the following steps:

[0047] Step 102: Obtain a wargame environment and a multi-entity asynchronous cooperative decision-making problem corresponding to the wargame environment, and perform modeling and analysis on the multi-entity asynchronous cooperative decision-making problem to obtain an initial model.

[0048] Specifically, the wargame environment and the multi-entity...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of intelligent decision making, and relates to a war game multi-entity asynchronous collaborative decision making method and device based on reinforcement learning, and the method comprises the steps: obtaining a war game deduction environment and a multi-entity asynchronous collaborative decision making problem, and carrying out the modeling analysis of the multi-entity asynchronous collaborative decision making problem, and obtaining an initial model; according to the initial model, a multi-agent deep reinforcement learning algorithm is adopted to establish an agent network model and a hybrid evaluation network model; training the agent network model and the hybrid evaluation network model to obtain a collaborative decision framework; a loss function of the multi-agent deep reinforcement learning algorithm is reconstructed by setting a weighting operator or optimizing the multi-agent deep reinforcement learning algorithm through multi-step return; using the reconstructed loss function to update the collaborative decision framework; and according to the updated collaborative decision framework, performing decision making on asynchronous collaboration of multiple entities. According to the method, multi-entity asynchronous collaborative decision-making in war game deduction can be realized.

Description

technical field [0001] The present application relates to the technical field of intelligent decision-making, in particular to a multi-entity asynchronous cooperative decision-making method and device for wargames based on reinforcement learning. Background technique [0002] Wargaming is a process of using wargames to simulate war activities. Wargame players use chessboards and pieces that represent the environment and military power to simulate war confrontation based on specific military rules and probability theory principles, and conduct process deduction, evaluation and optimization of combat plans. The wargame team of the National Defense University developed a strategic campaign wargame system, and pointed out the key problem that needs to be solved in the application of artificial intelligence technology to wargames - intelligent situational awareness. The early wargame agent design mainly used the high-level human player's deduction experience to form a knowledge ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06F30/27G06N3/00G06N3/04G06N3/08G06N20/00

CPCG06F30/27G06N3/006G06N3/08G06N20/00G06N3/044

Inventor张煜蒋超远罗俊仁李婷婷刘运杨景照刘果李鑫刘屹峰陈佳星

OwnerNAT UNIV OF DEFENSE TECH

Wargame multi-entity asynchronous collaborative decision-making method and device based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology