Unlock instant, AI-driven research and patent intelligence for your innovation.

A self-evolution generation method for multi-agent action strategy

A multi-agent, intelligent body technology, applied in the field of agents, can solve the problems of lack of cases, not considering the increase and decrease of agents and clustering, unable to effectively give action strategies, etc., to achieve strong robustness and increased robustness. awesome effect

Active Publication Date: 2022-07-05
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The commonly used theories such as analytic hierarchy process, evidence fusion method and multi-attribute decision-making mainly rely on expert knowledge and experience database for decision-making, which lacks enough cases for judgment, and the agent does not have self-exploration ability. In the battlefield environment, the increase, decrease, and clustering of agents in the entire intelligent system are not considered; the strategy generation method based on neural networks relies on large-scale supervised learning, and cannot effectively give action strategies in the face of small sample combat cases

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A self-evolution generation method for multi-agent action strategy
  • A self-evolution generation method for multi-agent action strategy
  • A self-evolution generation method for multi-agent action strategy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be described in detail below with reference to the accompanying drawings and embodiments.

[0036] The invention provides a multi-agent action strategy self-evolution generation method, which is suitable for the multi-agent action strategy autonomous generation. Among them, the agent is an abstract concept of a real entity, and the scope can include the ability to dynamically perceive; to perform actions and obtain feedback; to obtain evaluation information on the feedback results. Its main entities can include the following: fire-fighting drones in forest fires; rescue robots in natural disaster rescue missions; reconnaissance aircraft and intelligent strike weapons in military strike missions.

[0037] All of the above scenarios have the following characteristics:

[0038] The action strategy of the agent is restricted by certain rules. The strategic behavior of an agent must not exceed the scope of its capabilities. For example, unmanned ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-agent action strategy self-evolution generation method, which has strong robustness and self-adaptive ability, and is suitable for the rapid generation of the action strategy of the agent under the highly dynamic changing situation of the battlefield. Initialize the public neural network, which contains the actor network and the critic network. After initializing the settings, calculate the reward value obtained after the current time node performs the action; update the time node t, that is, t increments by 1. Calculate the state reward value of the last time node t in the current time series; update the state reward value of time node t+1, update the gradient value of the actor network parameters in the current thread, and update the critic network parameters in the current thread. The gradient value of network parameters; Update two global parameters of the public neural network. After the training of the public neural network is completed, a strategy generation model is formed, and in the face of a new air combat environment change, the state features and actions of the new air combat environment are input into the strategy generation model, and an action strategy is output.

Description

technical field [0001] The invention relates to the technical field of agents, in particular to a method for self-evolution and generation of multi-agent action strategies. Background technique [0002] In the real natural environment, unmanned equipment can be regarded as an intelligent body with perception and action ability. Since the situation information is usually in the process of constant change, the surrounding environment, allocatable resources, and macro tasks of the intelligent body may appear in a short period of time. And the rapid change of information such as the agent's own capabilities. In the limited time for action planning, the ability to quickly generate action strategies and realize the ability of multi-agent unified deployment and coordinated operations is the current focus of research on action strategies for multi-agent complex systems. It has a very wide range of applications in the fields of intelligent scheduling, industrial intelligence and com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/04G06N3/08
Inventor 王玥庄星尹昊刘劲涛李柯绪
Owner BEIJING INSTITUTE OF TECHNOLOGYGY