Unlock instant, AI-driven research and patent intelligence for your innovation.

Collaborative battle method and device of intelligent agent

An intelligent body and air combat technology, applied in the field of artificial intelligence, can solve problems such as difficulty in determining target rewards, complex calculations, and instability, and achieve the effect of overcoming difficulty and instability in target rewards

Active Publication Date: 2022-01-07
NO 15 INST OF CHINA ELECTRONICS TECH GRP
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in multi-agent reinforcement learning, the environment is complex and changeable, the state space will grow exponentially with the increase of agents, the problem of dimension explosion may occur, and the calculation is complicated; at the same time, there are difficulties and instability in determining the target reward. The definition of the reward function will be affected by the different cooperation and tasks between multi-agents, and when the strategy of each agent changes, the strategies of other agents will also change, affecting the final convergence of the algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Collaborative battle method and device of intelligent agent
  • Collaborative battle method and device of intelligent agent
  • Collaborative battle method and device of intelligent agent

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0080] An embodiment of the present invention provides a method for cooperative fighting of agents, which is used for fighting among multiple agents. The execution subject of this embodiment is a cooperative battle device of an intelligent body, which is set on the intelligent body, and the intelligent body may be an unmanned aerial vehicle or a manned machine.

[0081] refer to figure 1 , which shows a flow chart of the steps of an embodiment of an intelligent agent cooperative battle method according to the present invention, the method may specifically include the following steps:

[0082] S101. Determine the virtual air combat scene where the agents fight;

[0083] Specifically, based on...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a collaborative battle method and device of an intelligent agent, terminal equipment and a storage medium. The method comprises the following steps: determining a virtual air battle scene in which the intelligent agent carries out battle; according to the virtual air battle scene, determining action space information and state space information of one or more agents, and according to the state value, determining a reward value of an action corresponding to the state value; according to the virtual air battle scene, the action space information, the state space information and the reward value, training the initial reinforcement learning model, and when the initial reinforcement learning model is in a convergence state, obtaining a target reinforcement learning model; and using the target reinforcement learning model and the rule agent for fighting, and solving the problem that target reward is difficult and unstable. And when the strategy of the multiple agents is changed, reinforcement learning of the multiple agents cannot be affected.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a method, device, terminal equipment and storage medium for cooperative combat of intelligent agents. Background technique [0002] When multiple unmanned combat aircraft make autonomous maneuver decisions in air combat, they need to carry out decision-making cognition and coordination. Because the environment of unmanned combat aircraft is relatively complex, and the coordination between aircraft needs to be considered, how to realize the autonomous control of combat aircraft is a research focus. [0003] Traditional UAV control relies on expert knowledge to deal with different situations through expert judgment on the environment and experience construction rules, but this requires experts to have high experience and knowledge, and it takes a lot of time and energy to consider all situations. With the development of artificial intelligence technology, d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): A63F13/52G06K9/62G06N3/08
CPCA63F13/52G06N3/08A63F2300/8029G06F18/214
Inventor 黄茗王滨原鑫李波
Owner NO 15 INST OF CHINA ELECTRONICS TECH GRP