Cooperative battle method and device for intelligent agents

A cooperative air combat technique for intelligent agents, applied in the field of artificial intelligence. It addresses problems such as the difficulty of determining target rewards, complex calculations, and dimension explosion, and it overcomes the difficulty and instability of determining target rewards.

Active Publication Date: 2022-03-25
NO 15 INST OF CHINA ELECTRONICS TECH GRP

AI Technical Summary

Problems solved by technology

However, in multi-agent reinforcement learning the environment is complex and changeable. The state space grows exponentially with the number of agents, so dimension explosion can occur and the calculation becomes complicated. The target reward is also difficult to determine and unstable: the definition of the reward function depends on the cooperation and tasks among the agents, and when the strategy of one agent changes, the strategies of the other agents change as well, which affects the final convergence of the algorithm.
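The exponential growth of the state space can be made concrete with a short calculation. The sketch below is illustrative only: it assumes every agent has the same number of discrete local states and that the joint state is the Cartesian product of the local states. The function name `joint_state_count` and the figure of 100 states per agent are hypothetical and do not come from the patent.

```python
def joint_state_count(states_per_agent: int, num_agents: int) -> int:
    """Number of joint states when each agent has its own independent local state.

    Assumption: the joint state space is the Cartesian product of the
    per-agent state spaces, so its size is states_per_agent ** num_agents.
    """
    return states_per_agent ** num_agents


# Illustrative figure only: 100 discrete local states per agent.
for n in (1, 2, 4, 8):
    print(n, joint_state_count(100, n))
# 8 agents already give 10**16 joint states under these assumptions.
```

Under these assumptions, a table indexed by the joint state becomes impractical after only a handful of agents, which is the dimension explosion described above.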

Examples

Embodiment Construction

[0079] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be described in detail below with reference to the accompanying drawings and examples.

[0080] An embodiment of the present invention provides a cooperative battle method for agents, which is used for combat among multiple agents. The execution subject of this embodiment is a cooperative battle device, which is installed on the agent. The agent may be an unmanned aerial vehicle or a manned aircraft.

[0081] Refer to Figure 1, which shows a flowchart of the steps of an embodiment of the agent cooperative battle method according to the present invention. The method may specifically include the following steps:

[0082] S101. Determine the virtual air combat scene in which the agents fight;

[0083] Specifically, based on...

Abstract

The present invention relates to a method, device, terminal equipment and storage medium for cooperative combat of intelligent agents. The method determines the virtual air combat scene in which the agents fight; determines, according to the virtual air combat scene, the action space information and state space information of one or more agents; determines, according to the state value, the reward value of the action corresponding to that state value; and trains an initial reinforcement learning model according to the virtual air combat scene, action space information, state space information and reward value. When the initial reinforcement learning model reaches a convergent state, the target reinforcement learning model is obtained. The target reinforcement learning model is then used to fight against the regular agent. This overcomes the difficulty and instability of determining the target reward, and a change in the strategies of the agents does not affect the reinforcement learning of the multiple agents.
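The sequence in the abstract (scene, action space, state space, reward, training until convergence, then use of the trained model) can be sketched on a toy problem. The example below is not the patent's method: the visible text does not name a learning algorithm, so tabular Q-learning updates applied in exhaustive sweeps are swapped in as a stand-in. The one-dimensional arena, the stationary opponent, the reward values, the convergence tolerance, and every identifier (`step`, `train`, `greedy_rollout`, `N_POSITIONS`, `TARGET`, `ACTIONS`) are hypothetical.

```python
N_POSITIONS = 7        # hypothetical 1-D arena standing in for the virtual scene
TARGET = 6             # hypothetical stationary opponent position
ACTIONS = (-1, 0, 1)   # action space: move left, hold, move right


def step(state, action):
    """Environment transition: return (next_state, reward, done)."""
    nxt = min(max(state + action, 0), N_POSITIONS - 1)
    done = nxt == TARGET
    # Assumed reward: bonus on reaching the opponent, small cost per step.
    reward = 1.0 if done else -0.01
    return nxt, reward, done


def train(alpha=0.5, gamma=0.9, tol=1e-6, max_sweeps=10_000):
    """Apply Q-learning updates over every (state, action) pair per sweep.

    Training stops when the largest update in a sweep falls below `tol`,
    which is the (assumed) convergence criterion of this sketch.
    """
    q = [[0.0] * len(ACTIONS) for _ in range(N_POSITIONS)]
    for sweep in range(1, max_sweeps + 1):
        max_delta = 0.0
        for state in range(N_POSITIONS):
            if state == TARGET:
                continue  # terminal state: nothing to learn
            for a, action in enumerate(ACTIONS):
                nxt, reward, done = step(state, action)
                td_target = reward if done else reward + gamma * max(q[nxt])
                delta = alpha * (td_target - q[state][a])
                q[state][a] += delta
                max_delta = max(max_delta, abs(delta))
        if max_delta < tol:
            return q, sweep
    raise RuntimeError("Q-table did not converge")


def greedy_rollout(q, start, max_steps=20):
    """Follow the learned greedy policy and return the visited states."""
    path, state = [start], start
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


q_table, sweeps = train()
print(greedy_rollout(q_table, start=0))  # -> [0, 1, 2, 3, 4, 5, 6]
```

In this sketch the converged table plays the role of the target model, and the greedy rollout stands in for deploying it against an opponent. A real air combat scene would involve multiple agents, continuous states, and a function approximator rather than a table.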

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, in particular to a method, device, terminal equipment and storage medium for cooperative combat of intelligent agents.

Background technique

[0002] When multiple unmanned combat aircraft make autonomous maneuver decisions in air combat, they need to carry out cognition, decision-making and coordination. Because the environment of unmanned combat aircraft is relatively complex, and the coordination between aircraft must be considered, how to realize the autonomous control of combat aircraft is a research focus.

[0003] Traditional UAV control relies on expert knowledge: experts judge the environment and construct rules from experience to deal with different situations. This requires experts with extensive experience and knowledge, and it takes a lot of time and energy to consider all situations. With the development of artificial intelligence technology, d...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): A63F13/52, G06K9/62, G06N3/08
CPC: A63F13/52, G06N3/08, A63F2300/8029, G06F18/214
Inventor: 黄茗, 王滨, 原鑫, 李波
Owner: NO 15 INST OF CHINA ELECTRONICS TECH GRP