A human-swarm coordination method based on hierarchical multi-agent reinforcement learning
By adopting a three-layer hierarchical multi-agent reinforcement learning architecture, the decision-making difficulties of multi-agent systems in large-scale complex environments are solved, and efficient and interpretable drone swarm control under human-machine collaboration is realized, improving the adaptability of the strategy and the efficiency of cooperation.
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- UNIV OF SCI & TECH OF CHINA
- Filing Date
- 2026-03-11
- Publication Date
- 2026-06-19
AI Technical Summary
Existing multi-agent reinforcement learning methods suffer from poor convergence, low collaboration efficiency, and insufficient policy generalization ability in large-scale, dynamically changing, and complex environments, especially in human-machine collaboration and dynamic preference adjustment.
A three-layer hierarchical multi-agent reinforcement learning architecture is adopted, including a top-level human-cluster interaction module, a middle-level target selection module, and a bottom-level policy collaboration execution module. Through centralized training, distributed execution, and attention mechanisms, it achieves seamless integration of human commander intent and agent decision-making and policy scalability.
It improves the decision-making transparency and adaptability of UAV swarms in complex adversarial environments, enables efficient and interpretable swarm control, and allows for real-time response to tactical changes while maintaining tactical advantage.
Smart Images

Figure 1 
Figure 2