Course learning method for learning multi-robot formation navigation strategy under sparse reward signals
A multi-robot and learning method technology, applied in the field of multi-mobile robots, can solve the problem of difficulty in learning navigation strategies for robot formations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0042] like figure 1 and figure 2 Shown, a curriculum learning method for learning a multi-robot formation navigation policy under sparse reward signals, where a curriculum learning based on fusing relative performance and absolute performance is used to allow the multi-robot formation to still be able to Learn an effective navigation strategy; based on the fusion of relative performance and absolute performance curriculum learning, that is, as the training progresses, gradually switch from relative performance-based curriculum learning to absolute performance-based curriculum learning, in this way, in the training In the early stage, the basic navigation strategy is quickly mastered through the course learning based on relative performance, and the complex navigation strategy is overcome through the course learning based on absolute performance in the later stage of training.
[0043] Compared with the general multi-robot formation navigation method based on deep reinforcem...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


