The invention discloses a multi-
robot collaborative navigation and
obstacle avoidance method. The method comprises the following steps of modeling a
decision process of a
robot in an unknown environment according to a
partially observable Markov decision process; according to the environment modeling information of the current
robot, introducing a depth deterministic strategy gradient
algorithm, extracting a sampled image sample, and inputting the sampled image sample into a
convolutional neural network for
feature extraction; improvement being carried out on the basis of a depth deterministic strategy gradient
algorithm, a long-short-
term memory neural network being introduced to enable the network to have memorability, and image data being more accurate and stable by using a frame skipping mechanism; and meanwhile, an experience
pool playback mechanism being modified, and a priority being set for each stored experience sample so that few and important experiences can be more applied to learning, and learning efficiency is improved; and finally, a multi-robot navigation
obstacle avoidance simulation system being established. The method is advantaged in that the robot is enabled to learn navigation and
obstacle avoidance from easy to difficult by adopting a curriculum type learning mode so that the training speed is accelerated.