The invention designs an AUV (Autonomous
Underwater Vehicle) three-dimensional path
planning method based on
reinforcement learning. The AUV three-dimensional path
planning method comprises the following steps: firstly, modeling a known
underwater working environment, and performing global path planning for an AUV; secondly, designing a bonus value specific to a special
working environment and a planning target of the AUV, performing
obstacle avoidance training on the AUV by using a
Q learning method improved on the basis of a self-organizing neural network, and writing an
obstacle avoidance strategy obtained by training into an internal
control system of a
robot; and finally receiving global path planning nodes after the
robot enters into water, calculating a target heading plan by the AUV with the global path planning nodes as target nodes for planning a
route, and avoiding obstacles by using the
obstacle avoidance strategy in case of emergent obstacles. Through adoption of the method, the economical efficiency of the AUV routing path is ensured, and the security in case of emergent obstacles is ensured. Meanwhile, the
route planning accuracy can be improved; the planning time isshortened; and the environmental adaptability of the AUV is enhanced. The method can be applied to the AUV which carriers an obstacle avoidance
sonar and can implement autonomous routing.