A hybrid sensing autonomous obstacle avoidance method and system for unmanned boats based on reinforcement learning

A reinforcement learning, unmanned boat technology, applied in control/adjustment systems, two-dimensional position/channel control, instruments, etc., can solve problems such as difficulty in adapting to large-scale complex environments, lack of autonomous learning, and cumbersomeness. Achieving the effect of reliable and stable threat evasion capability

Active Publication Date: 2022-07-15
SHANGHAI JIAO TONG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although these methods have relatively good performance in their respective application backgrounds, they need to be carefully designed for different functional modules, and the comprehensive algorithm needs to be configured and adjusted as a whole, which makes the realization of the intelligent algorithm of the unmanned vehicle complex and cumbersome.
Furthermore, due to the lack of self-learning ability of these methods, it is difficult to adapt to large-scale and complex environments, and it is necessary to redesign, reorganize and cooperate with different algorithm modules for different environments.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A hybrid sensing autonomous obstacle avoidance method and system for unmanned boats based on reinforcement learning
  • A hybrid sensing autonomous obstacle avoidance method and system for unmanned boats based on reinforcement learning
  • A hybrid sensing autonomous obstacle avoidance method and system for unmanned boats based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0037] like figure 1 As shown, a hybrid perception autonomous obstacle avoidance method for unmanned boats based on reinforcement learning, the method includes the following steps:

[0038] 1) Build the marine environment: establish the rules for the interaction between the unmanned boat and the marine environment, generate random obstacles, and randomly generate the starting point and end point of the unmanned boat;

[0039] The rules of interaction between the unmanned vehicle and the marine environment follow the dynamic equation of the unmanned vehicle itself:

[0040]

[0041]

[0042] Among them, η = [x, y, ψ] T Contains the position and yaw angle information of the unmanned boat, v=[u,υ,r] T Including sway, surge, and yaw speed information, τ=[τ u ,0,τ t ] T is the sway force and the yaw force of the unmanned boat, M is the mass of the unmanned boat, R(ψ) is the function of the yaw angle ψ, and C(v) and g(v) are the functions of v respectively;

[0043] The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method and system for autonomous obstacle avoidance based on reinforcement learning based on hybrid perception of unmanned boats. The method includes the following steps: 1) building a marine environment; The provided global planning information and the obstacle information within the detection radius of the radar system are learned to obtain the reinforcement learning state code; 3) Set the reward target weight to obtain the comprehensive reward function; 4) Build and train the evaluation network and policy network; The learning state code is input to the evaluation network and the policy network respectively, the comprehensive reward function is input to the evaluation network, and the output of the controller is determined according to the action corresponding to the learned mean value of the policy network. Compared with the prior art, the present invention has a high self-learning ability, and can adapt to different large-scale complex environments through simple deployment and training, thereby realizing autonomous perception, autonomous navigation, and autonomous obstacle avoidance.

Description

technical field [0001] The invention relates to an autonomous obstacle avoidance method and system for an unmanned boat, in particular to a hybrid perception autonomous obstacle avoidance method and system for an unmanned boat based on reinforcement learning. Background technique [0002] Unmanned boat is a surface unmanned vehicle that can realize autonomous navigation, autonomous obstacle avoidance, and autonomous surface operations. It has the advantages of small size, high speed, good stealth and no risk of casualties. Unmanned boats are very suitable for performing surface operations tasks in dangerous sea areas with high risk of casualties or simple surface operations tasks that require low personnel participation. It has a good cost-effectiveness ratio, so it is widely and effectively used in marine monitoring. , marine survey, maritime search and rescue, unmanned cargo and other fields. [0003] At present, the mainstream idea of ​​realizing autonomous navigation of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02
CPCG05D1/0257G05D1/0223G05D1/0221G05D1/0276
Inventor 张卫东王雪纯徐鑫莉蔡云泽
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products