Unlock instant, AI-driven research and patent intelligence for your innovation.

A robot path learning and obstacle avoidance system and method combined with deep q-learning

A robot and path technology, applied in control/regulation systems, instruments, two-dimensional position/channel control, etc., can solve problems such as slowing down the learning speed, robots falling into local optimal state, lack of prior knowledge, etc., to reduce Effects of search time, reduced redundant selection, and small action search space

Active Publication Date: 2022-05-06
HANGZHOU DIANZI UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although some self-learning methods can overcome the lack of prior knowledge to some extent, it may suffer from problems caused by over-learning due to various properties of task scenarios.
In contrast, less knowledge of the environment may slow down learning and cause the robot to get stuck in a local optimum
Also, for most learning methods, it is necessary to have a large matrix to hold the computed values

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A robot path learning and obstacle avoidance system and method combined with deep q-learning
  • A robot path learning and obstacle avoidance system and method combined with deep q-learning
  • A robot path learning and obstacle avoidance system and method combined with deep q-learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0043] The invention proposes a robot path learning and obstacle avoidance system and method combined with deep Q-learning. The Monte Carlo tree method is usually used to deal with the local optimum problem, so as to help the robot escape from obstacles. There is a successful example, Alpha Go, an AI robot from Google, uses a deep neural network to make an overall assessment of the current game state, and uses a Monte Carlo tree method to complete rigorous computational tasks. Similar to Go, for the task of robot path planning, the robot needs to heuristic the current confrontation environment as a whole to form a reasonable direction of action, and in extreme cases such as obstacles, it also needs a convergence strategy to form a precise path.

[0044] In the present invention, we propose Adaptive and Random Exploration Meth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot path learning and obstacle avoidance system and method combined with deep Q learning. The invention includes an action module, a learning module and an obstacle avoidance module. During path planning, the action module will receive instructions from the learning module and the obstacle avoidance module, and let the robot complete the specified actions according to the instructions. The learning module trains the action selection strategy based on the current state of the robot and the historical data sequence of actions. The obstacle avoidance module implements a random tree search algorithm to guide the robot to obtain a safe path from dangerous situations. After each module is executed, the current state of the robot in the environment will change. The risk of each module is assessed through the scheduling mechanism and the module with the least risk should be activated. The invention solves the problem of large-capacity data storage by using the Q-learning algorithm. Improved the efficiency of the robot when avoiding obstacles.

Description

technical field [0001] The invention belongs to the field of robot control, and in particular relates to a robot path learning and obstacle avoidance method combined with deep Q-learning. Background technique [0002] Exploring unknown environments with mobile robots is a very common problem for robotic applications like rescue and mining. Typically, robots need complex logic about obstacles and a topological map of the environment, aided by information from vision or depth sensors. However, these traditional methods do not have advanced human-brain-like intelligence. The present invention develops a machine learning approach for robots to explore unknown environments using raw sensor input. [0003] To date, research on robot path planning has been extensive, and the literature on modeling and solution methods is extensive. Research related to the realization of threat information can be divided into two categories: static path planning based on prior complete environmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02
CPCG05D1/0214G05D1/0221
Inventor 颜成钢裘健鋆路荣丰孙垚棋张继勇李宗鹏
Owner HANGZHOU DIANZI UNIV