Nao robot path planning method based on deep Double-Q network

A path planning and robotics technology, applied in the direction of instruments, motor vehicles, two-dimensional position/channel control, etc., can solve the problems of not being able to obtain environmental information, achieve the effect of improving generalization ability and solving path planning problems

Pending Publication Date: 2021-12-31
NORTHWESTERN POLYTECHNICAL UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of the above algorithms need to have a full understanding of the surrounding environment information of the robot, and then carry out navigation planning based on certain rules. The actual application environment changes a lot and...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Nao robot path planning method based on deep Double-Q network
  • Nao robot path planning method based on deep Double-Q network
  • Nao robot path planning method based on deep Double-Q network

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0060] The basic idea of ​​the present invention is to use the deep Double-Q network to train the robot to complete the path planning task, and can migrate to the real robot and the unknown real environment.

[0061] Step 1: Virtual environment information preprocessing

[0062] The Naoqi platform in the Choregraphe software is used as a virtual environment for training Nao robots, such as figure 2 shown. The ultrasonic information of the Nao robot in this virtual environment cannot be called. In order to make the virtual environment consistent with the state information that the robot can obtain in the real environment, it is necessary to preprocess the information in the virtual environment.

[0063] In the virtual environment, the Nao robot can obtain the relative distance from the obstacle in the X and Y directions, so it is processed based on this:

[0064] (1) When facing an obstacle, calculate the angle between the Nao robot and the obstacle

[0065]

[0066] whe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Nao robot path planning method based on a deep Double-Q network, which aims at an uncertain environment, utilizes local information acquired by a robot to realize active obstacle avoidance in an indoor environment, plans a path to reach a set target point, and improves the generalization ability of a common path planning algorithm. Through preprocessing training environment data to approach local environment information which can be acquired by a robot in a real scene and setting a proper state space, a proper action space and a proper reward function, the method can directly utilize the acquired local environment information to perform effective obstacle avoidance and path planning in an unknown environment; the method overcomes the defect that a traditional algorithm needs to obtain sufficient environmental information and perform modeling again to explore the environment to a certain extent, improves the generalization ability of the algorithm, and is beneficial to solving the problem of path planning in an unknown environment.

Description

technical field [0001] The invention belongs to the field of deep reinforcement learning and path planning, and relates to a Nao robot path planning method based on a deep Double-Q network. Background technique [0002] Path planning refers to the technology of finding a collision-free path from the initial state (including position and attitude) to the target state (including position and attitude) in an environment with obstacles according to certain evaluation criteria. [0003] At present, path planning algorithms can be roughly divided into: classical algorithms and artificial intelligence algorithms. Traditional path planning algorithms mainly include: simulated annealing algorithm, artificial potential field algorithm, tabu search algorithm, etc. With the rise of artificial intelligence, because of its certain self-learning, self-renewal and memory capabilities, many path planning algorithms based on artificial intelligence have been proposed, typical ones are: ant c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05D1/02
CPCG05D1/0214G05D1/0255G05D2201/0217
Inventor 赵佳玮张利军
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products