Quadruped robot motion planning method based on hierarchical reinforcement learning

A robot motion, quadruped robot technology, applied in the field of quadruped robot motion planning based on hierarchical reinforcement learning, can solve the problem that the quadruped robot control strategy cannot effectively balance the complex environment exploration ability and the fuselage stability, and the coordination cannot be directly effective Ground coordination and other issues to achieve good environmental generalization effect, good control stability, and the effect of maintaining the fuselage posture

Active Publication Date: 2021-06-11
WESTLAKE UNIV
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the quadruped robot control strategy based on deep reinforcement learning cannot effectively balance the ability to explore complex environments and the stability of the fu

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quadruped robot motion planning method based on hierarchical reinforcement learning
  • Quadruped robot motion planning method based on hierarchical reinforcement learning
  • Quadruped robot motion planning method based on hierarchical reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be described in detail below with reference to the accompanying drawings and preferred embodiments, and the purpose and effect of the present invention will become clearer. It should be understood that the specific embodiments described here are only used to explain the present invention and are not intended to limit the present invention.

[0033] Such as figure 1 As shown, the quadruped robot motion planning method based on layered reinforcement learning of the present invention specifically includes the following steps:

[0034] Step 1: Build a quadruped robot virtual simulation environment with neural network training capabilities on the simulation platform, and build a layered control network, including the upper layer control neural network and the lower layer model predictive controller. The upper-layer neural network is responsible for decision-making and planning the attitude and position of the robot during the movement process, and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a quadruped robot motion planning method based on hierarchical reinforcement learning. According to the method, an upper layer behavior decision controller based on deep reinforcement learning and a lower layer motion execution controller based on model prediction control are constructed; for a deep reinforcement learning network at an upper layer, a state updating network, an action execution network, a reward function and the like of the quadruped robot are designed based on an SAC algorithm, and corresponding behavior control parameters are output in combination with height map information of the environment around the robot; and for a model prediction controller in the lower layer execution controller, a corresponding control instruction is obtained by solving a control parameter quadratic programming equation set input based on the upper layer. A hierarchical control framework combines the advantages of the deep reinforcement learning and model prediction control, so that the quadruped robot can make a safer and more reliable behavior mode according to the current fuselage state and the terrain environment, a fuselage posture is kept stable, risks are effectively avoided, and effective motion track planning under the complex terrain is achieved.

Description

technical field [0001] The invention relates to the field of intelligent legged robots, in particular to a quadruped robot motion planning method based on layered reinforcement learning. Background technique [0002] Compared with other forms of mobile robots, quadruped robots have independent footholds when moving, and can change their gait according to the characteristics of terrain and tasks. It is more suitable for operations in complex environments such as uneven, rugged terrain, and stairs. Better environmental adaptability. However, motion planning for quadruped robots is quite challenging. It must consider a large number of time-varying factors in order to find the optimal control decision from a large action space. Deep reinforcement learning is a new control method. In deep reinforcement learning, the neural network representing the action strategy updates the network parameters through continuous interaction with the environment, and learns how to obtain the ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): B25J9/16
CPCB25J9/1664B25J9/1628
Inventor 么庆丰王纪龙魏震宇王东林
Owner WESTLAKE UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products