Quadruped robot motion planning method based on hierarchical reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A robot motion, quadruped robot technology, applied in the field of quadruped robot motion planning based on hierarchical reinforcement learning, can solve the problem that the quadruped robot control strategy cannot effectively balance the complex environment exploration ability and the fuselage stability, and the coordination cannot be directly effective Ground coordination and other issues to achieve good environmental generalization effect, good control stability, and the effect of maintaining the fuselage posture

Active Publication Date: 2021-06-11

WESTLAKE UNIV

View PDF5 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the quadruped robot control strategy based on deep reinforcement learning cannot effectively balance the ability to explore complex environments and the stability of the fuselage. At the same time, the cooperation between the upper-level control decision-making scheme and the underlying control actuators cannot be directly and effectively coordinated.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0032] The present invention will be described in detail below with reference to the accompanying drawings and preferred embodiments, and the purpose and effect of the present invention will become clearer. It should be understood that the specific embodiments described here are only used to explain the present invention and are not intended to limit the present invention.

[0033] Such as figure 1 As shown, the quadruped robot motion planning method based on layered reinforcement learning of the present invention specifically includes the following steps:

[0034] Step 1: Build a quadruped robot virtual simulation environment with neural network training capabilities on the simulation platform, and build a layered control network, including the upper layer control neural network and the lower layer model predictive controller. The upper-layer neural network is responsible for decision-making and planning the attitude and position of the robot during the movement process, and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a quadruped robot motion planning method based on hierarchical reinforcement learning. According to the method, an upper layer behavior decision controller based on deep reinforcement learning and a lower layer motion execution controller based on model prediction control are constructed; for a deep reinforcement learning network at an upper layer, a state updating network, an action execution network, a reward function and the like of the quadruped robot are designed based on an SAC algorithm, and corresponding behavior control parameters are output in combination with height map information of the environment around the robot; and for a model prediction controller in the lower layer execution controller, a corresponding control instruction is obtained by solving a control parameter quadratic programming equation set input based on the upper layer. A hierarchical control framework combines the advantages of the deep reinforcement learning and model prediction control, so that the quadruped robot can make a safer and more reliable behavior mode according to the current fuselage state and the terrain environment, a fuselage posture is kept stable, risks are effectively avoided, and effective motion track planning under the complex terrain is achieved.

Description

technical field [0001] The invention relates to the field of intelligent legged robots, in particular to a quadruped robot motion planning method based on layered reinforcement learning. Background technique [0002] Compared with other forms of mobile robots, quadruped robots have independent footholds when moving, and can change their gait according to the characteristics of terrain and tasks. It is more suitable for operations in complex environments such as uneven, rugged terrain, and stairs. Better environmental adaptability. However, motion planning for quadruped robots is quite challenging. It must consider a large number of time-varying factors in order to find the optimal control decision from a large action space. Deep reinforcement learning is a new control method. In deep reinforcement learning, the neural network representing the action strategy updates the network parameters through continuous interaction with the environment, and learns how to obtain the ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16

CPCB25J9/1664B25J9/1628

Inventor 么庆丰王纪龙魏震宇王东林

Owner WESTLAKE UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Quadruped robot motion planning method based on hierarchical reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology