Humanoid robot gait control method based on model correlated reinforcement learning

A humanoid robot gait control technology based on model-related reinforcement learning. It addresses three problems: a PID controller cannot fully meet the control requirements of the system, the humanoid robot system is complex, and its state and action spaces are large.

Active Publication Date: 2016-11-09

SOUTH CHINA UNIV OF TECH

2 Cites, 30 Cited by

AI Technical Summary

Problems solved by technology

PID is a linear controller and works well only when the controlled system can be approximated by a linear model. A humanoid robot is a complex nonlinear system, so a PID controller alone cannot fully meet its control requirements.
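To make the linearity point concrete, here is a minimal sketch of a discrete-time PID controller. The class name `PID`, the gains, and the time step are illustrative assumptions and do not come from the patent. The output is a weighted sum of the error, its running integral, and its finite-difference derivative, so it is linear in the error history.

```python
class PID:
    """Minimal discrete-time PID controller (illustrative, hypothetical gains)."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0       # running integral of the error
        self.prev_error = None    # previous error, for the derivative term

    def update(self, error, dt):
        """Return the control output for the current error and time step dt."""
        self.integral += error * dt
        if self.prev_error is None:
            derivative = 0.0      # no derivative on the first sample
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative
```

Because every term scales with the error, doubling the error sequence doubles the output. That is adequate near an operating point where the plant behaves linearly, and less so for a strongly nonlinear system such as a walking humanoid.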

[0004] To better control the walking stability of humanoid robots, the use of reinforcement learning has attracted widespread attention, but applying it to this task also faces many problems. The states and control actions of a humanoid robot are continuous and span a very large space, so traditional reinforcement learning is inconvenient to apply.
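A rough way to see why a large continuous space is inconvenient for table-based reinforcement learning is to count the entries a value table would need after discretization. The sketch below is only an illustration: the function name `q_table_entries` and the example dimensions are assumptions, not figures from the patent.

```python
def q_table_entries(state_dims, action_dims, bins_per_dim):
    """Number of (state, action) cells when every dimension is cut into bins."""
    return bins_per_dim ** (state_dims + action_dims)


# Hypothetical example: 4 state variables, 2 action variables, 20 bins each.
example_entries = q_table_entries(4, 2, 20)
```

The count grows exponentially with the number of dimensions, and each cell must be visited many times to be learned, which is one reason methods that generalize across continuous states are attractive here.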

Experiments on a humanoid robot are costly, while reinforcement learning needs many rounds of learning and training to achieve a good control effect.

Embodiment Construction

[0049] The present invention will be further described below in conjunction with specific examples.

[0050] The humanoid robot gait control method based on model-related reinforcement learning described in this embodiment comprises the following steps:

[0051] 1) Define a reinforcement learning framework for the forward-backward stability control task of a walking humanoid robot;

[0052] 2) Control the gait of the humanoid robot with a model-related reinforcement learning method based on a sparse online Gaussian process;

[0053] 3) Use a PID controller to improve the action selection of the reinforcement learning controller: the PID controller supplies the initial point for the optimization search performed during action selection.
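Steps 2) and 3) can be sketched as follows, under loudly stated assumptions. The patent specifies a sparse online Gaussian process, whereas this sketch uses a plain dense, batch Gaussian process regression as a stand-in. The one-dimensional toy dynamics, the reward, the proportional gain, and all names (`DenseGPModel`, `select_action`, `demo`) are hypothetical. The idea shown is only that a learned model predicts the next state, and that a local search over actions starts from the PID output rather than from an arbitrary point.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between rows of a (n, d) and b (m, d)."""
    sq = (a**2).sum(axis=1)[:, None] + (b**2).sum(axis=1)[None, :] - 2.0 * a @ b.T
    return np.exp(-0.5 * np.maximum(sq, 0.0) / length_scale**2)


class DenseGPModel:
    """Batch GP regression predicting the next state from (state, action).

    Stand-in only: the source specifies a *sparse online* Gaussian process.
    """

    def __init__(self, length_scale=1.0, noise=1e-3):
        self.length_scale = length_scale
        self.noise = noise

    def fit(self, inputs, targets):
        k = rbf_kernel(inputs, inputs, self.length_scale)
        self.inputs = inputs
        self.alpha = np.linalg.solve(k + self.noise * np.eye(len(inputs)), targets)
        return self

    def predict(self, state, action):
        query = np.array([[state, action]], dtype=float)
        mean = rbf_kernel(query, self.inputs, self.length_scale) @ self.alpha
        return float(mean[0])


def select_action(model, state, a_init, reward_fn,
                  step=0.1, iters=30, bounds=(-1.0, 1.0)):
    """Hill-climb over actions, starting from the PID-suggested action a_init."""
    lo, hi = bounds
    best_a = float(np.clip(a_init, lo, hi))
    best_r = reward_fn(model.predict(state, best_a))
    for _ in range(iters):
        improved = False
        for cand in (best_a - step, best_a + step):
            cand = float(np.clip(cand, lo, hi))
            r = reward_fn(model.predict(state, cand))
            if r > best_r:
                best_a, best_r, improved = cand, r, True
        if not improved:
            step *= 0.5  # refine the search around the current best action
    return best_a


def demo():
    """Toy run: dynamics s' = s + a, reward -s'^2 (both hypothetical)."""
    grid = np.linspace(-1.0, 1.0, 9)
    inputs = np.array([[s, a] for s in grid for a in grid])
    targets = inputs[:, 0] + inputs[:, 1]
    model = DenseGPModel().fit(inputs, targets)
    state = 0.5
    a_init = -0.6 * state  # proportional-only PID term, hypothetical gain 0.6
    a_star = select_action(model, state, a_init, lambda s_next: -s_next**2)
    return model, state, a_init, a_star
```

Calling `demo()` returns the fitted model, the state, the PID-suggested action, and the refined action. Seeding the search with the PID output means the search begins near a reasonable action, so a short local search can suffice where a search from a random start might need many more model evaluations.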

[0054] In the present invention, reinforcement learning is used for the forward-backward stability control of the walking humanoid robot. Firstly, the fram...

Abstract

The invention discloses a humanoid robot gait control method based on model correlated reinforcement learning. The method comprises the steps of: 1) defining a reinforcement learning framework for the forward-backward stability control task of a walking humanoid robot; 2) controlling the gait of the humanoid robot with a model correlated reinforcement learning method based on a sparse online Gaussian process; and 3) improving the action selection method of the reinforcement learning controller with a PID controller, the improvement being that the PID controller supplies the initial point for the optimization search in the action selection of the reinforcement learning controller. The invention uses reinforcement learning to control the gait of the humanoid robot while it walks, so that the walking control adjusts itself automatically through interaction with the environment, achieves a better control effect, and keeps the humanoid robot stable in the forward and backward directions.

Description

Technical field

[0001] The invention relates to the fields of humanoid robot walking stability control and reinforcement learning, and in particular to a humanoid robot gait control method based on model-related reinforcement learning.

Background technique

[0002] When controlling a humanoid robot to walk, the theory of forward and inverse kinematics is usually used to obtain a static trajectory for each joint of the robot, and these trajectories are then used to drive the walk. Joint trajectories obtained in this way only work on ideal flat ground and fail on uneven ground, because they are planned under the assumption that the ground is flat and that no other disturbances are present, and because the contact surface between the sole of the foot and the ground differs between uneven and flat ground. Therefore, when the robot walks on an uneve...
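The forward and inverse kinematics mentioned in the background can be illustrated with a planar two-link leg (thigh and shank). This is a textbook simplification and not the kinematic model of the patent. The link lengths, function names, and the choice of the elbow-down solution branch are assumptions made for the sketch.

```python
import math


def forward_kinematics(theta1, theta2, l1=0.3, l2=0.3):
    """Foot position (x, y) from hip angle theta1 and knee angle theta2."""
    x = l1 * math.cos(theta1) + l2 * math.cos(theta1 + theta2)
    y = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
    return x, y


def inverse_kinematics(x, y, l1=0.3, l2=0.3):
    """Joint angles reaching (x, y); returns the solution with theta2 >= 0."""
    c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2.0 * l1 * l2)
    if abs(c2) > 1.0:
        raise ValueError("target is out of reach for the given link lengths")
    theta2 = math.acos(c2)
    theta1 = math.atan2(y, x) - math.atan2(l2 * math.sin(theta2),
                                           l1 + l2 * math.cos(theta2))
    return theta1, theta2
```

A trajectory planned this way is purely geometric: it maps a desired foot path to joint angles and carries no information about ground contact or disturbances, which is the limitation the background describes.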

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G05D1/02

CPC: G05D1/02

Inventor: 毕盛, 陈奇石, 董敏, 闵华清

Owner: SOUTH CHINA UNIV OF TECH
