Humanoid robot gait control method based on model correlated reinforcement learning

A humanoid robot and reinforcement learning technology, which is applied in the field of humanoid robot gait control based on model-related reinforcement learning, can solve the problems that the PID controller cannot perfectly meet the system control requirements, the humanoid robot system is complicated, and the space is large

Active Publication Date: 2016-11-09
SOUTH CHINA UNIV OF TECH
View PDF2 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

PID is a linear controller, which requires the environment to be a linear approximation model, but the humanoid robot system is a complex nonlinear model, so the PID controller cannot perfectly meet the control requirements of the system
[0004] In order to better control the walking stability of humanoid robots, the use of reinforcement learning to control humanoid robots has attracted widespread attention, but the application of reinforcement learning to the walking stability control of humanoid robots also faces many problems. The state and control actions of the human robot are continuous, and the space is too large, so the traditional reinforcement learning is not convenient to apply
The experimental cost of humanoid robot is too high, and reinforcement learning needs multiple learning and training to achieve better control effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Humanoid robot gait control method based on model correlated reinforcement learning
  • Humanoid robot gait control method based on model correlated reinforcement learning
  • Humanoid robot gait control method based on model correlated reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention will be further described below in conjunction with specific examples.

[0050] The gait control method of a humanoid robot based on model-related reinforcement learning described in this embodiment comprises the following steps:

[0051] 1) Define a reinforcement learning framework for the stability control task before and after walking of a humanoid robot;

[0052] 2) Gait control of humanoid robots using model-dependent reinforcement learning methods based on sparse online Gaussian processes;

[0053] 3) Use the PID controller to improve the action selection method of the reinforcement learning humanoid robot controller, and the improvement operation is to use the PID controller to obtain the optimal initial point for the action selection operation of the reinforcement learning controller.

[0054] In the present invention, reinforcement learning is used for the stability control of the humanoid robot before and after walking. Firstly, the fram...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a humanoid robot gait control method based on model correlated reinforcement learning. The method comprises steps of 1) defining a reinforcement learning framework for a stable control task in forward and backward movements of a humanoid robot; 2) carrying out gait control of the humanoid robot with a model correlated reinforcement learning method based on the sparse online Gaussian process; and 3) improving a motion selection method of a reinforcement learning humanoid robot controller by a PID controller, and taking the improved operation as an optimizing initial point for the PID controller obtaining the motion selection operation of the reinforcement learning controller. The invention utilizes reinforcement learning to control gaits of the humanoid robot in movement, and thus the movement control of the humanoid robot can be automatically adjusted via interaction with the environment, a better control effect is achieved, and the humanoid robot is enabled to be stable in forward and backward directions.

Description

technical field [0001] The invention relates to the field of walking stability control and reinforcement learning of a humanoid robot, in particular to a gait control method of a humanoid robot based on model-related reinforcement learning. Background technique [0002] When controlling the humanoid robot to walk, we usually use the theory of forward and inverse kinematics to obtain the static trajectory of each joint of the humanoid robot, and then use these trajectories to control the humanoid robot to walk. It’s just that the robot joint trajectories obtained in this way can only be used for walking on an ideal flat ground, but cannot walk on uneven ground, because these joint trajectories assume that the environment is a flat ground when planning, and there is no other The interference of factors, and the contact surface between the sole of the foot and the ground on the uneven ground is different from that on the flat ground. Therefore, when the robot walks on an uneve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/02
Inventor 毕盛陈奇石董敏闵华清
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products