Humanoid robot motion control method and system based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A robot motion, humanoid robot technology, applied in the field of humanoid robot motion control based on deep reinforcement learning, can solve the problems of slow training, poor anti-interference ability, difficult parameter adjustment, etc., to improve stability and reliability, improve The effect of learning speed and improving training efficiency

Pending Publication Date: 2020-07-03

CENT SOUTH UNIV

View PDF6 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, compared with wheeled or tracked robots, humanoid robots are inherently unstable and require active control to achieve equilibrium due to their limited support area, high center of mass, and limited actuator capabilities

Therefore, the scope of application scenarios of humanoid robots is mainly limited by the balance of humanoid robots and the ability to deal with disturbances and uncertainties.

[0003] Classical control methods propose a variety of motion algorithms, but these algorithms lack versatility and are trained based on simplified models, which have poor anti-interference ability

In recent years, reinforcement learning algorithms have been applied to the motion control of humanoid robots, but there are still problems such as difficulty in parameter adjustment and slow training, and it is difficult to achieve stable and reliable motion control of humanoid robots

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0032] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.

[0033] Such as figure 1 As shown, the humanoid robot motion control method based on deep reinforcement learning in this embodiment includes: S1. Simulation control: obtain the current state of the humanoid robot, and calculate and determine the humanoid robot according to the current state with a preset deep reinforcement learning model The target angle of each joint; S2.PD control: through the PD controller, the target angle is used as the control target, and the actual angle and joint torque of the joint are used as feedback to determine the control torque of the joint, and control the joint action according to the control torque.

[0034] In this embodiment, a specific humanoid robot model is taken as an example for illustration, such as figure 2 shown, and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a humanoid robot motion control method and system based on deep reinforcement learning. The method comprises the steps that S1, simulation control is carried out, specifically,the current state of a humanoid robot is obtained, and the target angle of each joint of the humanoid robot are calculated and determined through a preset deep reinforcement learning model accordingto the current state; and S2, PD control is carried out, specifically, through a PD controller, the target angle serves as a control target, the actual angle and the joint torque of the joint serve asfeedback, the control torque of the joint is determined, and the joint is controlled to act according to the control torque. The method has the advantages of good control stability, good reliabilityand the like.

Description

technical field [0001] The invention relates to the technical field of motion control of humanoid robots, in particular to a method and system for motion control of humanoid robots based on deep reinforcement learning. Background technique [0002] Humanoid robots have great application potential and can be deployed in environments where the use of wheeled robots is limited, such as terrain with obstacles, narrow and elevated surfaces (such as stairs). However, compared with wheeled or tracked robots, humanoid robots are inherently unstable and require active control to achieve equilibrium due to their limited support area, high center of mass, and limited actuator capabilities. Therefore, the range of application scenarios for humanoid robots is mainly limited by the ability of humanoid robots to maintain balance and cope with disturbances and uncertainties. [0003] Classical control methods propose a variety of motion algorithms, but these algorithms lack versatility, an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16

CPCB25J9/1664B25J9/1633Y02P90/02

Inventor 任炬许人文张尧学

Owner CENT SOUTH UNIV

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Humanoid robot motion control method and system based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology