Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Humanoid robot gait planning deep reinforcement learning new method

A humanoid robot and reinforcement learning technology, which is applied in the new field of deep reinforcement learning of humanoid robot gait planning, to achieve the effect of enhancing stability and robustness

Inactive Publication Date: 2020-08-18
CHANGZHOU INST OF TECH
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved in the present invention overcomes the existing defects, provides a new method of deep reinforcement learning for gait planning of humanoid robots, uses the improved DQN algorithm to solve the walking control problem of biped robots, and does not need to establish complex bipeds Based on the robot dynamics model, the reinforcement deep learning method is applied to the biped robot to achieve long-distance stable gait control under fast walking conditions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Humanoid robot gait planning deep reinforcement learning new method
  • Humanoid robot gait planning deep reinforcement learning new method
  • Humanoid robot gait planning deep reinforcement learning new method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0055] Such as figure 1 As shown, a new deep reinforcement learning method for humanoid robot gait planning of the present invention includes establishing a humanoid biped robot model, pre-training control parameters and biped robot walking motion.

[0056] Among them, the humanoid biped robot model uses a simplified 6-DOF linkage model, and the foot adopts a planar plantar structure; the pre-training control parameters use an improved DQN (Deep-Q-Network) network structure The deep reinforcement learning of the controller is trained; the stability of the biped robot's walking motion is mainly guaranteed by the trained controller. First of all, it is necessary to u...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

PropertyMeasurementUnit
Densityaaaaaaaaaa
Densityaaaaaaaaaa
Login to View More

Abstract

The invention relates to an improvement on the robot technology, in particular to a humanoid robot gait planning deep reinforcement learning new method. The problem of walking control of a biped robotis solved through an improvement DQN algorithm. On the basis that there is no need to establish a complex biped robot dynamic model, the deep reinforcement learning method is used for realizing long-distance stable gait control of the biped robot under the condition of walking fast. The method comprises the steps of establishing of a humanoid biped robot model, pre-training control parameters andwalking movement of the biped robot, wherein a simplified six-degree-of-freedom connecting rod model is adopted as the humanoid biped robot model, the feet adopt planar sole structures; through the pre-training control parameters, training of a controller is carried out through the improved deep reinforcement learning method adopting a DQN network structure; and the stability of walking movementof the biped robot is realized through the trained controller.

Description

technical field [0001] The invention relates to an improvement of robot technology, in particular to a new deep reinforcement learning method for gait planning of a humanoid robot. Background technique [0002] The humanoid biped robot has a humanoid structure that can adapt to complex terrain environments such as stairs, streets, and uneven ground, and has the characteristics of flexible movement forms. Therefore, robots with this type of motion can be used in many industries such as medical treatment, rescue, service, and exploration. Human locomotion looks simple, but it is a very complex action involving multiple degrees of freedom. It can be seen as the complex nonlinear dynamics arising from the interrelationship of these degrees of freedom through the extensor and flexor groups of the lower body, which also became the motivation to properly understand the physiology involved in the study of locomotion and replicate it to bipedal robots . Biped walking robots have t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): B25J9/16
CPCB25J9/1612B25J9/163B25J9/1664
Inventor 冯春赵彻李晓贞张祎伟姜文彪武之炜
Owner CHANGZHOU INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products