Gait planning method for biped robot based on deep reinforcement learning

A biped robot and reinforcement learning technology, applied in the gait planning field of biped robots based on deep reinforcement learning, can solve problems such as weak anti-interference ability and stiff gait, and achieve the elimination of geometric differences and noise, smooth walking, high The effect of control requirements

Active Publication Date: 2018-12-11
YANSHAN UNIV
View PDF7 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The invention utilizes the characteristics that the structure and walking process of the knee-legged robot model are similar to the human body model, and combines the deep reinforcement learning method driven by big data to solve the weak anti-interference ability and conventional intelligent gait of the model-based gait planning method. Improve the stability and compliance of the robot when walking, such as stiff gait in the planning method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gait planning method for biped robot based on deep reinforcement learning
  • Gait planning method for biped robot based on deep reinforcement learning
  • Gait planning method for biped robot based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments.

[0049] figure 1 is a schematic diagram of the planning method of the present invention; as figure 1 Shown, a kind of biped robot gait planning method based on deep reinforcement learning of the present invention comprises:

[0050] Step S1: Establish a biped robot model and describe the walking process of the robot; Step S1 specifically includes the following steps:

[0051] Step S101: Establish a 4-link robot model with knee arc feet;

[0052] Step S102: Analyze the walking process of the model from the perspective of the right side of the robot...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a gait planning method for a biped robot based on deep reinforcement learning, and the method utilizes the stability and flexibility of human gait and combines deep reinforcement learning to effectively control the gait of the biped robot. The method includes the following steps: 1), establishing a passive biped robot model; 2), acquiring and processing human gait data andtarget gait data; 3), extracting the implicit features of the gait data of the biped robot and human gait data through using a noise reduction auto-encoder; 4), learning the gait characteristics of ahuman body through using deep reinforcement learning, and then planning the gait of the biped robot. The method combines deep reinforcement learning with human gait data to control the biped robot towalk as stable and compliant as a human.

Description

technical field [0001] The invention relates to the technical field of biped robots, in particular to a gait planning method for biped robots based on deep reinforcement learning. Background technique [0002] At present, the moving modes of mobile robots include crawler, wheel, biped and so on. Compared with tracked and wheeled robots, biped robots have stronger adaptability, and can move on both flat ground and irregular environments (up and down steps, walking on uneven ground, etc.). However, the biped robot itself is a highly nonlinear hybrid dynamic system, and its gait planning has always been a difficult problem. [0003] In addition to maintaining the stability of walking, the gait planning of biped robots must also consider issues such as energy efficiency, compliance, and environmental adaptability of walking movements. Gait planning methods based on simplified models are commonly used in biped robot gait planning. The method based on the simplified model start...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/08
CPCG05D1/0891
Inventor 吴晓光刘绍维杨磊张天赐李艳会王挺进
Owner YANSHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products