Foot type robot motion control method and system based on deep reinforcement learning

A technology of robot motion and reinforcement learning, applied in the direction of program control manipulators, manipulators, manufacturing tools, etc., can solve problems such as difficulty in handling high-dimensional objects, complex design process, limited adaptability to model uncertainty and environmental unknowns, etc. Achieve the effect of reducing workload and moving smoothly and quickly

Active Publication Date: 2019-03-19
BEIJING INST OF CONTROL ENG
View PDF5 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Footed robots have the characteristics of multi-joints, strong nonlinearity, and multiple motion modes. At the same time, in the unstructured environment of extraterrestrial bodies, information such as surface materials, stiffness, and terrain are missing or inaccurate, which gives the footed robots motion control. In particular, high-performance motion control that takes into account both fastness and stability poses a hu

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Foot type robot motion control method and system based on deep reinforcement learning
  • Foot type robot motion control method and system based on deep reinforcement learning
  • Foot type robot motion control method and system based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0020] Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art. It should be noted that the embodiments of the present invention and the features in the embodiments can be combined with each other if there is no conflict. Hereinafter, the present invention will be described in detail with reference to the drawings and in conjunction with the embodiments.

[0021] figure 1 It is a flowchart of a method for motion control of a footed robot based on deep reinforcement learnin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a foot type robot motion control method and system based on deep reinforcement learning. The foot type robot motion control method based on the deep reinforcement learning comprises the steps that a foot type robot 3D model is established; reward and punishment functions are designed; a motion network and a target motion network are established, and network initialization is completed; controlled quantity is generated by using the motion network to obtain a state of a robot at a next moment, and a reward and punishment value is calculated; a certain number of samples are selected randomly, a state-motion value of a target evaluation network is calculated, and output of the evaluation network is updated according to a Bellman equation; a weight of the motion networkis updated by using the evaluation network; the target evaluation network and the target motion network are updated by using the evaluation network and the weight of the motion network; the steps arerepeated until network convergence; and a control instruction of the robot motion is obtained according to the motion network. According to the foot type robot motion control method and system based on the deep reinforcement learning, a foot type robot is enabled to realize the high-efficiency and stable movement under unknown environments.

Description

technical field [0001] The invention belongs to the technical field of motion control of legged robots, and in particular relates to a method and system for motion control of legged robots based on deep reinforcement learning. Background technique [0002] Footed robots have the characteristics of multi-joints, strong nonlinearity, and multiple motion modes. At the same time, in the unstructured environment of extraterrestrial bodies, information such as surface materials, stiffness, and terrain are missing or inaccurate, which gives the footed robots motion control. In particular, high-performance motion control that takes into account both fastness and stability poses a huge challenge. The classic model-based motion control method is highly dependent on model accuracy, the design process is complex, it is difficult to deal with high-dimensional objects, and it cannot give full play to the mobility of the robot, especially the adaptability to model uncertainty and environme...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): B25J9/16
CPCB25J9/163
Inventor 黄煌邢琰胡勇何英姿魏春岭
Owner BEIJING INST OF CONTROL ENG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products