[0020] Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art. It should be noted that the embodiments of the present invention and the features in the embodiments can be combined with each other if there is no conflict. Hereinafter, the present invention will be described in detail with reference to the drawings and in conjunction with the embodiments.
[0021] Figure 1 is a flowchart of a method for motion control of a legged robot based on deep reinforcement learning provided by an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:
[0022] (1) Build a 3D model of a legged robot in the Webots simulation environment, and set the environmental variables, including the robot density, weight, gravitational coefficient, ground friction, joint motor parameters, control cycle, and so on. The 3D model of the legged robot includes a body and four legs. The four legs are respectively connected to the body, are located below the body, and are positioned at the left front, left rear, right front and right rear; each leg includes a calf, a knee joint, a thigh and a hip joint, where the calf is connected to the thigh through the knee joint, and the thigh is connected to the body through the hip joint.
[0023] (2) Initialize the state of the 3D model of the legged robot (preset the centroid position, centroid velocity, attitude, angular velocity, and the initial angles of the knee and hip joints), and set the instant reward function at time t as
[0024] R(s(t), a(t)) = w1 × forward speed(t) - w2 × attitude deviation(t) - w3 × position deviation(t)
[0025] And the cumulative reward function (state-action value function)
[0026] Q(s(t), a(t)) = R(s(t), a(t)) + γR(s(t+1), a(t+1)) + γ²R(s(t+2), a(t+2)) + …
[0027] where s(t) is the state of the robot 3D model at time t, a(t) is the expected angle of the knee and hip joints of the robot 3D model at time t, R(s(t), a(t)) is the instant reward function, w1, w2 and w3 are constants, Q(s(t), a(t)) is the cumulative reward function, and γ is a constant.
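For illustration only, the following Python sketch shows how the cumulative reward relates to the instant rewards through the constant γ; the reward values used are made-up placeholders, not data from this embodiment.

```python
# Minimal sketch: discounted cumulative reward (state-action value) from instant rewards.
# The reward sequence below is a made-up placeholder, not data from the embodiment.
gamma = 0.95  # the constant gamma used later in this embodiment

instant_rewards = [1.2, 0.8, 1.0, 0.5]  # R(s(t), a(t)), R(s(t+1), a(t+1)), ...

q_value = sum((gamma ** k) * r for k, r in enumerate(instant_rewards))
print(q_value)  # approximation of Q(s(t), a(t)) truncated after four steps
```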
[0028] (3) Construct an action network and a target action network. The inputs of the action network and the target action network are the state s(t) of the robot 3D model in step (2); the output of the action network is the expected angle a(t) of the knee and hip joints, and the output of the target action network is a'(t). The weight of the action network is θ_a(t), and the weight of the target action network is θ_a^t(t).
[0029] Construct an evaluation network and a target evaluation network. The inputs of the evaluation network and the target evaluation network are the state s(t) of the robot 3D model in step (2) and the expected angle a(t) of the knee and hip joints. The output of the evaluation network is the cumulative reward function Q(s(t), a(t)), and the output of the target evaluation network is the cumulative reward function Q'(s(t), a(t)). The weight of the evaluation network is θ_c(t), and the weight of the target evaluation network is θ_c^t(t).
[0030] Take the state s(t) of the robot 3D model in step (2) as the input and the expected angle a(t) of the knee and hip joints as the output, and initialize the corresponding deep neural networks, namely the action network and the target action network; take the state s(t) of the robot 3D model and the expected angle a(t) of the knee and hip joints as the input and the cumulative reward function Q(s(t), a(t)) as the output, and initialize the corresponding deep neural networks, namely the evaluation network and the target evaluation network.
[0031] (4) In the state s(t) of the robot 3D model at time t, generate the expected angle a(t) of the knee and hip joints through the action network, move the knee and hip joints of the robot 3D model to the expected angle a(t) at time t+1, read the robot state information s(t+1) at this time, calculate the instant reward R(t+1) of the robot 3D model motion at time t+1, and store [s(t), a(t), s(t+1), R(t+1)] as a sample in the replay variable.
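For illustration only, a minimal Python sketch of the replay variable of step (4) and the random sampling of step (5) is given below; the buffer capacity is an assumed value.

```python
import random
from collections import deque

# Minimal sketch of the replay variable of step (4); the capacity is an assumed value.
replay = deque(maxlen=100000)

def store_transition(s_t, a_t, s_next, r_next):
    # Store one sample [s(t), a(t), s(t+1), R(t+1)] as in step (4).
    replay.append((s_t, a_t, s_next, r_next))

def sample_batch(batch_size=200):
    # Randomly select a certain number of samples, as in step (5).
    return random.sample(replay, batch_size)
```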
[0032] (5) Repeat step (4) until multiple samples are collected, then randomly select a certain number of samples from the replay variable, use the target action network to generate a'(t+1) corresponding to s(t+1), take s(t+1) and a'(t+1) as inputs, and use the target evaluation network to obtain the value Q'(s(t+1), a'(t+1)) of the cumulative reward function, where a'(t+1) is the output of the target action network at time t+1.
[0033] (6) Use the instant reward R(t+1) stored in the replay variable to update the value Q(s(t), a(t)) of the cumulative reward function:
[0034] Q(s(t), a(t)) = R(t+1) + γQ'(s(t+1), a'(t+1))
[0035] (7) Take [s(t), a(t)] as the input and Q(s(t), a(t)) as the output to construct training samples, train the evaluation network, and obtain the new evaluation network weight θ_c(t+1);
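For illustration only, the following Python sketch outlines how steps (5) to (7) could be implemented, assuming the PyTorch library (this embodiment does not prescribe a framework) and assuming that the networks (actor, critic, target_actor, target_critic) and the critic optimizer have already been created as in a later sketch; s, a, s_next and r denote batched tensors sampled from the replay variable.

```python
import torch
import torch.nn.functional as F

# Minimal sketch of steps (5)-(7), assuming PyTorch and that actor, critic,
# target_actor, target_critic and critic_optimizer already exist (see a later sketch);
# s, a, s_next, r are batched tensors drawn from the replay variable.
gamma = 0.95

def train_critic(s, a, s_next, r):
    with torch.no_grad():
        a_next = target_actor(s_next)                       # a'(t+1) from the target action network
        q_next = target_critic(torch.cat([s_next, a_next], dim=1))
        q_target = r + gamma * q_next                       # Q = R(t+1) + gamma * Q'(s(t+1), a'(t+1))
    q_pred = critic(torch.cat([s, a], dim=1))               # evaluation network output
    loss = F.mse_loss(q_pred, q_target)                     # regression toward the updated Q value
    critic_optimizer.zero_grad()
    loss.backward()
    critic_optimizer.step()                                 # yields the new weights theta_c(t+1)
```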
[0036] (8) Take the state s(t) of the robot 3D model as the input, obtain the expected angle a(t) of the knee and hip joints from the action network, obtain the output of the new evaluation network from step (7), namely the cumulative reward function Q(s(t), a(t)), from s(t) and a(t), further calculate the gradient of Q(s(t), a(t)) with respect to a(t), and train the action network based on this gradient to obtain the new action network weight θ_a(t+1);
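For illustration only, a corresponding sketch of step (8) under the same assumptions is given below; maximizing the output of the evaluation network with respect to the generated action propagates the gradient of Q(s(t), a(t)) with respect to a(t) through the action network.

```python
import torch

# Minimal sketch of step (8), same assumptions as above: the gradient of Q with respect
# to a(t) is propagated through the action network by maximizing the evaluation
# network's output (i.e. minimizing its negative mean).
def train_actor(s):
    a_pred = actor(s)                                       # a(t) from the action network
    q_value = critic(torch.cat([s, a_pred], dim=1))         # Q(s(t), a(t)) from the new evaluation network
    loss = -q_value.mean()                                  # ascend the gradient of Q w.r.t. a(t)
    actor_optimizer.zero_grad()
    loss.backward()
    actor_optimizer.step()                                  # yields the new weights theta_a(t+1)
```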
[0037] (9) According to the weight θ_c(t+1) of the new evaluation network obtained in step (7), update the weight θ_c^t(t+1) of the target evaluation network: θ_c^t(t+1) = θ_c(t+1)τ + θ_c^t(t)(1-τ);
[0038] According to the weight θ_a(t+1) of the new action network obtained in step (8), update the weight θ_a^t(t+1) of the target action network: θ_a^t(t+1) = θ_a(t+1)τ + θ_a^t(t)(1-τ), where τ is a constant.
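For illustration only, the soft update of step (9) can be sketched as follows under the same assumptions; τ = 0.01 is an assumed example value, not a value prescribed by this embodiment.

```python
# Minimal sketch of step (9): soft update of the target networks with the constant tau.
tau = 0.01  # assumed example value

def soft_update(target_net, net):
    for target_param, param in zip(target_net.parameters(), net.parameters()):
        # theta_target(t+1) = theta(t+1) * tau + theta_target(t) * (1 - tau)
        target_param.data.copy_(param.data * tau + target_param.data * (1.0 - tau))
```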
[0039] Through the above process, one round of training of the target action network and the target evaluation network is completed.
[0040] (10) Repeat steps (4)-(9) until all networks (the action network, the target action network, the evaluation network and the target evaluation network) converge.
[0041] (11) According to the state of the legged robot 3D model, use the converged action network from step (10) to obtain the expected angles of the knee and hip joints of the legged robot 3D model, thereby realizing the motion control of the legged robot 3D model.
[0042] In other words, a control instruction can be obtained in real time according to the motion state of the robot, and this control instruction maximizes the cumulative reward Q.
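For illustration only, once the action network has converged, the control instruction of step (11) reduces to a single forward pass, as in the following sketch under the same assumptions.

```python
import torch

# Minimal sketch of step (11): once the action network has converged, the control
# instruction is obtained directly from the current state (same assumptions as above).
def control_step(s_t):
    with torch.no_grad():
        return actor(s_t)   # expected knee and hip joint angles a(t) that maximize Q
```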
[0043] The implementation process of the present invention is illustrated below by taking the legged motion of a quadruped robot as an example.
[0044] First, a 3D model of the legged robot is established in the Webots simulation environment. Set the robot body coordinate system (Figure 2): the position of the robot's center of mass is the origin, the z-axis points vertically upward, the x-axis points to the side of the robot in the horizontal plane, and the y-axis points in the forward direction of the robot in the horizontal plane. Set the global coordinate system (Figure 2): the starting point on the horizontal plane is the origin of the global coordinate system, the z-axis points vertically upward, and the y-axis points north. Looking down on the robot, with the origin of the body coordinate system as the center, the four legs of the robot are designated front left, front right, back left and back right. Each leg has two degrees of freedom, namely the hip joint h rotating about the x-axis and the knee joint k rotating about the x-axis. Therefore, for legged motion, the control quantity of the 3D model of the legged robot is selected as:
[0045] a = [θ_hfr θ_kfr θ_hfl θ_kfl θ_hbr θ_kbr θ_hbl θ_kbl]
[0046] where θ is the joint angle, the subscript h denotes the hip joint, k denotes the knee joint, f and b denote front and back respectively, and l and r denote left and right respectively.
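For illustration only, the following sketch shows how the control quantity a could be applied to the eight joint motors from within a Webots controller written in Python; the motor device names are assumptions chosen for this sketch and are not defined in this embodiment.

```python
# Minimal sketch of applying the control quantity a to the eight joint motors in a
# Webots controller; the device names below are assumptions for illustration.
from controller import Robot

robot = Robot()
timestep = int(robot.getBasicTimeStep())

motor_names = ["hip_fr", "knee_fr", "hip_fl", "knee_fl",
               "hip_br", "knee_br", "hip_bl", "knee_bl"]
motors = [robot.getDevice(name) for name in motor_names]

def apply_action(a):
    # a = [theta_hfr, theta_kfr, theta_hfl, theta_kfl, theta_hbr, theta_kbr, theta_hbl, theta_kbl]
    for motor, angle in zip(motors, a):
        motor.setPosition(float(angle))   # position-controlled joint motor
```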
[0047] The state quantity of the robot is selected as:
[0048] s = [p_x p_y p_z v_x v_y v_z θ_roll θ_pitch θ_yaw w_x w_y w_z]
[0049] where p_x and v_x are the position and velocity of the robot along the x direction in the global coordinate system (and similarly for the y and z components), θ_roll, θ_pitch and θ_yaw are the roll angle (rotation about the body y-axis), pitch angle (rotation about the body x-axis) and yaw angle (rotation about the body z-axis), respectively, and w_x, w_y and w_z are the corresponding angular velocities.
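For illustration only, the 12-dimensional state quantity could be assembled from Webots sensors as sketched below, reusing the robot and timestep objects from the previous sketch; the sensor device names and the finite-difference velocity estimate are assumptions of this sketch.

```python
# Minimal sketch of assembling the 12-dimensional state s from Webots sensors; the
# device names ("gps", "inertial_unit", "gyro") are assumptions for illustration, and
# the velocity is approximated by finite differences of the GPS position.
gps = robot.getDevice("gps")
imu = robot.getDevice("inertial_unit")
gyro = robot.getDevice("gyro")
for device in (gps, imu, gyro):
    device.enable(timestep)

prev_position = [0.0, 0.0, 0.0]

def read_state():
    global prev_position
    position = list(gps.getValues())                    # p_x, p_y, p_z
    velocity = [(p - q) / (timestep / 1000.0)           # finite-difference v_x, v_y, v_z
                for p, q in zip(position, prev_position)]
    prev_position = position
    attitude = list(imu.getRollPitchYaw())              # theta_roll, theta_pitch, theta_yaw
    angular_velocity = list(gyro.getValues())           # w_x, w_y, w_z
    return position + velocity + attitude + angular_velocity
```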
[0050] Set the instant reward function as:
[0051] R(s(t), a(t)) = 1000 v_y(t) - 50(|θ_pitch(t)| + |θ_roll(t)| + |θ_yaw(t)|)
[0052]                - 100(|p_x(t) - p_x^d(t)| + |p_y(t) - p_y^d(t)| + |p_z(t) - p_z^d(t)|)
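For illustration only, this instant reward can be sketched in Python as follows, where p^d is taken to be the preset desired centroid position (how the desired position is supplied is an assumption of this sketch).

```python
# Minimal sketch of the instant reward of this example; p_desired stands for the preset
# desired centroid position (an assumption of how the deviation terms are supplied).
def instant_reward(state, p_desired):
    px, py, pz, vx, vy, vz, roll, pitch, yaw, wx, wy, wz = state
    forward_term = 1000.0 * vy                                   # reward forward speed along y
    attitude_term = 50.0 * (abs(pitch) + abs(roll) + abs(yaw))   # penalize attitude deviation
    position_term = 100.0 * (abs(px - p_desired[0]) +
                             abs(py - p_desired[1]) +
                             abs(pz - p_desired[2]))             # penalize position deviation
    return forward_term - attitude_term - position_term
```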
[0053] In the state-action value function of step (2), γ = 0.95 is selected.
[0054] According to step (3), taking s as the network input and a as the network output, establish and initialize the action network and the evaluation network. Each network contains 5 hidden layers, and each hidden layer contains 500 neurons. The activation function of the neurons is the ReLU function:
[0055] φ(a)=max(0,a)
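For illustration only, the action network and evaluation network of this example, together with their target copies, could be built as sketched below, again assuming PyTorch; the learning rates are assumed values, while the layer sizes follow the 5 hidden layers of 500 ReLU neurons described above and the input/output dimensions follow the state and control quantities defined above.

```python
import copy
import torch
import torch.nn as nn

# Minimal sketch of the networks of this example, assuming PyTorch: 5 hidden layers of
# 500 ReLU neurons each; STATE_DIM = 12 and ACTION_DIM = 8 follow the state and control
# quantities defined above; the learning rates are assumed values.
STATE_DIM, ACTION_DIM, HIDDEN = 12, 8, 500

def mlp(in_dim, out_dim):
    layers, dim = [], in_dim
    for _ in range(5):                       # 5 hidden layers of 500 neurons
        layers += [nn.Linear(dim, HIDDEN), nn.ReLU()]
        dim = HIDDEN
    layers.append(nn.Linear(dim, out_dim))
    return nn.Sequential(*layers)

actor = mlp(STATE_DIM, ACTION_DIM)               # action network: s -> a
critic = mlp(STATE_DIM + ACTION_DIM, 1)          # evaluation network: (s, a) -> Q
target_actor = copy.deepcopy(actor)              # target action network
target_critic = copy.deepcopy(critic)            # target evaluation network
actor_optimizer = torch.optim.Adam(actor.parameters(), lr=1e-4)
critic_optimizer = torch.optim.Adam(critic.parameters(), lr=1e-3)
```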
[0056] According to step (4), obtain the state s(t) of the legged robot 3D model from Webots, feed it to the action network to obtain a(t), have each joint of the robot reach the expected angle given by a(t) at time t+1, read the new state s(t+1), and calculate the instant reward R(s(t), a(t)). Store s(t), a(t), R(s(t), a(t)) and s(t+1) in the replay variable.
[0057] Step (4) is repeated continuously. When the number of samples in the replay variable is greater than 1000, 200 samples are randomly selected, and the outputs of the target action network and the target evaluation network are calculated according to step (5). Using the output of the target evaluation network, the value Q(s(t), a(t)) of the cumulative reward function is updated according to step (6).
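For illustration only, the following sketch ties the previous sketches together into one training loop matching steps (4) to (9) of this example; the exploration noise, the desired centroid position and the absence of episode resets are simplifying assumptions of this sketch.

```python
import numpy as np
import torch

# Minimal sketch of the training loop of this example, reusing the sketches above;
# the exploration noise scale, the desired centroid position and the lack of episode
# resets are assumptions of this sketch, not part of the embodiment.
noise_std = 0.1                                   # assumed exploration noise

s_t = read_state()
while True:
    a_t = control_step(torch.tensor([s_t], dtype=torch.float32))[0].numpy()
    a_t = a_t + np.random.normal(0.0, noise_std, size=ACTION_DIM)   # explore around a(t)
    apply_action(a_t)
    robot.step(timestep)                          # advance the Webots simulation to t+1
    s_next = read_state()
    r_next = instant_reward(s_next, p_desired=[0.0, 0.0, 0.3])      # assumed desired centroid
    store_transition(s_t, a_t, s_next, r_next)
    if len(replay) > 1000:                        # as in this example: start after 1000 samples
        batch = sample_batch(200)                 # randomly select 200 samples
        s, a, s2, r = [torch.tensor(np.array(x), dtype=torch.float32) for x in zip(*batch)]
        train_critic(s, a, s2, r.unsqueeze(1))    # steps (5)-(7)
        train_actor(s)                            # step (8)
        soft_update(target_critic, critic)        # step (9)
        soft_update(target_actor, actor)
    s_t = s_next
```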
[0058] At this point, the samples {[s(t), a(t)], Q(s(t), a(t))} can be used to train the evaluation network.
[0059] According to step (8), update the weight θ_a(t+1) of the action network. According to step (9), use the action network and the evaluation network to update the weights of the target action network and the target evaluation network.
[0060] At this point, one round of training is complete. Steps (4)-(9) are repeated continuously, and after about 20,000 iterations the networks converge.
[0061] This embodiment also provides a legged robot motion control system based on deep reinforcement learning. The system includes: a first module for constructing a legged robot 3D model in the Webots simulation environment, wherein the legged robot 3D model includes a body and four legs, the four legs are connected to the body and are located below the body, each leg includes a calf, a knee joint, a thigh and a hip joint, the calf is connected to the thigh through the knee joint, and the thigh is connected to the body through the hip joint; a second module for initializing the state of the legged robot 3D model and presetting the instant reward function R(s(t), a(t)) and the cumulative reward function Q(s(t), a(t)) at time t, where s(t) is the state of the robot 3D model at time t and a(t) is the expected angle of the knee and hip joints of the robot 3D model at time t; a third module for constructing the action network and the target action network, as well as the evaluation network and the target evaluation network; a fourth module for generating the expected angle a(t) of the knee and hip joints through the action network in the state s(t) of the robot 3D model at time t, moving the knee and hip joints of the robot 3D model to the expected angle a(t) at time t+1, reading the robot state information s(t+1) at this time, calculating the instant reward R(t+1) of the robot 3D model motion at time t+1, and storing [s(t), a(t), s(t+1), R(t+1)] as a sample in the replay variable; a fifth module for, after multiple samples have been collected through the fourth module, randomly selecting a certain number of samples from the replay variable, using the target action network to generate a'(t+1) corresponding to s(t+1), taking this as the input, and then using the target evaluation network to obtain the value Q'(s(t+1), a'(t+1)) of the cumulative reward function, where a'(t+1) is the output of the target action network at time t+1; a sixth module for updating the value Q(s(t), a(t)) of the cumulative reward function using the instant reward R(t+1) stored in the replay variable; a seventh module for taking [s(t), a(t)] as the input and Q(s(t), a(t)) as the output, constructing training samples, training the evaluation network, and obtaining the new evaluation network weight θ_c(t+1); an eighth module for taking the state s(t) of the robot 3D model as the input, obtaining the expected angle a(t) of the knee and hip joints from the action network, obtaining the output of the new evaluation network, namely the cumulative reward function Q(s(t), a(t)), from s(t) and a(t), further calculating the gradient of Q(s(t), a(t)) with respect to a(t), and training the action network based on this gradient to obtain the new action network weight θ_a(t+1); a ninth module for updating the weight θ_c^t(t+1) of the target evaluation network with the new evaluation network weight θ_c(t+1), and updating the weight θ_a^t(t+1) of the target action network with the new action network weight θ_a(t+1); a tenth module for repeating the above training until the action network, the target action network, the evaluation network and the target evaluation network converge; and an eleventh module for using the converged action network of the tenth module to obtain the expected angles of the knee and hip joints of the legged robot 3D model according to the state of the legged robot 3D model, thereby realizing the motion control of the legged robot 3D model.
[0062] This embodiment is directed at a new type of movement mechanism and a new compound motion mode of a legged robot. Based on the deep deterministic policy gradient method, the legged robot for the first time achieves stable and rapid movement in an unknown environment and obtains a new and efficient compound movement mode. This method breaks through the dependence of existing control methods on an object dynamics model and an environment model, greatly reduces the parameter-tuning workload in existing controller design, and can learn motion control strategies for multiple motion modes simultaneously, without the training having to be designed separately for each mode. More importantly, because the robot continuously learns the optimal motion strategy during active exploration of the environment, only a short period of retraining is required when the structure or design parameters of the robot change, or when the ground contact conditions change, so this intelligent motion control method can be applied to similar objects and similar environments.
[0063] The deep deterministic policy gradient method adopted in this embodiment relies on deep neural networks to model the policy function and the value function in a continuous action space. Through reasonable algorithm design and sufficient learning and training, the optimal control strategy can be obtained in the continuous action space. This process is realized through the autonomous exploration and learning of the robot without human intervention, and a large amount of early learning and training can be carried out in the Webots robot simulation software.
[0064] The above-mentioned embodiments are only preferred specific implementations of the present invention, and the usual changes and substitutions made by those skilled in the art within the scope of the technical solution of the present invention should be included in the protection scope of the present invention.