Gait training method and device of quadruped robot based on deep reinforcement learning, electronic equipment and medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A quadruped robot and reinforcement learning technology, applied in the field of gait training methods of quadruped robots, electronic equipment and media, and devices, can solve the problem of large sensor information noise and delay, strong model assumptions, and difficult to meet the complexity of quadruped robots. , high dynamic, multi-scene usage requirements, etc.

Pending Publication Date: 2021-04-02

HANGZHOU WEIMING XINKE TECH CO LTD +1

View PDF9 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the method based on such traditional control theory requires a lot of manual parameter adjustment, and the noise and delay of sensor information from a single source are relatively large, which is likely to cause problems in the control system, and the robustness of the model is difficult to meet the needs of complex environments.

In addition, when the quadruped robot encounters uneven and unstructured terrain, the traditional step control strategy is prone to failure. The fundamental reason is that the model assumptions of such methods are strong, and it is difficult to meet the complex, highly dynamic, and multi-scenario requirements of the actual quadruped robot. Usage requirements

[0003] At present, the research on the gait control of quadruped robots in related technologies is still relatively simple, and most of them focus on the simulator based on the physics engine and the algorithm level of reinforcement learning, lacking a complete starting from the simulator and transferring to the physical robot feasible method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0047] Various exemplary embodiments of the present application will now be described in detail with reference to the accompanying drawings. It should be noted that the relative arrangements of components and steps, numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present application unless specifically stated otherwise.

[0048] At the same time, it should be understood that, for the convenience of description, the sizes of the various parts shown in the drawings are not drawn according to the actual proportional relationship.

[0049] The following description of at least one exemplary embodiment is merely illustrative in nature and not intended as any limitation of the application, its application or uses.

[0050] Techniques, methods and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods and devices should be considered part of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a gait training method and device of a quadruped robot based on deep reinforcement learning, electronic equipment and a medium. According to the application, an environment, the quadruped robot model and a sensor model can be built in the simulator; a measurement signal is acquired through a multi-source information fusion technology and a sensor, and the state of the robotis estimated; a foot end track planning and initial gait control strategy of the quadruped robot is generated by utilizing the environment module, the robot model and the measurement signal; and strategy improvement is performed on the initial gait control strategy of the quadruped robot based on a deep reinforcement learning algorithm until a strategy improvement ending condition is triggered. By applying the technical scheme of the invention, the problems that a large amount of manual parameter adjustment is needed and the labor cost is relatively high due to a model control algorithm in related technologies can be avoided. Parameters obtained through training are often poor in robustness, can only be suitable for a certain fixed scene, and are difficult to meet the requirements of various application tasks of an actual quadruped robot.

Description

technical field [0001] This application relates to computer technology, especially a gait training method, device, electronic equipment and medium for a quadruped robot based on deep reinforcement learning. Background technique [0002] In the field of gait control of quadruped robots, methods such as forward and inverse kinematics and robot dynamics are usually combined with advanced control theories such as model predictive control to realize the gait control of quadruped robots. The entire control system is usually divided into foot trajectory planning Module and motion control module, the planning part realizes the calculation of the trajectory of the foot end, and the control part realizes the control of each joint actuator according to the trajectory. However, the method based on such traditional control theory requires a lot of manual parameter adjustment, and the noise and delay of sensor information from a single source are relatively large, which is likely to cause...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05D1/08

CPCG05D1/0891

Inventor 张文祺李鹏赵锴傅汇乔叶发萍江微杰朱晓王韬

Owner HANGZHOU WEIMING XINKE TECH CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Gait training method and device of quadruped robot based on deep reinforcement learning, electronic equipment and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology