Method and system for adaptive control of robot motion parameters based on deep reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of robot motion and adaptive control, applied in the direction of comprehensive factory control, program control manipulator, manipulator, etc., to achieve the effect of improving environmental adaptability and robustness, reducing exploration time, and optimizing controller parameters

Active Publication Date: 2022-05-17

THE 21TH RES INST OF CHINA ELECTRONIC TECH GRP CORP

View PDF10 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The purpose of this application is to provide a method and system for adaptive control of robot motion parameters based on deep reinforcement learning to solve or alleviate the problems in the above-mentioned prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0040] The present application will be described in detail below with reference to the accompanying drawings and embodiments. Each example is provided by way of explanation of the application, not limitation of the application. In fact, those skilled in the art will recognize that modifications and variations can be made in the present application without departing from the scope or spirit of the application. For example, features illustrated or described as part of one embodiment can be used on another embodiment to yield a still further embodiment. Accordingly, it is intended that the present application cover such modifications and variations as come within the scope of the appended claims and their equivalents.

[0041] First, it should be noted that in the embodiment of the present application, the robot in the simulation environment refers to the simulation model of the robot, and the simulation model of the quadruped robot is used. The controller for motion control of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present application provides a method and system for adaptive control of robot motion parameters based on deep reinforcement learning. The method includes: constructing an agent in a simulation environment, the agent comprising: a strategy neural network, a value neural network and a task planning module; based on guided reinforcement learning, according to sample parameters, the strategy neural network in the agent is Carry out training; based on hierarchical reinforcement learning, according to multiple subtasks and their corresponding reward functions, the strategy neural network and the value neural network in the agent are alternately carried out strategy promotion and strategy evaluation, and the trained strategy neural network is obtained. Network model; based on the trained strategy neural network model, output control parameter optimization values to the controller according to the target task, so that the controller can control the robot according to the control parameter optimization values .

Description

technical field [0001] The present application relates to the technical field of robot control, in particular to a method and system for adaptive control of robot motion parameters based on deep reinforcement learning. Background technique [0002] Control parameters play an important role in the kinematic performance of quadruped robot systems, while the parameter selection of traditional controllers depends on professional domain knowledge and engineering experience. At present, some control methods based on deep reinforcement learning expect to achieve end-to-end optimization from sensor data to motor control signals, but this technical route has a long training period and difficult convergence. Stability and robustness, if the performance of the training model is not good, it can only be redesigned and trained, which greatly limits the engineering application of deep reinforcement learning technology in robot motion control. [0003] Therefore, it is necessary to provid...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): B25J9/16

CPCB25J9/161B25J9/1664Y02P90/02

Inventor 任亮王春雷杨亚邵海存张志鹏马保平彭长武李晓强

Owner THE 21TH RES INST OF CHINA ELECTRONIC TECH GRP CORP

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and system for adaptive control of robot motion parameters based on deep reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology