Method and system for adaptive control of robot motion parameters based on deep reinforcement learning

A technology of robot motion and adaptive control, applied in the direction of comprehensive factory control, program control manipulator, manipulator, etc., to achieve the effect of improving environmental adaptability and robustness, reducing exploration time, and optimizing controller parameters

Active Publication Date: 2022-05-17
THE 21TH RES INST OF CHINA ELECTRONIC TECH GRP CORP
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of this application is to provide a method and system for adaptive control of robot motion parameters based on deep reinforcement learning to solve or alleviate the problems in the above-mentioned prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for adaptive control of robot motion parameters based on deep reinforcement learning
  • Method and system for adaptive control of robot motion parameters based on deep reinforcement learning
  • Method and system for adaptive control of robot motion parameters based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present application will be described in detail below with reference to the accompanying drawings and embodiments. Each example is provided by way of explanation of the application, not limitation of the application. In fact, those skilled in the art will recognize that modifications and variations can be made in the present application without departing from the scope or spirit of the application. For example, features illustrated or described as part of one embodiment can be used on another embodiment to yield a still further embodiment. Accordingly, it is intended that the present application cover such modifications and variations as come within the scope of the appended claims and their equivalents.

[0041] First, it should be noted that in the embodiment of the present application, the robot in the simulation environment refers to the simulation model of the robot, and the simulation model of the quadruped robot is used. The controller for motion control of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application provides a method and system for adaptive control of robot motion parameters based on deep reinforcement learning. The method includes: constructing an agent in a simulation environment, the agent comprising: a strategy neural network, a value neural network and a task planning module; based on guided reinforcement learning, according to sample parameters, the strategy neural network in the agent is Carry out training; based on hierarchical reinforcement learning, according to multiple subtasks and their corresponding reward functions, the strategy neural network and the value neural network in the agent are alternately carried out strategy promotion and strategy evaluation, and the trained strategy neural network is obtained. Network model; based on the trained strategy neural network model, output control parameter optimization values ​​to the controller according to the target task, so that the controller can control the robot according to the control parameter optimization values .

Description

technical field [0001] The present application relates to the technical field of robot control, in particular to a method and system for adaptive control of robot motion parameters based on deep reinforcement learning. Background technique [0002] Control parameters play an important role in the kinematic performance of quadruped robot systems, while the parameter selection of traditional controllers depends on professional domain knowledge and engineering experience. At present, some control methods based on deep reinforcement learning expect to achieve end-to-end optimization from sensor data to motor control signals, but this technical route has a long training period and difficult convergence. Stability and robustness, if the performance of the training model is not good, it can only be redesigned and trained, which greatly limits the engineering application of deep reinforcement learning technology in robot motion control. [0003] Therefore, it is necessary to provid...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): B25J9/16
CPCB25J9/161B25J9/1664Y02P90/02
Inventor 任亮王春雷杨亚邵海存张志鹏马保平彭长武李晓强
Owner THE 21TH RES INST OF CHINA ELECTRONIC TECH GRP CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products