Reinforcement learning awarding method suitable for movable mechanical arm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A mobile robot arm and reinforcement learning technology, applied in the direction of manipulators, program-controlled manipulators, manufacturing tools, etc., can solve a large number of human experience, the manipulator can not adapt to changes in the external environment in time, complex kinematics and other problems

Active Publication Date: 2020-08-11

NANJING UNIV +1

View PDF9 Cites 18 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

As the complexity of the actual application environment continues to increase, the traditional model-based and rule-based control methods will become more and more complex in modeling the environment and solving the kinematics of the manipulator, and require a lot of human experience to design the corresponding rules

[0003] Both design rules and modeling require a lot of energy, and in most scenarios, the manipulator cannot adapt to changes in the external environment in time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0079] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0080] as attached figure 1 Shown: This embodiment provides a reinforcement learning reward method suitable for mobile manipulators. The present invention utilizes an additionally designed reflective reward function so that the mobile manipulator can learn the target strategy faster and complete the task under the training of the reinforcement learning algorithm. corresponding control tasks. The present invention is described in detail by taking a mobile mechanical arm system with 6 degrees of freedom as an example. The end of the mechanical arm is a two-finger gripper that can grip a dropper, and the gripper can move horizontally and vertically; and Attached to the robotic arm is a four-wheeled mobile robot that moves only horizontally in the mission setting. The task scenario designed in this embodiment is to use the mobile robot arm to com...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a reinforcement learning awarding method suitable for a movable mechanical arm. The method comprises the following steps that S1, a task scene, an initialization algorithm, various parameters of the movable mechanical arm and a deep network model corresponding to the algorithm are designed; S2, environment information sent by information collection equipment on the movablemechanical arm is reconstructed and rasterized, and a starting position and a target position of the movable mechanical arm are clear and definite; S3, interaction with the environment, training dataare collected and stored in an experience pond; S4, one batch of data is sampled from the experience pond, and through treatment of a reward function, rethinking rewards which are additionally designed are obtained for subsequent training; S5, original rewards and extra rewards are combined, and a deep reinforcement learning algorithm trains the movable mechanical arm to finish a target task in aplanned space; and S6, related training data and final model parameters obtained through training are recorded, and a corresponding optimal strategy is obtained.

Description

technical field [0001] The invention relates to a reinforcement learning reward method suitable for mobile manipulators. Reinforcement learning is used in traditional manipulator control tasks, and the structure of the reward signal is redesigned to solve the defects of the reward setting in the existing method, and achieve better results. Good control performance. Background technique [0002] In the traditional control field, the control and motion planning of the mobile manipulator usually involves kinematic modeling of the manipulator and solving the end pose and the angle value of each joint. As the complexity of the actual application environment continues to increase, the traditional model-based and rule-based control methods will become more and more complex in modeling the environment and solving the kinematics of the manipulator, and require a lot of human experience to design the corresponding rules. [0003] Both design rules and modeling require a lot of energy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): B25J9/16

CPCB25J9/16B25J9/163B25J9/1628

Inventor 辛博朱冰清程旭陈春林马晶

Owner NANJING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Reinforcement learning awarding method suitable for movable mechanical arm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology