Robot skill learning method and system based on reinforcement learning and unsupervised learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of unsupervised learning and reinforcement learning, which is applied in the field of robot skill learning methods and systems, can solve the problems of deep neural networks such as difficulty, difficulty in expansion, and long training time, so as to improve decision-making accuracy and reduce the need for reward function design. The effect of improving training efficiency

Pending Publication Date: 2021-06-22

SOUTH CHINA UNIV OF TECH

View PDF0 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, using an end-to-end deep neural network is generally difficult to converge and requires more training time, and for different tasks, different reward functions need to be designed. The specially customized network structure and parameters make it only complete specific tasks, which is difficult Expand to other different tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0040] Such as figure 1As shown, this embodiment provides a robot skill learning system based on reinforcement learning and unsupervised learning, which can input a target image, so that the robot actuator can complete the corresponding target task. The system includes an unsupervised learning subsystem, a deep reinforcement learning subsystem, a robot control subsystem and an image acquisition subsystem, and the robot control subsystem further includes a robot controller and an actuator. The unsupervised learning subsystem, the deep reinforcement learning subsystem, the robot controller and the actuator are connected sequentially.

[0041] Among them, the image acquisition subsystem is installed in the task environment where the actuator is located, and is used to collect the real-time state image of the actuator during the execution of the task in this environment. The image acquisition subsystem is connected to the unsupervised learning subsystem and transmits the real-time...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a robot skill learning method and system based on reinforcement learning and unsupervised learning, and the method comprises the steps: enabling a robot controller to control an execution mechanism to act when a specific task is executed, and enabling an image collection subsystem to collect a real-time state image of the execution mechanism in a task environment. An encoder of the unsupervised learning subsystem converts the real-time state image into a real-time state code, takes a state image when a set execution mechanism executes and completes a specified task as a target image, and converts the target image into a target code; the deep reinforcement learning subsystem outputs an action value of the execution mechanism at the next moment according to the real-time state code and the target code; and the robot controller controls the execution mechanism to make corresponding actions in real time according to the action value of the execution mechanism until the execution mechanism executes and completes the task. Compared with a common end-to-end network, the system provided by the invention is more universal, can be applied to different targets and tasks, and can also compress the training time.

Description

technical field [0001] The invention relates to the technical field of robot control, in particular to a robot skill learning method and system based on reinforcement learning and unsupervised learning. Background technique [0002] Reinforcement learning is a machine learning algorithm widely used at present. Its biggest feature is that it can learn the mapping from state to action from the environment. Based on the artificially designed reward function, a series of action decisions are based on maximizing Cumulative rewards are the goal. Among them, deep reinforcement learning is based on deep neural network, with state as network input and action as network output value, and the best decision-making network is obtained through continuous training. Deep reinforcement learning can be applied to various scenarios that require decision optimization, such as game AI and robot control. [0003] With the development of robot technology, the application scenarios of robots are ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06K9/62G06N3/04G06N3/08

CPCG06N3/08G06N3/045G06F18/217

Inventor 陈艺文占宏杨辰光

Owner SOUTH CHINA UNIV OF TECH

Robot skill learning method and system based on reinforcement learning and unsupervised learning

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

Agents

Company

Robot skill learning method and system based on reinforcement learning and unsupervised learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

Agents

Company

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology