Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Robot skill learning method and system based on reinforcement learning and unsupervised learning

A technology of unsupervised learning and reinforcement learning, which is applied in the field of robot skill learning methods and systems, can solve the problems of deep neural networks such as difficulty, difficulty in expansion, and long training time, so as to improve decision-making accuracy and reduce the need for reward function design. The effect of improving training efficiency

Pending Publication Date: 2021-06-22
SOUTH CHINA UNIV OF TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, using an end-to-end deep neural network is generally difficult to converge and requires more training time, and for different tasks, different reward functions need to be designed. The specially customized network structure and parameters make it only complete specific tasks, which is difficult Expand to other different tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robot skill learning method and system based on reinforcement learning and unsupervised learning
  • Robot skill learning method and system based on reinforcement learning and unsupervised learning
  • Robot skill learning method and system based on reinforcement learning and unsupervised learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] Such as figure 1As shown, this embodiment provides a robot skill learning system based on reinforcement learning and unsupervised learning, which can input a target image, so that the robot actuator can complete the corresponding target task. The system includes an unsupervised learning subsystem, a deep reinforcement learning subsystem, a robot control subsystem and an image acquisition subsystem, and the robot control subsystem further includes a robot controller and an actuator. The unsupervised learning subsystem, the deep reinforcement learning subsystem, the robot controller and the actuator are connected sequentially.

[0041] Among them, the image acquisition subsystem is installed in the task environment where the actuator is located, and is used to collect the real-time state image of the actuator during the execution of the task in this environment. The image acquisition subsystem is connected to the unsupervised learning subsystem and transmits the real-time...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot skill learning method and system based on reinforcement learning and unsupervised learning, and the method comprises the steps: enabling a robot controller to control an execution mechanism to act when a specific task is executed, and enabling an image collection subsystem to collect a real-time state image of the execution mechanism in a task environment. An encoder of the unsupervised learning subsystem converts the real-time state image into a real-time state code, takes a state image when a set execution mechanism executes and completes a specified task as a target image, and converts the target image into a target code; the deep reinforcement learning subsystem outputs an action value of the execution mechanism at the next moment according to the real-time state code and the target code; and the robot controller controls the execution mechanism to make corresponding actions in real time according to the action value of the execution mechanism until the execution mechanism executes and completes the task. Compared with a common end-to-end network, the system provided by the invention is more universal, can be applied to different targets and tasks, and can also compress the training time.

Description

technical field [0001] The invention relates to the technical field of robot control, in particular to a robot skill learning method and system based on reinforcement learning and unsupervised learning. Background technique [0002] Reinforcement learning is a machine learning algorithm widely used at present. Its biggest feature is that it can learn the mapping from state to action from the environment. Based on the artificially designed reward function, a series of action decisions are based on maximizing Cumulative rewards are the goal. Among them, deep reinforcement learning is based on deep neural network, with state as network input and action as network output value, and the best decision-making network is obtained through continuous training. Deep reinforcement learning can be applied to various scenarios that require decision optimization, such as game AI and robot control. [0003] With the development of robot technology, the application scenarios of robots are ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/217
Inventor 陈艺文占宏杨辰光
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products