Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A reinforcement learning and robotics technology, applied in instruments, motor vehicles, two-dimensional position/channel control, etc., can solve problems such as low efficiency and many module movements, and achieve the effect of improving self-reconfiguration efficiency

Active Publication Date: 2022-06-07

NORTHWESTERN POLYTECHNICAL UNIV

View PDF7 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For robots with complex configurations and many modules, in order to ensure the solvability from the initial configuration to the target configuration, the existing algorithm will introduce an intermediate configuration as a transition, which will lead to too many modules moving and inefficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] The present invention will now be further described in conjunction with the embodiments and accompanying drawings:

[0046] The purpose of the present invention is to use the reinforcement learning algorithm to realize the module disassembly sequence of the self-reconfigurable modular robot from any initial configuration to the specified target configuration for the modular robot composed of n (n>100) modules, Minimize assembly times and improve self-reconfiguration efficiency.

[0047] The objects of algorithm application are mainly robots composed of modules with the same specifications, and each module has at least two faces that can be used for docking.

[0048] In order to achieve the above object, the technical solution adopted in the present invention comprises the following steps:

[0049] Step 1: Given an initial modular robot configuration and a target configuration with N modules, convert the two configurations into an N×N×N matrix respectively, and initiali...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a self-reconfiguration planning method for heterogeneous modular robots based on a reinforcement learning algorithm. First, an initial modular robot configuration and a target configuration are given, the total number of modules N is input, and the graph structure of the modules is established through an initialization process. ;Take the initial configuration as the root node, establish a Monte Carlo tree search, and stop the search when the termination condition is reached (the target configuration is found or n times of exploration have been performed). After each search, the planned path is given and the samples are saved; when the number of samples reaches a given value, the samples are input into the neural network for training, and the training parameters are updated; after the parameters are updated, the Monte Carlo search is performed again. The average number of steps in this search result should be less, and after each search is completed, update the planned path with the least number of steps.

Description

technical field [0001] The invention belongs to the field of artificial intelligence planning, and in particular relates to applying a reinforcement learning algorithm to optimize the planning of a modular robot in the process of autonomous deformation. Background technique [0002] With the wide application of robots in various fields, more and more robots are used in unstructured environments to complete operations. For unknown working environments and different work tasks, robots need to be able to adapt to the environment by changing their configuration. This kind of robot that can dynamically and autonomously change its configuration to meet the needs of the task is called a self-reconfigurable modular robot. The self-reconfigurable modular robot is composed of a series of unit modules with simple structure and different functions. In order to achieve better taskability and environmental adaptability, when the multi-module system is running, each module unit independen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G05D1/02

CPCG05D1/0221

Inventor 张夷斋王文卉黄攀峰孟中杰常海涛

Owner NORTHWESTERN POLYTECHNICAL UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology