Supercharge Your Innovation With Domain-Expert AI Agents!

Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm

A reinforcement learning and robotics technology, applied in instruments, motor vehicles, two-dimensional position/channel control, etc., can solve problems such as low efficiency and many module movements, and achieve the effect of improving self-reconfiguration efficiency

Active Publication Date: 2022-06-07
NORTHWESTERN POLYTECHNICAL UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For robots with complex configurations and many modules, in order to ensure the solvability from the initial configuration to the target configuration, the existing algorithm will introduce an intermediate configuration as a transition, which will lead to too many modules moving and inefficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm
  • Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm
  • Self-reconfiguration planning method for heterogeneous modular robots based on reinforcement learning algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will now be further described in conjunction with the embodiments and accompanying drawings:

[0046] The purpose of the present invention is to use the reinforcement learning algorithm to realize the module disassembly sequence of the self-reconfigurable modular robot from any initial configuration to the specified target configuration for the modular robot composed of n (n>100) modules, Minimize assembly times and improve self-reconfiguration efficiency.

[0047] The objects of algorithm application are mainly robots composed of modules with the same specifications, and each module has at least two faces that can be used for docking.

[0048] In order to achieve the above object, the technical solution adopted in the present invention comprises the following steps:

[0049] Step 1: Given an initial modular robot configuration and a target configuration with N modules, convert the two configurations into an N×N×N matrix respectively, and initiali...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a self-reconfiguration planning method for heterogeneous modular robots based on a reinforcement learning algorithm. First, an initial modular robot configuration and a target configuration are given, the total number of modules N is input, and the graph structure of the modules is established through an initialization process. ;Take the initial configuration as the root node, establish a Monte Carlo tree search, and stop the search when the termination condition is reached (the target configuration is found or n times of exploration have been performed). After each search, the planned path is given and the samples are saved; when the number of samples reaches a given value, the samples are input into the neural network for training, and the training parameters are updated; after the parameters are updated, the Monte Carlo search is performed again. The average number of steps in this search result should be less, and after each search is completed, update the planned path with the least number of steps.

Description

technical field [0001] The invention belongs to the field of artificial intelligence planning, and in particular relates to applying a reinforcement learning algorithm to optimize the planning of a modular robot in the process of autonomous deformation. Background technique [0002] With the wide application of robots in various fields, more and more robots are used in unstructured environments to complete operations. For unknown working environments and different work tasks, robots need to be able to adapt to the environment by changing their configuration. This kind of robot that can dynamically and autonomously change its configuration to meet the needs of the task is called a self-reconfigurable modular robot. The self-reconfigurable modular robot is composed of a series of unit modules with simple structure and different functions. In order to achieve better taskability and environmental adaptability, when the multi-module system is running, each module unit independen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02
CPCG05D1/0221
Inventor 张夷斋王文卉黄攀峰孟中杰常海涛
Owner NORTHWESTERN POLYTECHNICAL UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More