Supercharge Your Innovation With Domain-Expert AI Agents!

Multi-agent formation control method based on actor-reviewer reinforcement learning and fuzzy logic

A technology of reinforcement learning and fuzzy logic, applied in the direction of adaptive control, general control system, control/regulation system, etc., can solve problems that are difficult to obtain analytical solutions

Active Publication Date: 2020-11-06
FUZHOU UNIV
View PDF4 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It mainly takes the second-order linear system as the research object. Firstly, the optimal control method is introduced into the leader-follower formation control method of the multi-robot system, and the ability of the fuzzy logic system to approximate the continuous function is used to solve the problem of Hamilton-Jacobi-Bell in optimal control. It is difficult to find an analytical solution to the Mann equation; secondly, combined with the actor-critic reinforcement learning algorithm, an actor fuzzy logic system module and a critic fuzzy logic system module are formed, the former executes the control behavior, and the latter evaluates the behavior selected by the former And feedback the evaluation information to the former; Finally, the Bellman residual error is minimized by the gradient descent method, and the parameter vector update law of the critic fuzzy logic system module and actor fuzzy logic system module is designed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent formation control method based on actor-reviewer reinforcement learning and fuzzy logic
  • Multi-agent formation control method based on actor-reviewer reinforcement learning and fuzzy logic
  • Multi-agent formation control method based on actor-reviewer reinforcement learning and fuzzy logic

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to make the features and advantages of this patent more obvious and easy to understand, the following special examples are described in detail as follows:

[0071] Such as figure 1 As shown, the present embodiment provides a second-order linear system optimal formation control algorithm based on actor-critic reinforcement learning algorithm and fuzzy logic system; figure 2 and image 3 As shown, this embodiment uses 4 followers and 1 leader to perform matlab simulation as an example.

[0072] The specific content of this embodiment includes the following points:

[0073] The communication topology between robots is established through graphs, and the position and speed information of their adjacent robots can be obtained between robots;

[0074] The optimal control strategy is introduced, and the cost function and value function are obtained through the calculated formation error;

[0075] The value function is decomposed into the formation error square te...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a multi-agent formation control method based on actor-reviewer reinforcement learning and fuzzy logic. An optimal control method is introduced into a pilot follower formation control method of a multi-robot system, and the capability of approaching a continuous function of a fuzzy logic system is utilized to solve the problem that it is difficult to obtain an analytical solution by a Hamilton-Jacobian-Bellman equation in optimal control; meanwhile, an actor-reviewer reinforcement learning algorithm is combined to form an actor fuzzy logic system module and a reviewer fuzzy logic system module, the former executes control behaviors, and the latter evaluates the behaviors selected by the former and feeds evaluation information back to the former. According to the method, the control performance and the resource loss can be balanced, and the adaptability of the multi-robot system to the environment is improved in an online learning mode.

Description

technical field [0001] The invention belongs to the field of robot formation control, and in particular relates to a multi-agent second-order linear system optimal formation control method based on actor-critic reinforcement learning and fuzzy logic. Background technique [0002] In the past ten years, due to its greater redundancy, multi-robot systems have better fault tolerance and robustness than single-robot systems, and can cooperate to complete many tasks that single robots cannot complete. In a multi-robot system, robot formation is one of the control methods for robots to perform tasks cooperatively. As one of the formation control techniques, the leader-follower method can realize the distributed control of multi-robot systems, and has the characteristics of high flexibility and easy use. Introducing the optimal control method into the multi-robot system formation control can achieve the control goal of balancing control performance and resource consumption by mini...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05B13/04
CPCG05B13/042
Inventor 黄捷张子鹏王武蔡逢煌陈宇韬柴琴琴林琼斌张祯毅李卓敏
Owner FUZHOU UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More