Random nonlinear multi-agent reinforcement learning optimization formation control method

A multi-agent, reinforcement learning technology, applied in the direction of adaptive control, comprehensive factory control, general control system, etc., can solve complex algorithms, difficult to apply and popularize random multi-agent systems, difficult to optimize design and other problems, to achieve The effect of mitigating complexity

Inactive Publication Date: 2022-07-12
BINZHOU UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the optimal formation control of stochastic multi-agent systems is seldom reported due to the state coupling required for stochastic multi-agent control, which brings difficulties to the optimal design.
The main reason is that traditional reinforcement learning optimization methods are either complex algorithms or require some strict conditions, such as: continuous incentives, they are difficult to apply and extend to stochastic multi-agent system control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Random nonlinear multi-agent reinforcement learning optimization formation control method
  • Random nonlinear multi-agent reinforcement learning optimization formation control method
  • Random nonlinear multi-agent reinforcement learning optimization formation control method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The technical solutions in the embodiments of the present invention will be clearly and completely described in further detail below with reference to the accompanying drawings in the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0048] like figure 1 As shown, for the stochastic nonlinear multi-agent system with unknown dynamics, an optimal formation control method of stochastic nonlinear multi-agent based on reinforcement learning is provided. The specific contents are as follows:

[0049] 1. System Modeling and Problem Statement

[0050] Consider the following stochastic nonlinear multi-agent system, whose interconnection and communication topology between agents is an undirected connected graph:

[0051] dx i (t)=(u i +f i (x i ))dt+ψ i (x i )dw,i=1,...,n (1)...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of adaptive non-linear control, particularly discloses a reinforcement learning optimization formation control method for random non-linear multi-agent, and designs an adaptive identifier for estimating unknown random power based on the function approximation capability of a neural network. Then, optimization control is obtained by constructing an evaluation network and executing reinforcement learning through the network; in the invention, the reinforcement learning algorithm is obtained by executing a gradient descent method on a simple positive function, and the function is designed according to the partial derivative of an HJB equation, so that the optimal control is simpler than the traditional method, and can be more conveniently applied to a random nonlinear multi-agent system; and finally, verifying from two aspects of theorem proving and computer simulation, so that the proposed optimization method can realize an expected target.

Description

technical field [0001] The invention relates to the technical field of adaptive nonlinear control, in particular to a reinforcement learning optimization formation control method of random nonlinear multi-agents. Background technique [0002] The control design of stochastic systems is a very challenging problem because its differentiation involves not only stochastic disturbances, but also Hessian terms in stability analysis. With the development of control theory, some popular nonlinear control techniques, such as Sontag's stabilization formula, backstepping technique and adaptive observer, have been extended to stochastic systems. It is worth mentioning that some optimal control methods have also been extended to stochastic nonlinear systems. However, since stochastic multi-agent control requires state coupling, which brings difficulties to the optimal design, the optimal formation control of stochastic multi-agent systems is rarely reported. The main reason is that tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05B13/02
CPCG05B13/024Y02P90/02
Inventor 文国兴高发亮
Owner BINZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products