Random nonlinear multi-agent reinforcement learning optimization formation control method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-agent, reinforcement learning technology, applied in the direction of adaptive control, comprehensive factory control, general control system, etc., can solve complex algorithms, difficult to apply and popularize random multi-agent systems, difficult to optimize design and other problems, to achieve The effect of mitigating complexity

Inactive Publication Date: 2022-07-12

BINZHOU UNIV

View PDF0 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the optimal formation control of stochastic multi-agent systems is seldom reported due to the state coupling required for stochastic multi-agent control, which brings difficulties to the optimal design.

The main reason is that traditional reinforcement learning optimization methods are either complex algorithms or require some strict conditions, such as: continuous incentives, they are difficult to apply and extend to stochastic multi-agent system control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0047] The technical solutions in the embodiments of the present invention will be clearly and completely described in further detail below with reference to the accompanying drawings in the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0048] like figure 1 As shown, for the stochastic nonlinear multi-agent system with unknown dynamics, an optimal formation control method of stochastic nonlinear multi-agent based on reinforcement learning is provided. The specific contents are as follows:

[0049] 1. System Modeling and Problem Statement

[0050] Consider the following stochastic nonlinear multi-agent system, whose interconnection and communication topology between agents is an undirected connected graph:

[0051] dx i (t)=(u i +f i (x i ))dt+ψ i (x i )dw,i=1,...,n (1)...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of adaptive non-linear control, particularly discloses a reinforcement learning optimization formation control method for random non-linear multi-agent, and designs an adaptive identifier for estimating unknown random power based on the function approximation capability of a neural network. Then, optimization control is obtained by constructing an evaluation network and executing reinforcement learning through the network; in the invention, the reinforcement learning algorithm is obtained by executing a gradient descent method on a simple positive function, and the function is designed according to the partial derivative of an HJB equation, so that the optimal control is simpler than the traditional method, and can be more conveniently applied to a random nonlinear multi-agent system; and finally, verifying from two aspects of theorem proving and computer simulation, so that the proposed optimization method can realize an expected target.

Description

technical field [0001] The invention relates to the technical field of adaptive nonlinear control, in particular to a reinforcement learning optimization formation control method of random nonlinear multi-agents. Background technique [0002] The control design of stochastic systems is a very challenging problem because its differentiation involves not only stochastic disturbances, but also Hessian terms in stability analysis. With the development of control theory, some popular nonlinear control techniques, such as Sontag's stabilization formula, backstepping technique and adaptive observer, have been extended to stochastic systems. It is worth mentioning that some optimal control methods have also been extended to stochastic nonlinear systems. However, since stochastic multi-agent control requires state coupling, which brings difficulties to the optimal design, the optimal formation control of stochastic multi-agent systems is rarely reported. The main reason is that tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G05B13/02

CPCG05B13/024Y02P90/02

Inventor 文国兴高发亮

Owner BINZHOU UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Random nonlinear multi-agent reinforcement learning optimization formation control method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology