Networked multi-agent system distributed optimization control method based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A multi-agent system and optimal control technology, applied in program control, comprehensive factory control, electrical program control, etc., can solve problems such as complex system model, low system identification accuracy, and high cost

Pending Publication Date: 2020-02-11

LIAONING UNIVERSITY OF PETROLEUM AND CHEMICAL TECHNOLOGY

View PDF4 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0002] In the field of practical application, due to large-scale and complex processing technology and procedures, the system model is complex, the system identification accuracy is low and the cost is high, and the system cannot even be identified

When the system model is completely unknown, the existing fully model-dependent and partially model-dependent optimization control methods cannot be realized in practical applications.

At present, the research on networked multi-agent distributed optimal control based on reinforcement learning is still in its infancy, and there is no complete theoretical system to support the development of related technologies. It is urgent to propose a networked multi-agent system distribution based on reinforcement learning. optimal control method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0033] The present invention adopts following technical scheme:

[0034](1) Solving the optimal control problem based on IRL and Off-policy. In this part, we intend to solve the optimal control problem involving the completely unknown model of the networked multi-agent system for both situations with and without a leader. Based on the previous research results, the IRL technology and the off-policy strategy iterative reinforcement learning method are integrated, and extended to the optimal consensus control of the multi-agent system whose system model is completely unknown. Note that the existing iterative reinforcement learning method based on IRL and off-policy strategy optimizes the control problem, and the object is a single system whose system model is completely unknown, while the agents in the multi-agent system are coupled with each other and have distributed characteristics. , so this extension is not a direct and simple extension, and there are many scientific proble...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a networked multi-agent system distributed optimization control method based on reinforcement learning, which belongs to the technical field of system control methods. A Bellman equation with an integrator is provided for integral reinforcement learning (IRL). According to a non-strategy (Off-poly) method, different control strategy action control systems are applied, dataexploration is added, an optimal control strategy is obtained through iterative learning, a graph game is fused with Off-poly and IRL, and the consistency of networked multi-agents is ensured under the condition that a system model is completely unknown. The system comprises the following specific steps: (1) solving an optimization control problem based on IRL and Off-poly; (2) iteratively designing a distributed approximate optimal control strategy based on an approximate strategy; and (3) adopting simulation software and a multi-manipulator control system physical experiment platform, and correspondingly adjusting a theoretical method and a control technology according to simulation and physical experiment results.

Description

technical field [0001] The invention belongs to the technical field of system control methods, and in particular relates to a distributed optimization control method for a networked multi-agent system based on reinforcement learning, combining Graphical games with Off-policy and Integral Reinforcement Learning (IRL ) fusion, given the system model is completely unknown, to ensure the consistency of the networked multi-agent and optimal performance control optimization problem solution. Background technique [0002] In the field of practical application, due to large-scale and complex processing technology and procedures, the system model is complex, the system identification accuracy is low and the cost is high, and the system cannot even be identified. When the system model is completely unknown, the existing fully model-dependent and partially model-dependent optimization control methods cannot be realized in practical applications. At present, the research on networked m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N3/04G05B19/418

CPCG05B19/41835G05B19/41885G06N3/045Y02P90/02

Inventor 李金娜张一晗

Owner LIAONING UNIVERSITY OF PETROLEUM AND CHEMICAL TECHNOLOGY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Networked multi-agent system distributed optimization control method based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology