Networked multi-agent system distributed optimization control method based on reinforcement learning

A multi-agent system and optimal control technology, applied in program control, comprehensive factory control, electrical program control, etc., can solve problems such as complex system model, low system identification accuracy, and high cost

Pending Publication Date: 2020-02-11
LIAONING UNIVERSITY OF PETROLEUM AND CHEMICAL TECHNOLOGY
View PDF4 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the field of practical application, due to large-scale and complex processing technology and procedures, the system model is complex, the system identification accuracy is low and the cost is high, and the system cannot even be identified
When the system model is completely unknown, the existing fully model-dependent and partially model-dependent optimization control methods cannot be realized in practical applications.
At present, the research on networked multi-agent distributed optimal control based on reinforcement learning is still in its infancy, and there is no complete theoretical system to support the development of related technologies. It is urgent to propose a networked multi-agent system distribution based on reinforcement learning. optimal control method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Networked multi-agent system distributed optimization control method based on reinforcement learning
  • Networked multi-agent system distributed optimization control method based on reinforcement learning
  • Networked multi-agent system distributed optimization control method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] The present invention adopts following technical scheme:

[0034](1) Solving the optimal control problem based on IRL and Off-policy. In this part, we intend to solve the optimal control problem involving the completely unknown model of the networked multi-agent system for both situations with and without a leader. Based on the previous research results, the IRL technology and the off-policy strategy iterative reinforcement learning method are integrated, and extended to the optimal consensus control of the multi-agent system whose system model is completely unknown. Note that the existing iterative reinforcement learning method based on IRL and off-policy strategy optimizes the control problem, and the object is a single system whose system model is completely unknown, while the agents in the multi-agent system are coupled with each other and have distributed characteristics. , so this extension is not a direct and simple extension, and there are many scientific proble...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a networked multi-agent system distributed optimization control method based on reinforcement learning, which belongs to the technical field of system control methods. A Bellman equation with an integrator is provided for integral reinforcement learning (IRL). According to a non-strategy (Off-poly) method, different control strategy action control systems are applied, dataexploration is added, an optimal control strategy is obtained through iterative learning, a graph game is fused with Off-poly and IRL, and the consistency of networked multi-agents is ensured under the condition that a system model is completely unknown. The system comprises the following specific steps: (1) solving an optimization control problem based on IRL and Off-poly; (2) iteratively designing a distributed approximate optimal control strategy based on an approximate strategy; and (3) adopting simulation software and a multi-manipulator control system physical experiment platform, and correspondingly adjusting a theoretical method and a control technology according to simulation and physical experiment results.

Description

technical field [0001] The invention belongs to the technical field of system control methods, and in particular relates to a distributed optimization control method for a networked multi-agent system based on reinforcement learning, combining Graphical games with Off-policy and Integral Reinforcement Learning (IRL ) fusion, given the system model is completely unknown, to ensure the consistency of the networked multi-agent and optimal performance control optimization problem solution. Background technique [0002] In the field of practical application, due to large-scale and complex processing technology and procedures, the system model is complex, the system identification accuracy is low and the cost is high, and the system cannot even be identified. When the system model is completely unknown, the existing fully model-dependent and partially model-dependent optimization control methods cannot be realized in practical applications. At present, the research on networked m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G05B19/418
CPCG05B19/41835G05B19/41885G06N3/045
Inventor 李金娜张一晗
Owner LIAONING UNIVERSITY OF PETROLEUM AND CHEMICAL TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products