Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Power grid transmission line defense method based on multi-agent deep reinforcement learning

A power transmission line and reinforcement learning technology, applied in neural learning methods, based on specific mathematical models, reasoning methods, etc., can solve the problem of increased action space, deep reinforcement learning methods in action space are not scalable, and existing methods are no longer applicable And other issues

Active Publication Date: 2021-03-12
NANJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In fact, research results based on reinforcement learning methods show that given the same attack resources, multi-stage attacks can cause greater losses than single-stage attacks
However, this method only considers small-scale grid
When the scale of the power grid gradually increases, the state space will increase, and the action space will also increase sharply, which will make the existing methods no longer applicable (such as unable to converge)
Although the challenge of large state spaces can be overcome with deep reinforcement learning methods, the exponential growth of action space with the number of attack stages makes single-agent deep reinforcement learning methods not scalable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Power grid transmission line defense method based on multi-agent deep reinforcement learning
  • Power grid transmission line defense method based on multi-agent deep reinforcement learning
  • Power grid transmission line defense method based on multi-agent deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0077] Such as figure 1 As shown, the embodiment of the present invention provides a method for resisting attacks on power grid transmission lines in the present invention, and the method includes the following steps:

[0078] Based on the environment state, the attack behavior of the agent and the reward of the agent, the deep neural network of each agent is trained using the preset training method;

[0079] According to the trained deep neural network and the state of the environment, each agent independently determines the attack behavior, and the optimal attack line set of each agent is obtained after the attack resources are used up;

[0080] The above steps are repeated several times, and the optimal defense line set is obtained according to the optimal attack line set of each agent for key defense. In the case of the same attack resources, the power grid transmission line defense method proposed by the present invention can effectively reduce the losses caused by multi...

Embodiment 2

[0129] This embodiment is a specific application scenario based on a multi-agent deep reinforcement learning-based power grid transmission line defense method provided in Embodiment 1. The following embodiments are only used to illustrate the technical solution of the present invention more clearly, and not to This limits the protection scope of the present invention.

[0130] In this embodiment, the IEEE 118 bus is used as the simulation of the grid environment, and the grid includes 186 transmission lines in total.

[0131] Step 1: Model the multi-stage optimal attack problem of power grid transmission lines as a Markov game, and design the corresponding environmental states, behaviors and rewards.

[0132] Specifically, a Markov game is a multi-agent extension of the Markov decision process, and a Markov game can be defined by a series of states, behaviors, state transition functions, and rewards. The objective function of the multi-stage optimal attack problem on power gr...

Embodiment 3

[0189] The embodiments of the present invention provide that the present invention provides a power grid transmission line defense system based on multi-agent deep reinforcement learning, including:

[0190] Training module: used to train the deep neural network of each agent based on the environment state, the attack behavior of the agent and the reward of the agent, using the preset training method;

[0191] Test module: used to determine the attack behavior of each agent independently according to the deep neural network and environment state obtained through training, and obtain the optimal attack line set of each agent after the attack resources are exhausted;

[0192] Defense module: used to obtain the optimal defense line set according to the optimal attack line set of each agent for key defense.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a power grid transmission line defense method based on multi-agent deep reinforcement learning, and the method comprises the steps: training a deep neural network of each agentthrough employing a preset training method based on the environment state, the attack behaviors of the agents and the rewards of the agents; according to the deep neural network and the environment state obtained through training, enabling each agent to autonomously determine an attack behavior, and obtaining an optimal attack line set of each agent after attack resources are used up; repeating the above steps for many times, and obtaining an optimal defense line set according to the optimal attack line set of each agent to perform key defense. According to the invention, the performance lossof the power grid caused by multi-stage cooperative transmission line attack can be effectively reduced.

Description

technical field [0001] The invention relates to a power grid transmission line defense method based on multi-agent deep reinforcement learning, and belongs to the cross technical field of smart grid security and artificial intelligence. Background technique [0002] The large-scale interconnection of power grids realizes the optimization of large-scale resource sharing. While achieving significant economic benefits, the highly coupled characteristics of information and physical systems in the power grid have brought new challenges to the safe and stable operation of the power grid. When a cyber or physical attack is launched, it is possible that the transmission line will be disconnected, which will trigger a series of chain reactions and cause large-scale power interruptions in the power grid, such as the 2012 blackout in India. Therefore, it is of great significance to identify and defend these transmission lines that may trigger a chain reaction and lead to grid disassem...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06G06N3/04G06N3/08G06N5/04G06N7/00
CPCH04L63/1416H04L63/1441G06N3/08G06N5/042G06N7/01G06N3/045
Inventor 高镇余亮沈超岳东窦春霞刘爱萍
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products