Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voltage regulation method and system based on evolutionary learning and deep reinforcement learning

A voltage regulation and reinforcement learning technology, applied in the field of voltage regulation, can solve the problems of difficult to achieve online control, large amount of calculation, poor communication infrastructure of distribution network, etc., to promote diversity, wide applicability, and strong scalability. Effect

Active Publication Date: 2022-04-12
NANJING UNIV OF POSTS & TELECOMM
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002]With the proposal of China's dual carbon targets and the rapid development of new energy power generation technology, the high proportion of distributed new energy access to the distribution network has brought great benefits to the safe operation of the power grid. Big challenge; the intermittent and random nature of distributed new energy output and the imbalance between power generation and load can easily cause the problem of voltage crossing of feeder nodes. The existing distribution network voltage control has the characteristics of local adjustment and dependence on external equipment. With the exhaustion of external regulation resources, it is gradually unable to meet the voltage regulation requirements of the distribution network under the high proportion of decentralized new energy access. Therefore, how to effectively coordinate the high proportion of new energy active and reactive power output has important practical significance for the grid voltage regulation.
[0003]Currently based on traditional distribution network voltage regulation methods, there are mathematical optimization method, intelligent optimization method, stochastic optimization method and model prediction method, etc. Although these methods can However, these methods generally have problems such as large amount of calculation, easy to fall into local optimum, heavy reliance on forecast data, and difficulty in realizing online control. Moreover, the existing distribution network communication infrastructure is poor, and information interaction is limited to some Observable nodes. At the same time, the traditional voltage regulation method has high requirements on the accuracy of the model, which is difficult to apply to the development trend of new forms of high-proportion new energy generation connected to the distribution network; based on the shortcomings of the above traditional voltage regulation methods and the limitations of artificial intelligence technology Development, the latest research uses data-driven deep reinforcement learning methods, including Deep Deterministic Policy Gradient (DDPG), Multi-Agent Deep Deterministic Policy Gradient (MADDPG), Multi-Agent Constrained Soft Actor-Critic (MACSAC) and other methods, in multiple Remarkable results have been achieved in node coordinated voltage regulation. However, these methods have obvious limitations and cannot be applied to large-scale new energy power generation node coordinated voltage regulation scenarios.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voltage regulation method and system based on evolutionary learning and deep reinforcement learning
  • Voltage regulation method and system based on evolutionary learning and deep reinforcement learning
  • Voltage regulation method and system based on evolutionary learning and deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] Such as figure 1 As shown, a voltage regulation method based on evolutionary learning and deep reinforcement learning provided by an embodiment of the present invention includes:

[0038] S1. Obtain the real-time detected environmental state, input it into the trained policy network, and obtain the voltage regulation policy.

[0039] Use the self-attention mechanism multi-node deep reinforcement learning algorithm to carry out multi-stage progressive multi-node deep reinforcement learning training on the policy network corresponding to each node, and collect historical operation data of the distribution network as sample data for multi-node deep reinforcement learning network training .

[0040] The policy network is established based on the self-attention mechanism, and its function is expressed as:

[0041] P n (x)=h n ([g n (f n (o n )), v n ])

[0042] Among them, o n is the observation of the nth node, f n (o n ) is the observation code of the nth node, g...

Embodiment 2

[0100] An embodiment of the present invention provides a voltage regulation system based on evolutionary learning and deep reinforcement learning, including:

[0101] Voltage regulation strategy acquisition module: obtain the real-time detected environmental status, input it into the trained strategy network, and obtain the voltage regulation strategy;

[0102] Voltage regulation module: According to the voltage regulation strategy, the voltage regulation resources are mobilized to complete the voltage regulation.

[0103] The policy network is trained by:

[0104] Carry out multi-stage progressive multi-node deep reinforcement learning training on the policy network, apply evolutionary learning in each stage of training, and double the number of trained policy networks through crossover between trained policy networks. In the next stage, the trained policy network is mutated until the number of trained policy networks reaches the preset target; each node corresponds to a pol...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voltage regulation method and system based on evolutionary learning and deep reinforcement learning, and belongs to the field of artificial intelligence and control system cross technology, and the method comprises the steps: obtaining an environment state detected in real time, inputting the environment state into a trained strategy network, and obtaining a voltage regulation strategy; voltage regulation resources are called according to the voltage regulation strategy to complete voltage regulation; the strategy network is trained through the following method: multi-stage progressive multi-node deep reinforcement learning training is performed on the strategy network, evolutionary learning is applied in each stage of training, and the number of the trained strategy networks is doubled through intersection between the trained strategy networks. Performing mutation operation on the trained policy networks in the next stage of performing the interlace operation until the number of the trained policy networks reaches a preset target; each node corresponds to one policy network; the method is suitable for multi-node power distribution network collaborative voltage regulation, promotes the diversity of the network training process, and has strong expansibility.

Description

technical field [0001] The invention relates to a voltage regulation method and system based on evolutionary learning and deep reinforcement learning, and belongs to the cross technical field of artificial intelligence and control systems. Background technique [0002] With the proposal of my country's dual carbon goals and the rapid development of new energy power generation technology, the high proportion of distributed new energy access to the distribution network has brought great challenges to the safe operation of the power grid; the intermittent, random and The imbalance between power generation and load can easily cause the problem of voltage crossing of feeder nodes. The existing distribution network voltage control has the characteristics of local adjustment and dependence on external equipment. Therefore, how to effectively coordinate the active and reactive output of high-proportion new energy sources has important practical significance for grid voltage regulation...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08H02J3/46H02J3/48H02J3/50H02J3/12
CPCY02E40/30
Inventor 岳东张廷军窦春霞余亮丁孝华赵景涛郑舒
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products