A Reinforcement Learning-Based Intelligent Scheduling Method for Defective Materials in Power System

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and power system, which is applied in the field of intelligent scheduling of defective materials in power system based on reinforcement learning, can solve the problems of large consumption of computing resources, slow response, and represent the global information of the system, and achieve the effect of good convergence and gain.

Active Publication Date: 2022-07-22

GUIZHOU POWER GRID CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] For the statistical optimization method, this method needs to have complete statistics for all demand distributions in the region. At the same time, every time a state transition or emergency event occurs, the optimal distribution needs to be recalculated, which consumes a lot of computing resources and responds slowly. Limitations: For the data prediction method, the traditional feature selection is usually based on the feature ranking method, according to the importance and correlation of each feature calculated, and take the top k features as the input of demand forecasting. The biggest disadvantage of this method is that Selecting k features with the greatest importance and correlation cannot well represent the global information of the system, so it cannot provide the most abundant information for the prediction system; at the same time, because the predicted result is not the final result, according to the predicted result we will Perform secondary calculations to obtain scheduling and control schemes. The multi-step framework will cause errors to accumulate and lead to deviations in the final results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0043] refer to figure 1 , figure 2 and image 3 , which is the first embodiment of the present invention, provides a reinforcement learning-based intelligent scheduling method for defective materials in a power system, including:

[0044] S1: Define the state, decision-making, transition equation, reward function and requirements and goals in the dynamic scheduling problem of material warehousing in reinforcement learning. It should be noted that,

[0045] Define the state, decision-making, transition equation, reward function, and requirements and objectives in the dynamic scheduling problem of material storage in the reinforcement learning algorithm for power system defect material scheduling;

[0046] The state is the warehousing state and the material defect state at time t, the decision is the scheduling method and the purchasing method at this time, and the transfer equation is the change equation before and after;

[0047] Define the state of the current moment, t...

Embodiment 2

[0073] refer to Figure 4 and Figure 5 , which is the second embodiment of the present invention, which differs from the first embodiment in that it provides an authenticity verification of a method for intelligent scheduling of defective materials in a power system based on reinforcement learning, including:

[0074] In order to better verify and explain the technical effect adopted in the method of the present invention, in this embodiment, the traditional intelligent scheduling method based on greedy algorithm and the method of the present invention are selected to carry out a comparative test, and the test results are compared by means of scientific demonstration to verify the present invention. The real effect of the inventive method.

[0075] The traditional intelligent scheduling method based on greedy algorithm has low convergence and gain. In order to verify that the method of the present invention has higher gain and convergence compared with the traditional method...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an intelligent scheduling method for defective materials in a power system based on reinforcement learning. The decision-making process solves the material storage dynamic scheduling problem; lists the Bellman equations for grid-defective materials and selects a solution strategy; modifies the Bellman equation into a data-driven online update form, and determines scheduling actions based on the ε greedy strategy. The invention proposes to solve the joint control and scheduling problem of emergency materials in power system based on Markov stochastic process and reinforcement learning, and the "end-to-end" algorithm does not predict demand, but directly makes inventory control and scheduling decisions; It has been verified, with good convergence and gain, which proves the usability and practical value of the method.

Description

technical field [0001] The invention relates to the technical field of power grid and artificial intelligence scheduling, in particular to a method for intelligent scheduling of defective materials in a power system based on reinforcement learning. Background technique [0002] Statistical optimization method: Model the distribution of various urgent needs according to statistical laws, and calculate the statistically average optimal warehouse distribution through centralized mathematical modeling. [0003] Data prediction method: Based on the idea of data analysis and mining in each area, using artificial intelligence and machine learning methods, build a time series model (sequence-to-sequence model) for the different needs of each area, so as to carry out time series analysis. Prediction; then, on the basis of prediction, centralized layout and optimization of warehousing system and scheduling. [0004] For the statistical optimization method, this method needs to have...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06Q10/06G06Q50/06

CPCG06Q10/06312G06Q50/06

Inventor 俞虹唐诚旋蒋群群陈珏伊张秀程文美代洲徐一蝶

Owner GUIZHOU POWER GRID CO LTD

A Reinforcement Learning-Based Intelligent Scheduling Method for Defective Materials in Power System

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology