Cluster electric vehicle charging behavior optimization method based on deep reinforcement learning

An electric vehicle and reinforcement learning technology, applied in electric vehicle charging technology, electric vehicles, charging stations, etc., can solve problems such as discrete action space, poor stability, and difficulty in training convergence

Active Publication Date: 2020-11-13
NORTH CHINA ELECTRIC POWER UNIV (BAODING)
View PDF4 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Compared with the traditional optimization control method, TD3 has obvious advantages in speed and flexibility, and can effectively over

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cluster electric vehicle charging behavior optimization method based on deep reinforcement learning
  • Cluster electric vehicle charging behavior optimization method based on deep reinforcement learning
  • Cluster electric vehicle charging behavior optimization method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0077]The present invention provides a charging behavior optimization method for cluster electric vehicles based on deep reinforcement learning, the optimization method is a charging optimization management method for cluster electric vehicles based on deep reinforcement learning; the optimization method is based on a double-delay deep deterministic strategy The gradient (twin delay DDPG, TD3) algorithm realizes the modeling of the continuously adjustable charging process of electric vehicles, trains the agent to control the charging power, optimizes the charging behavior of electric vehicles, and transfers the load when the time-of-use electricity price is high to when the electricity price is low Transfer to achieve the purpose of reducing user charging expenses and smoothing the peak load of the power grid; the charging process of a single electric vehicle is modeled through the twin delay deep deterministic policy gradient algorithm (twin delay deep deterministic policy grad...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cluster electric vehicle charging behavior optimization method based on deep reinforcement learning, and belongs to the technical field of new energy vehicle optimization management. The invention relates to a deterministic strategy gradient algorithm based on double delay depths. Modeling is carried out on the power continuous adjustable charging process of the electricvehicle, an intelligent agent is trained to control the charging power, the charging behavior of the electric vehicle is optimized, the load when the time-of-use electricity price is high is transferred to the load when the electricity price is low, and the purposes of reducing the user charging expenditure and stabilizing the peak-time load of a power grid are achieved. Compared with a traditional optimization control method, the TD3 has obvious advantages in speed and flexibility, and the problems that an existing reinforcement learning method is discrete in action space, difficult in training convergence and poor in stability can be effectively solved. In order to enhance the generalization ability of the intelligent agent, noise is added on the basis of original state observation, a group of electric vehicles with different initial SOCs and different arrival and departure time are simulated, and the method is expanded to cluster electric vehicle charging behavior control.

Description

technical field [0001] The invention belongs to the field of power system optimization scheduling, in particular to a method for optimizing charging behavior of cluster electric vehicles based on deep reinforcement learning. Background technique [0002] China attaches great importance to the development of the new energy vehicle industry; it is estimated that by 2025, the sales volume of new energy vehicles will account for about 25%. Reached about 7 million. The high power and space-time uncertainty of electric vehicles will change the existing load level of the power grid, further increase the peak-valley difference, and have an impact on the safety and stability of the power grid. [0003] As an important means of demand-side management, peak-valley time-of-use electricity pricing plays an important role in guiding and regulating electricity consumption behavior and assisting power grids in peak-shaving and valley-filling. The load aggregator (aggregator) can respond t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H02J3/32G06Q10/04G06Q50/06B60L53/64
CPCH02J3/322G06Q10/04G06Q50/06B60L53/64H02J2203/20H02J2310/48Y02T10/70Y02T10/7072Y02T90/12
Inventor 胡俊杰赵星宇
Owner NORTH CHINA ELECTRIC POWER UNIV (BAODING)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products