A Hybrid System Energy Management Strategy Based on Inverse Deep Reinforcement Learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and energy management, applied in general control systems, control/regulation systems, instruments, etc., can solve problems such as inability to carry out online applications, and achieve fast calculation speed, good effect, and good real-time performance

Active Publication Date: 2021-09-21

SOUTH CHINA UNIV OF TECH

View PDF10 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the global optimization energy management strategy is only aimed at solving known working conditions and cannot be applied online. Traditional reinforcement learning methods perform well in solving tasks with limited state and action spaces, but they are difficult to solve in terms of state and action space dimensions. When the problem is high, it seems powerless (Chen Xiliang, Cao Lei, He Ming, Li Chenxi, Xu Zhixiong. A review of deep reverse reinforcement learning [J]. Computer Engineering and Application, 2018, 54(05): 24-35.)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0039] Such as figure 1 , figure 2 As shown, a hybrid system energy management strategy based on reverse deep reinforcement learning includes the following steps:

[0040] S1: Use the optimization solution method to calculate the global hybrid mode allocation ratio and the global optimized SOC result under one of the complete working conditions, and form an expert state-action pair as expert knowledge for reverse reinforcement learning; the optimization solution method includes Pseudospectral method, dynamic programming method, genetic algorithm.

[0041] S2: Create a reward function neural network and initialize parameters;

[0042] The reward function neural network is composed of a fully connected neural network, a convolutional neural network, and a long-term short-term memory neural network stacked in the order of a convolutional neural network, a long-term short-term memory neural network, and a fully connected neural network; the fully connected neural network consis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an energy management strategy of a hybrid system based on reverse deep reinforcement learning. The strategy includes: using the optimization solution method to calculate the globally optimized SOC result as expert knowledge; creating a reward neural network; using reverse reinforcement learning to learn expert knowledge to obtain parameters of the reward neural network; creating an action neural network and evaluating the neural network; setting the vehicle SOC value before interaction; input the obtained SOC value before interaction into the reward neural network to obtain the reward value; input the obtained SOC value before interaction into the action neural network to obtain the mode allocation ratio; use the mode allocation ratio to interact with the environment, and obtain SOC value after interaction; input the SOC value before interaction, mode allocation ratio, reward value, and SOC value after interaction into the evaluation neural network to obtain the evaluation value; the agent calculates the gradient of each network and backpropagates to update the network parameters until the training is completed. The invention can learn the optimal reward function from expert knowledge, so that the effect of deep reinforcement learning is better.

Description

technical field [0001] The invention relates to the field of hybrid system energy management, in particular to a hybrid system energy management strategy based on reverse deep reinforcement learning. Background technique [0002] The hybrid electromechanical coupling device couples the power of multiple power sources such as the internal combustion engine and the electric motor in a hybrid electric vehicle (HEV) and performs reasonable power distribution, and transmits the power to the drive axle to drive the vehicle. It can be regarded as an automatic transmission system with electric motors by integrating one or more electric motors into the transmission, which is a complex system integrating mechanics, electricity, chemistry and thermodynamics. China's automobile fuel consumption regulations put forward very high requirements for energy conservation and emission reduction of vehicle manufacturers in the next 10-15 years. With the further increase in the pressure of energ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G05B13/04

CPCG05B13/042

Inventor 李梓棋赵克刚

Owner SOUTH CHINA UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A Hybrid System Energy Management Strategy Based on Inverse Deep Reinforcement Learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology