Strategy collaborative selection method based on deep reinforcement learning DDPG algorithm framework
Patent Information
- Authority / Receiving Office
- CN Β· China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- UNIV OF ELECTRONICS SCI & TECH OF CHINA
- Publication Date
- 2021-06-04
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The present invention relates to the technical field of reinforcement learning, in particular to a method for strategic cooperation selection based on a deep reinforcement learning DDPG algorithm framework. Background technique
[0002] The problem discussed in reinforcement learning is how an agent finds a strategy to maximize the reward it can obtain in a complex and uncertain environment. Lillicrap et al. proposed the DDPG (deep deterministic policy gradient) algorithm in 2015, which is a deep reinforcement learning algorithm on the actor-critic framework (Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez , T., Tassa, Y., Silver, D., & Wierstra, D. (2015). Continuous control with deep reinforcement learning.). DDPG is the first reinforcement learning algorithm to efficiently solve many high-dimensional continuous control tasks. It is also a deterministic policy gradient algorithm based on actor-critic architecture. Contains actor current ...