Deep Q network reinforcement learning method and device for cognitive behavior model acceleration
Patent Information
- Authority / Receiving Office
- CN Β· China
- Current Assignee / Owner
- NAT UNIV OF DEFENSE TECH
- Publication Date
- 2021-10-26
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The present disclosure relates to the technical field of reinforcement learning, in particular to a deep Q-network reinforcement learning method and equipment accelerated by a cognitive behavior model. Background technique
[0002] The problem of sampling efficiency (Sample Efficiency) has always restricted the application of reinforcement learning algorithms in complex problems. In reinforcement learning applications, the agent learns to interact with the environment through trial and error, so a large number of interaction samples are often required to fully explore the state-action space and converge to the optimal strategy. Especially in the face of complex tasks (such as high-dimensional, continuous state space or sparse environment rewards), the problem of low sampling efficiency of reinforcement learning agents is particularly prominent.
[0003] Utilizing appropriate prior knowledge or transferring the learned policy model is an effective mean...