Wireless communication receiving window prediction method, device and wireless communication equipment
A receiving window prediction technology for wireless communication, applied in wireless communication, devices dedicated to receivers, sustainable communication technology, etc. It addresses the inability to adapt to changes in the communication environment and high power consumption, with the effects of ensuring reliable reception, reducing reception power consumption, and avoiding redundant reception.
Examples
Embodiment 1
[0065] The deep reinforcement learning task can be implemented using various existing or future deep reinforcement learning techniques. In a preferred embodiment, deep reinforcement learning is used to obtain the current receiving strategy or the optimal receiving strategy as follows. The state space S (s ∈ S) formed by the state information s, the action space A (a ∈ A) formed by the actions a that the wireless communication device performs to receive data, the state transition rule function P, and the reward function R are modeled as a predetermined decision process. The reinforcement learning task is then performed to generate an action policy, and the action policy is iteratively updated according to multi-step discounted cumulative rewards until convergence, which yields the optimal receiving strategy or the current receiving strategy;
[0066] Here, the receiving strategy is the probability distribution over the receiving actions in each state of the state space.
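The two quantities named above, the multi-step discounted cumulative reward and a receiving strategy expressed as a probability distribution over actions, can be sketched as follows. This is a minimal illustration and not the patent's implementation: the function names `discounted_return` and `sample_action`, the discount factor `gamma`, and the dictionary layout of `policy` are all assumptions introduced here.

```python
import random


def discounted_return(rewards, gamma):
    """Multi-step discounted cumulative reward: sum over k of gamma**k * rewards[k].

    `rewards` is the sequence of per-step rewards and `gamma` (assumed to lie
    in [0, 1]) is the discount factor; both names are hypothetical.
    """
    total = 0.0
    # Accumulate from the last step backwards so each earlier step
    # discounts everything that follows it by one more factor of gamma.
    for r in reversed(rewards):
        total = r + gamma * total
    return total


def sample_action(policy, state, rng=random):
    """Draw an action from policy[state], a dict mapping action -> probability.

    The nested-dict layout is an assumption used only to illustrate that a
    strategy is a probability distribution over actions in each state.
    """
    actions = list(policy[state].keys())
    weights = list(policy[state].values())
    return rng.choices(actions, weights=weights, k=1)[0]
```

For example, `discounted_return([1, 1, 1], 0.5)` evaluates to `1 + 0.5 + 0.25 = 1.75`, and a strategy that puts all probability on one action always returns that action from `sample_action`.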
[0067] The pr...
Embodiment 2
Embodiment 3
[0074] If the reception of the data does not meet the preset reliability requirements, switch to the training mode. The unmet reliability
[0075] The wireless communication receiving window prediction method, the device, and the wireless communication equipment of the above embodiments are further described below.
[0077] The device may be in a utilization mode during normal operation. The protection module receives the information from the link control module
[0079] In addition, in a preferred embodiment, the link control module may not receive information from the protection module during the training process.
[0082] The action space is represented as A={0,1}, wherein the action a=0 indicates that radio frequency reception is turned off in the current time slot, and a=1
[0083] The state space is represented as a set S={1,2,…,i,…,N}, where state i indicates that the current time slot is the most recent time slot
[0084] The state transition rule function P is show...
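The finite state space S={1,…,N} and the binary action space A={0,1} described above lend themselves to a table indexed by (state, action). The sketch below uses generic tabular Q-learning as a stand-in, which is a simpler technique than the deep reinforcement learning the embodiment refers to. Everything in it is an assumption made for illustration: the helper names, the learning rate `alpha`, the discount factor `gamma`, and the reading of a=1 as "reception on" (the source sentence is cut off). The transition rule P and reward R are not reproduced, so the caller supplies the observed reward and next state.

```python
ACTIONS = (0, 1)  # a=0: RF reception off (from the text); a=1: assumed to mean reception on


def make_q_table(n_states, actions=ACTIONS):
    """Zero-initialised table over S x A, with states numbered 1..n_states."""
    return {(s, a): 0.0 for s in range(1, n_states + 1) for a in actions}


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.9, actions=ACTIONS):
    """One standard Q-learning step for an observed transition (s, a, reward, s_next)."""
    best_next = max(q[(s_next, b)] for b in actions)
    q[(s, a)] += alpha * (reward + gamma * best_next - q[(s, a)])
    return q[(s, a)]


def greedy_policy(q, n_states, actions=ACTIONS):
    """Express the greedy choice in each state as a probability distribution.

    Ties are broken in favour of the first action listed in `actions`.
    """
    policy = {}
    for s in range(1, n_states + 1):
        best = max(actions, key=lambda a: q[(s, a)])
        policy[s] = {a: (1.0 if a == best else 0.0) for a in actions}
    return policy
```

A deep reinforcement learning variant would replace the table with a neural network that maps a state to action values or action probabilities; the table form is used here only because it keeps the relationship between S, A, and the resulting strategy easy to inspect.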