Diversified recommendation method and system based on reinforcement learning and storage medium
A technology for reinforcement learning and recommendation methods, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as difficulty in achieving global optimality, difficulty in training samples, and difficulty in scoring formulas, and achieve the effect of maximizing long-term benefits
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0053] As a preferred implementation manner, step S1 specifically includes:
[0054] Input the labeled training sample set, which contains supervised samples; determine and initialize the algorithm parameters, including determining the recommendation list length T, exploring the probability decay coefficient ξ, the supervision loss function coefficients λ and τ, and initializing each parameter.
[0055] As a preferred implementation manner, in step S1, the method for obtaining training samples includes:
[0056] Based on LSTM to generate a recommendation list, the process is as follows:
[0057] a) Input a user's interest feature vector and candidate item set, and initialize the LSTM hidden state and decision sequence;
[0058] b) Input user interest vector to LSTM as state;
[0059] c) Process the candidate items one by one, and calculate the selection probability of each item. When the maximum selection probability is less than the exploration probability, the random colle...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


