Hierarchical reinforcement learning recommendation system based on dynamic recursion mechanism

A recommendation system, dynamic recursion technology, applied in the field of recommendation system of hierarchical reinforcement learning, can solve problems such as affecting the performance of the recommendation system, not being able to modify user portraits well, and uncertainty of expected returns.

Active Publication Date: 2021-04-02
XIAMEN UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, the actions selected under this policy are random, resulting in not always being able to modify user portraits well
On the other hand, state transition probabilities are also stochastic (i.e. each current state has multiple possible next states), which makes the expected payoff from the environment uncertain
Therefore, both kinds of randomness can affect the performance of recommender systems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hierarchical reinforcement learning recommendation system based on dynamic recursion mechanism
  • Hierarchical reinforcement learning recommendation system based on dynamic recursion mechanism
  • Hierarchical reinforcement learning recommendation system based on dynamic recursion mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044]In order to further explain the embodiments, the present invention is provided with the drawings. These drawings are part of the disclosure of the present invention, which is mainly used to illustrate the embodiments of the embodiments, and can be combined with the description of the specification. In conjunction with reference to these, those skilled in the art will understand other possible embodiments and the advantages of the present invention. The components in the figures are not drawn to scale, while similar component symbols are often used to represent similar components.

[0045]The invention is further illustrated in conjunction with the accompanying drawings and specific embodiments.

[0046]Such asfigure 1 As shown, the present invention proposes a recommendation system for dynamic baseline and recursion rectification learning (referred to as HRL / DR). Through a new strategy gradient method, the policy iteration is improved. In the hierarchical task in the user portrait...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hierarchical reinforcement learning recommendation system based on a dynamic recursion mechanism. The system comprises a user portrait corrector which employs a strategy gradient method of the dynamic recursion mechanism and introduces a parameter dynamic sparse weight to delete noise data and modify a user portrait, wherein the strategy gradient method of the dynamic recursion mechanism comprises a dynamic baseline and recursion reinforcement learning based on a time sequence context, the dynamic baseline being used for improvement of a learning strategy for total income by adopting a dynamic sparse weight; an attention mechanism used for automatically adjusting changes of user preferences; and a recommendation model used for recommending the most relevant articles to the user through the attention mechanism. According to the recommendation system, a parameter dynamic sparse weight is introduced into the strategy gradient method, so that an intelligent agentselects an optimal behavior under a global optimal strategy; and secondly, in combination with hierarchical reinforcement learning of the time context, the method can more reliably converge, so that the stability of model prediction is improved.

Description

Technical field[0001]The present invention relates to intelligent recommendation techniques based on hierarchical strengthening, and more particularly to a recommended system based on dynamic recursive mechanism.Background technique[0002]While the hierarchical reinforcement learning, HRLs have made significant progress in the recommendation system, their forecast instability is a key defect, mainly due to the performance of strengthening learning. On the one hand, due to the randomness of the strategy, there are several possible random behavior in each state. For example, the action selected under this policy is random, causing the user portrait that cannot be modified well. On the other hand, the state transition probability is also random (ie, there are multiple possible next states per current state), which makes the expected income from the environment uncertain. Therefore, these two random properties will affect the performance of the recommended system.Inventive content[0003]I...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9535G06Q30/06G06N20/00
CPCG06F16/9535G06Q30/0631G06N20/00
Inventor 林元国林凡曾文华夏侯建兵张志宏
Owner XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products