Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Recommendation system based on meta-learning and reinforcement learning

A technology of reinforcement learning and recommendation systems, applied in the field of recommendation systems based on meta-learning and reinforcement learning, to achieve the effects of multi-parameter adjustable, good portability and strong scalability

Active Publication Date: 2020-05-26
BEIHANG UNIV
View PDF5 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the other hand, with the rapid development of meta-learning in recent years, what it needs to solve is how to achieve good results when learning in the face of few samples. Its combination with reinforcement learning has also achieved good results on some issues. effect, so we use reinforcement learning and meta-learning methods to solve the problem of recommendation learning to quickly learn user preferences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Recommendation system based on meta-learning and reinforcement learning
  • Recommendation system based on meta-learning and reinforcement learning
  • Recommendation system based on meta-learning and reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The following is a preferred embodiment of the present invention and the technical solutions of the present invention are further described in conjunction with the accompanying drawings, but the present invention is not limited to this embodiment.

[0036] Based on the method of meta-learning and reinforcement learning, the present invention learns the user's basic information and tour records or purchase records, extracts the hidden rules, recommends the user, and compares the user's reaction to the recommended content (like or dislike) As model feedback, the strategy learned by the model is adjusted to give new recommended content for the user.

[0037] The overall process of the model is as follows figure 2 As shown, firstly, the model generates a recommended strategy for a user’s characteristic data input, and then calculates the error according to this strategy, and then optimizes the parameters of the model through the optimization method mentioned in the interna...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Through methods in the fields of meta-learning, reinforcement learning and data mining, a recommendation system based on meta-learning and reinforcement learning is realized, an internal updating module and a meta-updating module model are defined and constructed, and two modules form a system model; the system model training process is as follows: a system model is trained, a recommendation strategy is generated for characteristic data input of primary user; an error is solved according to the strategy; by an optimization method mentioned in an internal updating process, optimizing parameters of a model through an optimization method mentioned in the internal updating process to obtain a next strategy, a final strategy is acquired according to a set internal updating step number, finallythe error is generated by inputting feedback of a user to recommended contents, then an initial model is derived and a new model is acquired by updating. After the model training is completed, the system receives the feature data ofa user, recommends push content to the user, and collects feedback of the user to the content.

Description

technical field [0001] The invention relates to the fields of meta-learning, reinforcement learning and data mining, in particular to a recommendation system based on meta-learning and reinforcement learning. Background technique [0002] At present, the recommendation system is almost everywhere. Many apps use the recommendation system, such as travel, shopping, video, news, social networking, etc. We can all see the shadow of the recommendation system, which is closely related to our daily life. The reason is that for users, facing massive amounts of data, they always hope to quickly find information that is interesting or valuable to them; for information producers, they always hope that their content can attract more customers, but different customers have different preferences, so for different users, there should be different recommended content. Although the recommendation system has many benefits for both enterprises and users, if the performance of the recommendati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06Q30/06G06N20/00
CPCG06Q30/0631G06N20/00Y02T10/40
Inventor 李建欣张帅朱琪山杨继远周号益
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products