Information flow recommendation method and device based on deep reinforcement learning, equipment and medium

A technology of reinforcement learning and recommendation methods, applied in the field of information processing, can solve problems such as lack of user interaction, low improvement of user access experience, and disinterest in recommended information, so as to increase the duration of visits and the frequency of user visits, and increase the number of user visits Frequency, the effect of increasing the length of visits

Pending Publication Date: 2020-02-28
CHINA PING AN LIFE INSURANCE CO LTD
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing recommendation system lacks interactivity with users, which may easily lead to users not being inter...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information flow recommendation method and device based on deep reinforcement learning, equipment and medium
  • Information flow recommendation method and device based on deep reinforcement learning, equipment and medium
  • Information flow recommendation method and device based on deep reinforcement learning, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein in the specification of the application are only for the purpose of describing specific embodiments, and are not intended to limit the application.

[0054] It should be noted that the terms "comprising", "comprising" and "having" in the specification and claims of the present application and the above drawings and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed steps or units, but optionally also includes unlisted steps or units, or optionally further includes For other steps or units inherent in these processes, methods, products or devices. In the claims, description and drawings of this application, re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses an information flow recommendation method and a device based on deep reinforcement learning, equipment and a medium, and relates to the technical field of information processing. The method comprises the steps of collecting a historical click sequence of a target user; calling a preset actor neural network and a preset critic neural network; generating a user recommendation list, and displaying the user recommendation list to the target user to obtain feedback result data and a new historical click sequence generated after feedback; calculating a timedifference error; updating parameters in the critic neural network and the actor neural network; and generating a new user recommendation list, and displaying the new user recommendation list until feedback result data of the target user for the new user recommendation list and a new historical click sequence generated after feedback cannot be obtained. According to the method, the interactivity between the recommendation system and the user is enhanced, the feedback of the user is utilized in real time, the recommendation engine can be continuously optimized, the recommendation quality is improved, the user experience is improved, and the user is effectively attracted to remain.

Description

technical field [0001] The embodiments of the present application relate to the field of information processing technology, in particular, a method, device, device and medium for recommending information flow based on deep reinforcement learning. Background technique [0002] With the development of artificial intelligence, more and more product applications use artificial intelligence to improve the interactive experience between users and products, such as recommending products of interest to users based on their interest characteristics and purchasing behavior. With the continuous expansion of the scale of e-commerce and the rapid growth of the number and types of commodities, customers need to spend a lot of time to find the commodities they want to buy. This process of browsing a large amount of irrelevant information and products will lead to the continuous loss of consumers who are drowning in the problem of information overload. In order to solve these problems, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/9535G06F16/9538G06N3/04G06N3/08
CPCG06F16/9535G06F16/9538G06N3/08G06N3/045
Inventor 罗振煜
Owner CHINA PING AN LIFE INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products