Deep reinforcement learning-based recommendation method and system with negative feedback

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and recommendation methods, applied in neural learning methods, data processing applications, biological neural network models, etc., can solve the problems of slow learning rate and low accuracy

Pending Publication Date: 2020-08-11

HUAZHONG UNIV OF SCI & TECH

View PDF11 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In view of the above defects or improvement needs of the prior art, the present invention provides a recommendation method and system based on deep reinforcement learning with negative feedback, aiming at solving the problems of too slow learning rate and low accuracy of existing user recommendation methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0047] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0048] The present invention provides a method such as figure 1 shown, including:

[0049] S1. The data acquisition based on the deep reinforcement learning recommendation method with negative feedback mainly depends on the behavior information of the users in the e-commerce website when they visit the website and the feature information of the product. Extract and divide user behavior data; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a deep reinforcement learning-based recommendation method and system with negative feedback, and the method comprises the steps: collecting commodity feature information, and collecting user behavior data to obtain positive and negative feedback behavior vectors of a user; performing feature extraction on the positive and negative feedback behavior vectors of the user through a feature extraction network model to obtain a positive and negative feedback feature mixed state vector of the user; training a deep deterministic strategy gradient model composed of a strategy network and an estimation network by using the positive and negative feedback feature mixed state vector of the user until the model converges; and generating a positive and negative feedback feature mixed state vector according to historical behaviors of a user needing work recommendation, and generating a user recommendation commodity list for the user for selection through the trained deep deterministic strategy gradient model to complete the user work recommendation. The parameter updating of the related neural network can be delayed, so that the correlation between the networks is reduced,and the training speed and accuracy of the recommendation method are improved.

Description

technical field [0001] The invention belongs to the technical field of data mining and recommendation, and more specifically relates to a recommendation method and system based on deep reinforcement learning with negative feedback. Background technique [0002] In recent years, with the vigorous development of e-commerce, the technology in the field of user recommendation has also developed rapidly. For example, various technologies based on collaborative filtering, machine learning, and deep learning have been applied to recommendation methods. However, traditional recommendation methods have various problems, such as methods based on collaborative filtering, methods based on traditional machine learning, or methods based on deep learning. They all have the characteristics of static recommendation algorithms, that is, they cannot dynamically perform React, and most methods follow a fixed algorithm similar to greed to make recommendations, which will make them pay too much a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06Q30/02G06N3/04G06N3/08

CPCG06Q30/0255G06Q30/0254G06N3/08G06N3/044G06N3/045

Inventor 李玉华李鑫李瑞轩辜希武

Owner HUAZHONG UNIV OF SCI & TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Deep reinforcement learning-based recommendation method and system with negative feedback

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology