Recommendation method, device, electronic device and storage medium based on action pruning

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A recommendation method and action technology, applied in the computer field, can solve problems such as slow convergence speed of reinforcement learning, achieve the effects of accelerating convergence speed, improving learning efficiency, and improving user experience

Active Publication Date: 2022-02-25

INST OF AUTOMATION CHINESE ACAD OF SCI

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present invention provides a recommendation method, device, electronic equipment and storage medium based on action pruning, which is used to solve the defect of slow convergence speed of reinforcement learning in the prior art, and to improve the convergence speed of reinforcement learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0038] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, the described embodiments are part of the embodiments of the present invention , but not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0039] Traditional reinforcement learning algorithms converge slowly in practical applications and can only be applied to learning tasks dealing with small-scale action spaces. In this regard, the embodiment of the present invention provides a brand-new technology in the field of reinforcement learning, that is, the action pruning technology. Before each decision of the agent, the candidate a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present invention provides a recommendation method, device, electronic device and storage medium based on action pruning. The status and rating prediction model corresponding to the recommended content predicts the rating of each content to be recommended, and recommends to target users based on the rating of each content to be recommended; the rating prediction model is obtained by reinforcement learning. In the process of reinforcement learning, the rating prediction The model obtains the regret value of each candidate score in the current sample state from the regret value set, and predicts the score based on the candidate score whose regret value is greater than the preset threshold. The regret value set stores the historical state and its corresponding regret value. The regret value is based on The advantages of each candidate score in the historical state are determined. The historical state is the sample state before the current sample state, which speeds up the convergence speed of reinforcement learning and realizes personalized and accurate recommendations for users.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a recommendation method, device, electronic equipment and storage medium based on action pruning. Background technique [0002] Due to the advantages of being able to perceive the dynamic environment and obtain rewards from the environment to continuously adapt to the environment, reinforcement learning is especially suitable for business scenarios involving interactions, such as recommending content to users. [0003] However, in practical applications, the traditional reinforcement learning algorithm only improves the policy based on the immediate reward obtained from the environment, which has the problem of slow convergence speed, and can only be applied to learning tasks dealing with small-scale action spaces. Therefore, how to improve the convergence speed of reinforcement learning is an important issue to be solved urgently in the industry. Contents of the inve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G06F16/9535G06N20/00

Inventor张俊格白栋栋黄凯奇

OwnerINST OF AUTOMATION CHINESE ACAD OF SCI

Recommendation method, device, electronic device and storage medium based on action pruning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology