Feature extraction method and device based on reinforcement learning and computer device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A feature extraction and reinforcement learning technology, applied in the computer field, can solve problems such as low extraction efficiency, and achieve the effect of improving the quality of state features and improving the efficiency of feature extraction.

Active Publication Date: 2020-02-14

TENCENT TECH (SHENZHEN) CO LTD

View PDF8 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Based on this, it is necessary to provide a feature extraction method, device, computer-readable storage medium and computer equipment based on reinforcement learning for the technical problem of low efficiency of feature extraction used in model training in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0020] figure 1 It is an application environment diagram of the feature extraction method based on reinforcement learning in one embodiment. refer to figure 1 , the feature extraction method can be applied to the feature extraction system, and the feature extraction system performs feature extraction for different learning objects, and the extracted features can be applied to the insurance business platform for the insurance business platform to calculate whether to provide insurance and insurance prices to users. Wherein, the feature extraction system includes a termina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a feature extraction method and device based on reinforcement learning, and a computer device. The method comprises the steps of obtaining a feature extraction code of a learning object; wherein the feature extraction code is determined according to manual writing; acquiring state features of the learning object according to the feature extraction code; training a deep network structure based on reinforcement learning by adopting the state features; obtaining an optimal network structure and an optimal weight parameter of the trained deep network structure; generatingan optimal feature extraction strategy according to the optimal network structure and the optimal weight parameter; wherein the optimal feature extraction strategy is used for extracting portrait features of the insurance service user so as to analyze insurance demands of the insurance service user according to the portrait features. By adopting the method, the feature extraction codes are set tobe applied to model training, so that the feature extraction efficiency can be improved, namely, a modeling effect is used as a learning reward to stimulate a computer to continuously optimize a learning strategy so as to learn a new feature extraction mode.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a feature extraction method, device, computer-readable storage medium and computer equipment based on reinforcement learning. Background technique [0002] Reinforcement learning, also known as trial-and-error learning, is a machine learning algorithm that allows the agent to interact continuously in the environment of the learning object and learn according to the feedback incentive (reward) of the environment. The learning algorithm is not based on Any prior knowledge can be learned completely autonomously. According to different learning objects, there can be different agents. For example, when the learning object is insurance business, the agent can be an insured user in the insurance business, etc. [0003] When traditional reinforcement learning such as Deep Q Network (DQN) trains its own neural network model, it completely uses the data obtained by the machine...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N20/00G06N3/08G06Q40/08

CPCG06N20/00G06N3/08G06Q40/08Y02T10/40

Inventor 陈尧

Owner TENCENT TECH (SHENZHEN) CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Feature extraction method and device based on reinforcement learning and computer device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology