Feature extraction method and device based on reinforcement learning and computer device

A feature extraction and reinforcement learning technology, applied in the computer field, can solve problems such as low extraction efficiency, and achieve the effect of improving the quality of state features and improving the efficiency of feature extraction.

Active Publication Date: 2020-02-14
TENCENT TECH (SHENZHEN) CO LTD
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Based on this, it is necessary to provide a feature extraction method, device, computer-readable storage medium and computer equipment based on reinforcement learning for the technical problem of low efficiency of feature extraction used in model training in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature extraction method and device based on reinforcement learning and computer device
  • Feature extraction method and device based on reinforcement learning and computer device
  • Feature extraction method and device based on reinforcement learning and computer device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0020] figure 1 It is an application environment diagram of the feature extraction method based on reinforcement learning in one embodiment. refer to figure 1 , the feature extraction method can be applied to the feature extraction system, and the feature extraction system performs feature extraction for different learning objects, and the extracted features can be applied to the insurance business platform for the insurance business platform to calculate whether to provide insurance and insurance prices to users. Wherein, the feature extraction system includes a termina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a feature extraction method and device based on reinforcement learning, and a computer device. The method comprises the steps of obtaining a feature extraction code of a learning object; wherein the feature extraction code is determined according to manual writing; acquiring state features of the learning object according to the feature extraction code; training a deep network structure based on reinforcement learning by adopting the state features; obtaining an optimal network structure and an optimal weight parameter of the trained deep network structure; generatingan optimal feature extraction strategy according to the optimal network structure and the optimal weight parameter; wherein the optimal feature extraction strategy is used for extracting portrait features of the insurance service user so as to analyze insurance demands of the insurance service user according to the portrait features. By adopting the method, the feature extraction codes are set tobe applied to model training, so that the feature extraction efficiency can be improved, namely, a modeling effect is used as a learning reward to stimulate a computer to continuously optimize a learning strategy so as to learn a new feature extraction mode.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a feature extraction method, device, computer-readable storage medium and computer equipment based on reinforcement learning. Background technique [0002] Reinforcement learning, also known as trial-and-error learning, is a machine learning algorithm that allows the agent to interact continuously in the environment of the learning object and learn according to the feedback incentive (reward) of the environment. The learning algorithm is not based on Any prior knowledge can be learned completely autonomously. According to different learning objects, there can be different agents. For example, when the learning object is insurance business, the agent can be an insured user in the insurance business, etc. [0003] When traditional reinforcement learning such as Deep Q Network (DQN) trains its own neural network model, it completely uses the data obtained by the machine...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00G06N3/08G06Q40/08
CPCG06N20/00G06N3/08G06Q40/08
Inventor 陈尧
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products