C4.5 decision tree algorithm-based specific user mining system and method thereof

A specific user and decision tree technology, applied in marketing, market data collection, etc., can solve the problems of large amount of data, difficulty in comprehensive statistics of user interest related indicators, and multiple data dimensions, so as to achieve fast calculation speed and shorten calculation cycle Effect

Inactive Publication Date: 2016-12-14
WUHAN DOUYU NETWORK TECH CO LTD
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above-mentioned manual screening methods often have a large degree of subjectivity. In addition, in the scenario of massive data, the data often has many dimensions and a large amount of data. It is difficult to make comprehensive statistics on the indicators related to user interest.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • C4.5 decision tree algorithm-based specific user mining system and method thereof
  • C4.5 decision tree algorithm-based specific user mining system and method thereof
  • C4.5 decision tree algorithm-based specific user mining system and method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0038] see figure 1 As shown, the present invention provides a specific user mining system based on the C4.5 decision tree algorithm, which includes a sample selection module, a behavior attribute statistics module, a sample processing module and an algorithm platform.

[0039] The sample selection module is used to select user samples, and the user samples are divided into paid user samples and unpaid user samples according to class labels, wherein the paid user samples are marked as 1, and the unpaid user samples are marked as 0.

[0040] Behavior attribute statistics module, which is used to count the attribute values ​​of the classification attributes of user samples. The classification attributes in the present invention mainly include viewing time, viewing times, number of bullet screens, number of virtual gift gifts sent, number of virtual gift gifts r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a C4.5 decision tree algorithm-based specific user mining system and a method thereof, and relates to the live broadcast website data mining field. The system comprises a sample selection module used for selecting the user samples, wherein the user samples are divided into the paying user samples and the non-paying user samples according to the class labels; a behavior attribute statistics module used for gathering the attribute values of the classification attributes of the user samples; a sample processing module used for carrying out the normalization processing on the attribute values of the classification attributes as the training sample data; and an algorithm platform used for receiving the training sample data, wherein the algorithm platform comprises a C4.5 decision tree algorithm, provides an algorithm interface for the C4.5 decision tree algorithm, and trains a C4.5 decision tree model based on the training sample data and the C4.5 decision tree algorithm. The C4.5 decision tree algorithm-based specific user mining system of the present invention can gather the user interest degree correlated indexes comprehensively on the conditions of many data dimensions and large data size, and is convenient to mine the specific users.

Description

technical field [0001] The invention relates to the field of live website data mining, in particular to a specific user mining system and method based on a C4.5 decision tree algorithm. Background technique [0002] In recent years, with the rapid development of the live broadcast industry, the users of live broadcast websites have also experienced explosive growth. How to quickly and effectively screen out potential users from all users on the site, so that operators can make further refined marketing plans for specific users, and improve the conversion rate of users' payment is an unavoidable problem for every live broadcast website. [0003] At present, the traditional user interest degree mining is mostly manual extraction, and the user interest degree is artificially screened based on personal experience and effective behavioral characteristics. However, the above-mentioned manual screening methods often have a relatively high degree of subjectivity. In addition, in th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q30/02
CPCG06Q30/02G06Q30/0201
Inventor 龚灿
Owner WUHAN DOUYU NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products