A feature selection method for mobile users outbound based on fisher score and approximate Markov blanket

A feature selection method and technology for mobile users, applied in character and pattern recognition, special data processing applications, instruments, etc., can solve the problems of easy deletion of useful features, easy overfitting, poor classification performance, etc., and improve the accuracy of the model. performance, improve mining efficiency, and achieve the effect of dimensionality reduction

Active Publication Date: 2022-06-03
CHONGQING UNIV OF POSTS & TELECOMM
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Mobile data generally contains high-dimensional features and is non-linear data. When the number of samples is limited, if a large number of features are used to design a classifier, the computational overhead is high, the classification performance is poor, and overfitting is prone to occur.
However, the above methods have the disadvantages of being easy to fall into local optimum and easy to delete useful features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A feature selection method for mobile users outbound based on fisher score and approximate Markov blanket
  • A feature selection method for mobile users outbound based on fisher score and approximate Markov blanket
  • A feature selection method for mobile users outbound based on fisher score and approximate Markov blanket

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] S1: Obtain the surfing, call, track and attribute data of the mobile user sample, mark the user sample, and construct

[0082] S21: Outbound feature extraction, including: 1) retrieving APP data that provides outbound services, using domain names and keywords as associations

[0084]

[0086] x

[0090]

[0091]

[0092]

[0093]

[0094]

[0099] First, calculate the MIC value of the feature: transform the random variables x, y into a scatter plot and distribute them in a two-dimensional space, using k × s

[0100]

[0101]

[0104]

[0105]

[0106]

[0108]

[0111]

[0116] Output: optimal feature subset F

[0117] S51: initialize the feature set

[0121] S55: For all features x in F, calculate features x and x in turn

[0126] S62: Divide the total samples into two sets, 80% of the total samples are used as the training set train, and 20% are used as the test set

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for selecting a mobile user's outbound feature based on Fisher points and an approximate Markov blanket, which belongs to the field of data mining. Firstly, the Fisher criterion is used to retain the features with strong classification ability, and the irrelevant and weakly relevant features are eliminated. Secondly, the two measurement methods of maximum information coefficient MIC and symmetric uncertainty SU are combined, and the correlation measurement standard MSCC is designed, and the irrelevant features are further eliminated by using the MSCC standard. Finally, combined with the MSCC metric, the redundant features in the Fisher candidate feature set are eliminated by using the approximate Markov-Blanket judgment condition, and finally the optimal feature subset with a smaller dimension is obtained. The invention can effectively select the outbound features of the mobile users and improve the classification accuracy of the model.

Description

A mobile user outbound feature selection based on Fisher score and approximate Markov blanket choose method technical field The invention belongs to the technical field of data mining, relate to a kind of movement based on Fisher score and approximate Markov blanket User outbound feature selection method. Background technique With the advent of the mobile Internet era, the scale of mobile Internet users continues to increase, and people's living and working styles are It was very different before, the high penetration rate of mobile devices brought the explosive growth of mobile data. Mobile data has data sampling The advantages of comprehensive and real-time performance are quite authoritative in the field of trend analysis and potential user mining. Digging provides good convenience. Feature selection is a key data preprocessing step in machine learning and data mining, which is to filter the most The process of reducing the feature dimension of the dataset ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2458G06K9/62
CPCG06F16/2465G06F18/214G06F18/24
Inventor 许国良张轩王超李万林雒江涛易燕
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products