Unlock instant, AI-driven research and patent intelligence for your innovation.

A mobile app traffic statistics feature selection method

A feature selection method and traffic statistics technology, which is applied in the field of mobile app traffic statistics feature selection, can solve the problems of not analyzing feature stability and not being able to select feature subsets, and achieve the effect of avoiding performance loss and high discrimination ability

Active Publication Date: 2021-10-26
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although this invention eliminates redundant features for network traffic data, it does not have the stability of analyzing features and cannot be used to select a more stable feature subset

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A mobile app traffic statistics feature selection method
  • A mobile app traffic statistics feature selection method
  • A mobile app traffic statistics feature selection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0071] Such as figure 1 Shown, a kind of Mobile App traffic statistical characteristic selection method, comprises the following steps:

[0072] Step S1, obtaining the original traffic data set of the mobile App, extracting the flow statistical features of the mobile App traffic, obtaining the labeled data set LD for training, and the unlabeled data set UD to be classified;

[0073] Step S2, on the LD data set, use the information gain rate to evaluate the distinguishing ability of each flow statistical feature between classes;

[0074] Step S3, on the LD and UD data sets, calculate the value distribution of each flow statistical feature, use the Hellinger distance to evaluate the difference of the feature value distribution, and evaluate the drift degree of the flow statistical feature;

[0075] Step S4, using the degree of drift as a penalty factor for feature discrimination ability, and calculating the comprehensive evaluation value of flow statistical features;

[0076] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for selecting mobile App traffic statistics features, the method comprising: S1, acquiring the original mobile App traffic data set, extracting the flow statistical features of Mobile App traffic, acquiring a marked data set LD for training, and The unlabeled data set UD to be classified; S2. On the LD data set, use the information gain rate to evaluate the ability of each flow statistical feature to distinguish between classes; S3. On the LD and UD data sets, calculate each flow statistics The value distribution of the feature, use the Hellinger distance to evaluate the difference of the feature value distribution, and evaluate the drift degree of the flow statistical feature; S4, use the drift degree as the penalty factor of the distinguishing ability, and calculate the comprehensive evaluation value of the flow statistical feature; S5, based on the comprehensive Evaluation value, searching for a subset of flow statistical features with strong discriminative power and low degree of drift. The method of the invention relates to mobile App traffic classification technology in the field of mobile Internet traffic measurement, reduces data dimension, and improves classification robustness.

Description

technical field [0001] The invention relates to the technical field of traffic classification in the field of traffic measurement, in particular to a method for selecting mobile App traffic statistics features. Background technique [0002] The mobile app traffic classification technology based on machine learning is described as follows: the original traffic is grouped based on the five-tuple {source IP, destination IP, source port, destination port, transport layer protocol} group flow, and the traffic is extracted from the statistical characteristics of the flow to establish a flow sample set. for training classification models. A variety of flow statistical features have been proposed. For example, Moore et al. proposed 248 flow statistical features in 2005 (A.Moore, D.Zuev, M.Crogan. Discriminators for use in flow-based classification. Queen Mary and Westfield College, Department of ComputerScience, 2005.), including: statistical characteristics of packet size (minimum...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/851
CPCH04L47/2441
Inventor 王若愚张凌刘珍
Owner SOUTH CHINA UNIV OF TECH