Network traffic classification method based on feature strong correlation

A network traffic and classification method technology, which is applied in the field of network traffic classification based on strong correlation of features, can solve the problems of decreased utilization of network resources and large throughput, and achieve good promotion ability and adaptability, improve efficiency, and improve accuracy Effect

Inactive Publication Date: 2019-07-12
NANJING UNIV OF POSTS & TELECOMM
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, while promoting the further development of the network, it also brings a lot of problems
The ever-expanding data scale and the increasing number of application types will lead to

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network traffic classification method based on feature strong correlation
  • Network traffic classification method based on feature strong correlation
  • Network traffic classification method based on feature strong correlation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be further described below in conjunction with the accompanying drawings.

[0036] figure 1 It is a principle flowchart of the present invention, including steps:

[0037] (1) Perform feature extraction on the input data to form a multi-dimensional feature vector set: for data extraction, the n features are respectively f 1 , f 2 ,..., f n , and then the eigenvector formed by it is F={f 1 , f 2 ,..., f n}.

[0038] (2) Calculate the correlation between features: Assuming that X and Y represent two random variables, the mutual information formula of the two is as follows:

[0039]

[0040] Among them, Ω X and Ω Y are the sample spaces of random variables X and Y, respectively, p(x, y) is the joint probability density function, and p(x) and p(y) are the marginal probability density functions.

[0041] The correlation of features R S The calculation formula is:

[0042]

[0043] Among them, c ∈ C = {+1, -1} represents a class va...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a network traffic classification method based on feature strong correlation, which comprises the following four steps of: inputting data to be trained, and performing feature extraction on the input data to form a multi-dimensional feature vector set; calculating the correlation between the features by using the mutual information between the features and the response variables; calculating redundancy among the features according to the mutual information, and selecting the feature with the highest score as a final feature vector through iterative calculation; and constructing a network traffic classification model based on feature strong correlation according to the classification target and obtaining a classification result. According to the method, the correlationbetween features can be fully utilized, and the feature with the maximum correlation and the minimum redundancy is extracted in the learner training process. Under the same classification model, theclassification efficiency can be effectively improved on the premise that the classification precision is guaranteed, and the defect caused by the fact that correlation between features is not considered in an existing feature selection method based on heuristic search is overcome.

Description

technical field [0001] The invention relates to data processing and machine learning, in particular to a method for classifying network traffic based on strong correlation of features. Background technique [0002] With the development of network technology, the traffic in the network increases very rapidly. However, while promoting the further development of the network, it also brings many problems. The ever-expanding data scale and increasing application types will lead to a decrease in the utilization rate of network resources. Some links have a high throughput, while some links are almost idle, which will lead to a decrease in the utilization of network resources. [0003] Therefore, in the past few years, network traffic classification techniques have attracted more and more attention. From a security perspective, quickly identifying malicious traffic will help secure containment and isolate attackers. From a QoS perspective, accurate classification of different tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/2411G06F18/214
Inventor 张登银吴思远丁飞赵莎莎张恩轩郭诗源
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products