Adaptive network flow concept drift detection method based on information entropy

An adaptive network, drift detection technology, applied in the field of data processing, can solve the problem of category imbalance, not well applicable, type attribute labels need manual labeling, etc., to solve concept drift, good classification performance and generalization ability Effect

Pending Publication Date: 2019-11-12
INSPUR ARTIFICIAL INTELLIGENCE RES INST CO LTD SHANDONG CHINA
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention is based on the problem that the concept drift detection method based on the existing classification accuracy cannot be well applied in network traffic detection due to category imbalance, and the type attribute labels required for evaluating the classification error rate in the traffic identification process need to be manually marked , and difficult to obtain, providing an adaptive network flow concept drift detection method based on information entropy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Adaptive network flow concept drift detection method based on information entropy
  • Adaptive network flow concept drift detection method based on information entropy
  • Adaptive network flow concept drift detection method based on information entropy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0058] This embodiment proposes an adaptive network flow concept drift detection method based on information entropy. This method uses sliding window technology to discretize flow feature attributes into several branches for new and old data flows, and multiple flow feature Compare together, and compare the difference between the new and old data stream sliding windows by counting the information entropy of each feature attribute and each branch.

[0059] combined with figure 1 , the specific implementation process of the detection method includes:

[0060] I, utilize the Hoeffding boundary theory to obtain the threshold calculation formula of the sliding window size;

[0061] Ⅱ. Use the Tie-breaking method to determine the size of the sliding window;

[0062] Ⅲ. Using information entropy to calculate the entropy value of the sliding window of the new and old data streams;

[0063] Ⅳ. According to the threshold value and the information entropy difference between the old an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an adaptive network flow concept drift detection method based on information entropy, and relates to the technical field of data processing. The concept drift problem caused byflow characteristic changes due to time lapse and different network environments is solved. By using sliding window techniques, aiming at new and old data streams, discretizing the flow characteristic attributes into a plurality of branches; comparing the plurality of flow characteristics together; comparing differences of new and old data stream sliding windows by counting information entropiesof each characteristic attribute and each branch; and when specific detection is carried out, the method comprises the following steps: firstly, solving a threshold calculation formula of the size ofa sliding window by utilizing a Hoeffding boundary theory; then, determining the size of a sliding window by utilizing a Tie-buffering method; according to the method, firstly, obtaining a new data flow sliding window and an old data flow sliding window, then calculating entropy values of the new data flow sliding window and the old data flow sliding window through information entropy, finally, jusging whether concept drifting happens to data or not according to threshold values and information entropy difference values of the new window and the old window, thus concept drifting can be effectively detected, classifiers are updated, and good classification performance and generalization capacity are shown.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to an adaptive network flow concept drift detection method based on information entropy. Background technique [0002] With the rapid development of the mobile Internet, new services in web browsing, streaming media, and social networks are constantly emerging. At the same time, the user network security requirements make the proportion of encrypted traffic continue to increase, making traditional traffic classification methods face severe challenges. However, the machine learning classification method based on flow characteristics will cause concept drift due to differences in the distribution of services carried by traffic in different time periods and different regions. [0003] Concept drift refers to the phenomenon in predictive analytics and machine learning where the statistical properties of the target variable change in unforeseen ways over time. Over time, the pre...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/851
CPCH04L47/2441H04L47/2483
Inventor 安程治李锐段强
Owner INSPUR ARTIFICIAL INTELLIGENCE RES INST CO LTD SHANDONG CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products