Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Network traffic identification method based on random sampling multi-classifier

A technology of network traffic and multi-classifiers, applied in the field of network traffic identification based on random sampling multi-classifiers, to improve classification accuracy and efficiency, realize adaptive traffic classification and identification processing, and improve accuracy

Inactive Publication Date: 2017-07-28
SOUTHEAST UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Decision trees are used to deal with classification problems. The applicable target variable belongs to categorical variables. It has also been extended to deal with continuous variables, such as the CART model. However, different decision tree algorithms have different requirements and restrictions on data types.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network traffic identification method based on random sampling multi-classifier
  • Network traffic identification method based on random sampling multi-classifier

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0034] A network traffic identification method based on random sampling multi-classifiers, characterized in that:

[0035] Step 1: Set the number of classifiers to T, (11 , B 2 ,...,B T ;Set the number of sampling classifiers to t, 1<=t<=T; give the network flow data set A with network traffic classification labels, and give the network flow record set N without network traffic classification labels; set the loop variable i , go to step 2;

[0036] Step 2: Set i equal to 1, if the network flow data set A with network traffic classification labels is empty, go to step 5, otherwise select a network flow data set A with network traffic classification labels label the network flow record F, and delete the selected network flow record F with the network flow classification label from the network flow data set A with the network flow classification label, and enter step 3;

[0037] Step 3: Generate a random number S between 0 and 1, if S is greater than p, go to step 4, otherwise...

Embodiment 2

[0046] A network traffic identification method based on random sampling multi-classifiers, characterized in that:

[0047] Step 1 (1): Set the number of classifiers to T, T=5; set the random sampling ratio p, P=0.5; set 5 empty sets B 1 , B 2 , B 3 , B 4 , B 5 ; Set the number of sampling classifiers to 3, see figure 2 ; Given the network flow data set A with network traffic classification labels, and the network flow record set N without network traffic classification labels; set the loop variable i=0,

[0048] Set A has 2 application types, each flow includes 2 features,

[0049] A={(www, 10,8), (ftp, 1,3), (ftp, 1,4), (www, 9,8)}

[0050] There are 2 records in set N,

[0051] N={(10,10),(2,4)},

[0052] Go to step two (2);

[0053] Step 2 (2): Set i equal to 1, select a network flow record (www, 10, 8) with a network flow classification label from the network flow data set A with a network flow classification label, and at the same time select Delete the selecte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Based on the network traffic identification method of random sampling multi-classifiers, a network flow data set A with network traffic classification labels and a network flow record set N without network traffic classification labels are given, and network flow records are selected from network flow data set A, Random sampling generates a subset of data B 1 , B 2 ,...,B T , using the decision tree classification algorithm to randomly extract the data subset B 1 , B 2 ,...,B T Carry out learning separately, and record the learning results as T classifiers F 1 , F 2 ,...,F T , randomly extract t classifiers from T classifiers, and use the extracted t classifiers to classify each network flow record in the network flow record set N respectively, and obtain t classification results, and the classification results are counted The classification type with the largest number is used as the classification type of network flow records without network traffic classification labels.

Description

technical field [0001] The technical invention relates to the technical field of network measurement, in particular to a network traffic identification method based on random sampling multi-classifiers. Background technique [0002] The research hotspot of traffic identification technology is the detection technology based on traffic statistical behavior characteristics. The detection technology based on traffic behavior characteristics is a traffic identification technology based on sessions. This type of method does not require any information about the application layer protocol. It is identified by analyzing the statistical measurement of network traffic and using machine learning (Machine Learning) to process traffic. Classification problems are an emerging research direction for this technique. Generally speaking, the research object is a set of sequences with the same five-tuple (source IP, sink IP, source port, sink port, protocol) values, that is, network flow (flo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/26
Inventor 程光
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products