Method and device for classifying Internet data streams

A method and device for classifying data streams, applied in the field of communications, that addresses problems such as frequent software and hardware upgrades, failure to identify new protocols in time, and increased costs.

Inactive Publication Date: 2012-10-17
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

As a result, existing Internet data flow classification technology has the following disadvantages: the protocol feature library needs to be continuously updated to prevent new protocols and protocol variants from being identified in t...


Embodiment Construction

[0020] Figure 1 is a flow chart of the method for classifying Internet data streams provided by Embodiment 1 of the present invention. As shown in Figure 1, the method may include:

[0021] Step 101: extract feature data of the data stream to be classified according to the classification requirements.

[0022] Specifically, the feature data may include, but is not limited to: flow table relationship features between a packet and other packets; statistical features of packets (such as packet length and time interval); topology characteristics of packets in the network (such as the connection relationships between hosts); and packet payload characteristics (such as ASCII distribution and encryption). The classification requirements can be understood as the purpose of the Internet data flow classification. For example, the purpose of the classification may be to divide the data flows in the network into long-packet flows and short-packet flows, so the dimension feature...
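
Step 101 can be illustrated with a minimal Python sketch. It covers only the statistical features named above (packet length and time interval). The function name `extract_features`, the `REQUIREMENT_FEATURES` table, and the requirement names are hypothetical and do not come from the patent text; they only show how a classification requirement could select which feature dimensions are extracted.

```python
from statistics import mean

# Hypothetical mapping from a classification requirement to the feature
# dimensions it needs. The requirement names are assumptions for illustration.
REQUIREMENT_FEATURES = {
    "long_vs_short": ["mean_length", "max_length"],
    "timing": ["mean_gap"],
}


def extract_features(packets, requirement):
    """Return a feature vector for one flow.

    packets: non-empty list of (timestamp_seconds, length_bytes) tuples.
    requirement: a key of REQUIREMENT_FEATURES.
    """
    lengths = [length for _, length in packets]
    times = sorted(ts for ts, _ in packets)
    # Time intervals between consecutive packets of the flow.
    gaps = [later - earlier for earlier, later in zip(times, times[1:])]

    available = {
        "mean_length": mean(lengths),
        "max_length": max(lengths),
        "mean_gap": mean(gaps) if gaps else 0.0,
    }
    # Keep only the dimensions the classification requirement asks for.
    return [available[name] for name in REQUIREMENT_FEATURES[requirement]]


flow = [(0.0, 100), (0.5, 300), (1.5, 200)]
length_features = extract_features(flow, "long_vs_short")
timing_features = extract_features(flow, "timing")
```

Here a long-versus-short requirement selects only length-based dimensions, while a timing requirement selects the mean inter-packet interval. The other feature families listed in the paragraph (flow table, topology, payload) are left out of this sketch.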

Abstract

The invention provides a method and a device for classifying Internet data streams. The method comprises the following steps: feature data of the data stream to be classified is extracted according to a classification requirement; a diversity index is calculated between the feature data and each cluster center, where the cluster centers are formed by aggregating the training data in a feature training set, and the diversity index represents the degree of feature difference between the feature data and a cluster center; and if the diversity index between the feature data and one of the cluster centers is smaller than a preset threshold, the data stream to be classified is determined to belong to the class represented by that cluster center. With the scheme provided by the invention, the classification of Internet data streams depends only on features and is independent of protocols. New protocols and protocol variants can be classified and processed in time and do not need to be stored in a protocol database, so the classification of network data streams can keep up with the rapid change of network protocols, and software and hardware resources do not need to be frequently upgraded.
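
The steps in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how the diversity index is computed or how the training data is aggregated, so the sketch assumes Euclidean distance as the diversity index, assumes the training data is already grouped by class, and takes each cluster center as the mean of its group. All function names are hypothetical.

```python
import math


def cluster_centers(training_set):
    """Aggregate training vectors into one center per class.

    training_set: dict mapping class label -> list of feature vectors.
    Assumption: each center is the per-dimension mean of its group.
    """
    centers = {}
    for label, vectors in training_set.items():
        dim = len(vectors[0])
        centers[label] = [
            sum(vector[i] for vector in vectors) / len(vectors)
            for i in range(dim)
        ]
    return centers


def diversity_index(features, center):
    """Assumed diversity index: Euclidean distance to the cluster center."""
    return math.dist(features, center)


def classify(features, centers, threshold):
    """Return the class of the closest center if its diversity index is
    below the threshold, otherwise None (no class matched)."""
    best_label, best_index = None, math.inf
    for label, center in centers.items():
        index = diversity_index(features, center)
        if index < best_index:
            best_label, best_index = label, index
    return best_label if best_index < threshold else None


training = {
    "short": [[60, 80], [100, 120]],
    "long": [[1400, 1500], [1200, 1500]],
}
centers = cluster_centers(training)
label = classify([90, 100], centers, threshold=50)
unmatched = classify([700, 800], centers, threshold=50)
```

No protocol name appears anywhere in the decision: a flow from an unknown protocol is still assigned to a class whenever its features fall close enough to a center. Picking the closest center when several are under the threshold is a choice made for this sketch; the abstract only requires that the index for one center be below the threshold.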

Description

Technical field

[0001] The invention relates to communication technology, in particular to a method and device for classifying Internet data streams.

Background art

[0002] Existing Internet data stream classification technologies can be divided into several categories: Simple Packet Inspection (SPI), Deep Packet Inspection (DPI) feature matching, DPI behavior recognition, and Deep Flow Inspection (DFI). SPI mainly determines the basic information of the current data flow by analyzing the five-tuple (source address, destination address, source port, destination port, and protocol type) of the packet. DPI feature matching mainly determines the application carried by the service by identifying fingerprint information such as a specific character string or bit sequence in the packet. DPI behavior recognition mainly studies the behavior of the terminal and establishes a behavior r...
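
For contrast with the feature-based approach, the SPI and DPI feature-matching techniques described above can be sketched as table lookups. The port table and signature table below are small hypothetical examples, not a real protocol feature library, and the function names are assumptions introduced for illustration.

```python
from typing import NamedTuple, Optional


class FiveTuple(NamedTuple):
    src_addr: str
    dst_addr: str
    src_port: int
    dst_port: int
    protocol: str


# Hypothetical lookup tables standing in for a protocol feature library.
PORT_TABLE = {("TCP", 80): "http", ("UDP", 53): "dns"}
SIGNATURES = {b"GET ": "http", b"BitTorrent protocol": "bittorrent"}


def spi_classify(flow: FiveTuple) -> Optional[str]:
    """SPI-style lookup: uses only five-tuple fields, never the payload."""
    return PORT_TABLE.get((flow.protocol, flow.dst_port))


def dpi_classify(payload: bytes) -> Optional[str]:
    """DPI feature matching: search the payload for a known byte string."""
    for signature, label in SIGNATURES.items():
        if signature in payload:
            return label
    return None


web_flow = FiveTuple("10.0.0.1", "10.0.0.2", 51000, 80, "TCP")
spi_label = spi_classify(web_flow)
dpi_label = dpi_classify(b"\x13BitTorrent protocol")
unknown = dpi_classify(b"\x00\x01\x02")
```

Both lookups return `None` for any protocol that has no table entry, so a new protocol or variant goes unrecognized until the tables are extended.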

Application Information

IPC(8): H04L12/56, H04L12/24
Inventor: 王磊, 孙灵燕, 吴富强
Owner HUAWEI TECH CO LTD