Network traffic classification device and method based on Spark performance optimization
A technology of network traffic and classification device, which is applied in data exchange network, instrument, character and pattern recognition, etc., to achieve the effect of improving processing speed, meeting high real-time requirements, and meeting the needs of network traffic classification and processing.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0069] According to the present invention figure 1 The flow of the network traffic classification device based on Spark performance optimization is shown, including a data preprocessing module, a model training module, a real-time classification module, and a Spark performance optimization module.
[0070] In the data preprocessing module;
[0071] Collect the original pcap packet traffic data of the backbone network through the data collection unit;
[0072] Extract time-related features from the collected pcap package by a feature extraction unit, and convert it into a csv file;
[0073] Use the data cleaning unit to clear empty records and duplicate records in the csv file, and save them to the HDFS file system.
[0074] In the module training model:
[0075] Use the cleaned csv file as input data to train the weight random forest classification model;
[0076] The classification accuracy of each decision tree in the random forest to its out-of-band data is used as the ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


