Method and system for constructing user network data fingerprint based on distributed processing and dpi data

A distributed processing and network data technology, applied in special data processing applications, transmission systems, electrical digital data processing, etc., can solve the problems of redundant fields, buried effective information, consumption of time resources and space resources for data analysis, etc., to maintain Sustained effectiveness, high robustness and safety, and strong portability

Active Publication Date: 2022-05-10
BEIJING UNIV OF POSTS & TELECOMM
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Telecom operators generate a huge amount of DPI data every day, simply recording the sending / receiving information of each data packet, making a lot of effective information buried in the massive data, and the fields in the original DPI data are complicated, too many fields will seriously Consume time resources and space resources in the process of data analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for constructing user network data fingerprint based on distributed processing and dpi data
  • Method and system for constructing user network data fingerprint based on distributed processing and dpi data
  • Method and system for constructing user network data fingerprint based on distributed processing and dpi data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0027] refer to figure 1 , figure 1 It is a schematic diagram of the overall flow of the network data fingerprinting system, and the method may include:

[0028] Facing the requirements of network data fingerprints, the original data is preprocessed, and the original data is cleaned and de-redundant;

[0029] Select M commonly used mobile APPs, obtain the domain name of each APP through packet capture, analyze and regularize the domain name, use the matched regular expression as the identification rule for each APP, and number them to form a traffic rule file; accurate and effective traffic The rule file provides an important guarantee for the later construction of the network data fingerprint system.

[0030] The method of determining the user set whose network data fingerprint needs to be counted includes, but is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for constructing user network data fingerprints based on a distributed processing framework and DPI data, so as to extract user mobile terminal online behavior characteristics and preferences. The system includes: data preprocessing module: face the demand of network data fingerprint to clean and remove the redundancy of original data; rule extraction module: select M commonly used mobile APPs, capture packets to get the domain name of each APP and regular match, will match The formula is used as the identification rule of each APP and forms a rule file; the user set extraction module: extracts the user set that needs to be counted by the network data fingerprint system; the user behavior extraction module: counts the user's access to M APPs per unit time period; the data Storage module: save the result partitions to the data warehouse, create indexes and back them up. The present invention establishes the corresponding relationship between network space and real life by describing the online behavior of the user's mobile terminal, provides convenience for analyzing mobile Internet user behavior, and saves space and time resources.

Description

technical field [0001] The invention discloses a method and system for constructing user network data fingerprints based on a distributed processing framework and DPI data, so as to extract user behavior characteristics and preferences when surfing the Internet at a mobile terminal. Background technique [0002] Through the analysis and feature extraction of data such as network access records, data characteristics and pattern rules with significant signs and distinctions are obtained, and based on this, a research system for network personality and behavior is established. We call this method data fingerprinting . In view of the background of massive mobile Internet data, the present invention can establish the corresponding relationship between network space and real life through the accumulation and research of network data fingerprints based on the distributed framework processing method, and clearly describe the user mobile terminal network. The access behavior provide...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9535H04L41/5061
CPCH04L41/5064G06F16/9535
Inventor 禹可吴晓非吴楚婷谭尧文
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products