Method and device for extracting characteristic string, network equipment and storage medium

A technology for extracting features and feature strings, which is applied in the field of application recognition and can solve problems such as the inability to automatically extract feature strings

Active Publication Date: 2018-06-29
NSFOCUS INFORMATION TECHNOLOGY CO LTD +1
View PDF4 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a method, device, network device and storage medium for extracting feature strings, so as to solve the problem that feature strings cannot be automatically extracted in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for extracting characteristic string, network equipment and storage medium
  • Method and device for extracting characteristic string, network equipment and storage medium
  • Method and device for extracting characteristic string, network equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0087] figure 1 A schematic diagram of a process of extracting a feature string provided by an embodiment of the present invention, the process includes the following steps:

[0088] S101: According to each first five-tuple information corresponding to the application, count each first session corresponding to each first five-tuple information; for each first session, determine the feature to be extracted in the first session A data packet of a string, and determine the word segmentation of the data packet; use each determined word as each candidate feature string.

[0089] The method for extracting a characteristic string provided by the embodiment of the present invention is applied to a network device, and the network device may be a network security device, a traffic monitoring device, and the like.

[0090] After the client triggers the application, the application performs session transmission, and the client can obtain each first quintuple information corresponding to ...

Embodiment 2

[0110] In order to make the determination of the transition entropy of the candidate feature string more accurate, on the basis of the above-mentioned embodiments, in the embodiment of the present invention, the candidate is determined according to the transition probability of each adjacent two characters and the logarithm of the transition probability. The transfer entropy of the feature string includes:

[0111] Calculate the product of the transition probability of each adjacent two characters and the logarithm of the transition probability, sum each of the obtained products, and determine the negative number of the obtained sum value as the transition entropy of the candidate feature string.

[0112] After the network device determines the transition probability of every two adjacent characters in the candidate feature string, it can calculate the logarithm of the transition probability of every two adjacent characters, and then calculate the logarithm of the transition pr...

Embodiment 3

[0120] In order to reduce the amount of calculation for extracting feature strings and ensure the validity of the extracted feature strings as much as possible, on the basis of the above-mentioned embodiments, in this embodiment of the present invention, for each first session, determine the first The data packets to be extracted feature string in the session include:

[0121] For each first session, in the preset data packet transmission direction, obtain a preset number of data packets in the first session, and use the preset number of data packets as the feature string to be extracted in the first session data packets.

[0122] A preset data packet transmission direction is stored in the network device, wherein the preset data packet transmission direction may be from the client to the server, or from the server to the client, or from the client to the server or from the server to the client. Moreover, a preset number is stored in the network device, and after each first s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for extracting a characteristic string, network equipment and a storage medium. The method comprises the steps of determining a transition probability ofeach two adjacent characters in a candidate characteristic string according to a first-order Markov transition probability matrix for each candidate characteristic string; determining a transition entropy of the candidate characteristic string according to the transition probability of each two adjacent characters and a logarithm of the transition entropy; and recording the candidate characteristic string of which the transition entropy is more than a preset threshold as a first taking characteristic string, and using the effective first taking characteristic string as the extracted target characteristic string. According to the method and the device provided by the embodiment of the invention, according to the first-order Markov transition probability matrix, the transition entropy of thecandidate characteristic string of the data packet can be determined, the candidate characteristic string meeting the transition entropy requirement is recorded as the first taking characteristic string, and the effective first taking characteristic string is used as the extracted target characteristic string. According to the method for extracting the characteristic string provided by the embodiment of the invention, automatic extraction of the characteristic string can be completely achieved without manual intervention.

Description

technical field [0001] The present invention relates to the technical field of application identification, in particular to a method, device, network equipment and storage medium for extracting feature strings. Background technique [0002] In the application identification technology, we need to identify and distinguish the traffic of different applications in the network traffic by extracting characteristic strings, so as to realize functions such as traffic control for different applications. In recent years, the vigorous development of the Internet has led to a sharp increase in the number and types of applications, and the fierce competition has also greatly shortened the application update cycle. The average update frequency of well-known applications is maintained at 1-3 weeks per version, and the update frequency of relatively well-known applications It is kept at 1-3 months / version, and even for lesser-known niche applications, the update frequency does not exceed 6...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/859H04L47/2475
CPCH04L47/2475H04L67/02H04L67/14G06F40/279G06F17/18G06F18/213G06F18/214G06F18/253
Inventor 何东静赵洪亮任家西
Owner NSFOCUS INFORMATION TECHNOLOGY CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products