c&c domain name identification method based on domain name features

An identification method and a technology of domain name characteristics, applied in the field of network security, can solve the problems of error-prone division of host domain name request sequences, low applicability and generalization of prediction models, and difficulty in realizing accurate identification of actual domain names, so as to overcome low applicability and Promote, save manpower and material resources, and enhance the effect of strong landing

Active Publication Date: 2018-10-09
CTRIP COMP TECH SHANGHAI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Disadvantages: The C&C domain name generated by the DGA algorithm contains a single type of domain name, resulting in a single type of domain name contained in the training data set. Therefore, the prediction model generated by training in this way has low applicability and generalizability, and it is difficult to realize the category of the actual domain name. Accurate discrimination
[0008] Disadvantages: The thresholds taken for some features are subjective and arbitrary, not calculated by the model, and lack a certain degree of objectivity
[0011] Disadvantages: Two detection algorithms based on DNS invalidation, DGA domain name detection and invalid C&C domain name detection, are easily affected by IP (interconnection protocol between networks) spoofing and DNS spoofing
Invalid DNS request sequence detected by C&C has a division boundary of 0 points, which is easy to mistakenly divide the host domain name request sequence and affect the accuracy of periodic judgments

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • c&c domain name identification method based on domain name features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention is further illustrated below by means of examples, but the present invention is not limited to the scope of the examples.

[0030] Such as figure 1 As shown, the C&C domain name identification method based on the domain name feature of the present invention comprises the following steps:

[0031] Step 101, based on the qualitative characteristics of distinguishable domain name categories, generate quantitative indicators for determining domain name categories for a given domain name; the generated quantitative indicators may include, for example, the proportion of vowels in domain names, the number of occurrences of pinyin in domain names, etc. ;

[0032] Step 102, randomly extract some domain names from the given domain names to enter the training data set, and enter the remaining domain names into the test data set, and apply the decision tree integration algorithm bagging algorithm to generate a domain name category judgment model based on the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a C&C domain name identification method based on domain name features, including: S1, based on the qualitative features of the domain name, generating quantitative indicators for determining the domain name category for a given domain name; S2, randomly extracting from the given domain name Part of the domain names enter the training data set, and the remaining domain names enter the test data set, and apply the decision tree integration algorithm to generate a domain name category judgment model based on the training data set; S3. Apply the generated domain name category judgment model to the domain name category of the remaining domain names in the test data set Make a judgment and compare it with the actual category of the remaining domain names, and calculate the predictive performance index of the domain name category determination model; S4. Correct the domain name category determined by applying the domain name category determination model; S5. Based on the corrected domain name category , to generate statistical results for a single domain name. The invention can accurately find the C&C domain name, and enhances the robustness, feasibility and comprehensibility of the model.

Description

technical field [0001] The invention relates to the field of network security, in particular to a C&C domain name identification method based on domain name features. Background technique [0002] The prior art on C&C domain name (a type of domain name) identification in this field is specifically as follows: [0003] 1. Topic: Using Machine Learning to Identify Randomly Generated C&C Domain Names [0004] Content: Take the C&C domain names generated by the DGA algorithm (domain name generation algorithm) and the top 100,000 legitimate domain names in the Alexa ranking (the world ranking of websites) as positive and negative examples, and generate quantitative indicators that can effectively identify the two types of domain names. After generating the corresponding indicators, use the support vector machine model to judge the domain name category. [0005] Disadvantages: The C&C domain name generated by the DGA algorithm contains a single type of domain name, resulting in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04L29/12H04L29/06
CPCH04L63/1408H04L61/4511
Inventor 唐力岳扶天周海燕
Owner CTRIP COMP TECH SHANGHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products