Unlock instant, AI-driven research and patent intelligence for your innovation.

Counterfeit domain name training data generation method and system based on ED-GAN

An ED-GAN, training data technology, applied in transmission systems, neural learning methods, biological neural network models, etc., can solve the problems of difficult identification of new attack samples, imbalanced data sets, etc., to achieve the effect of maintaining real characteristics

Active Publication Date: 2021-01-12
BEIJING UNIV OF TECH
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] In the detection of counterfeit domain names, in order to solve the problems of unbalanced data sets and difficult identification of new attack samples, this invention will introduce the ED-GAN character-level domain name generation model to generate usable attack data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Counterfeit domain name training data generation method and system based on ED-GAN
  • Counterfeit domain name training data generation method and system based on ED-GAN
  • Counterfeit domain name training data generation method and system based on ED-GAN

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention is explained and elaborated below in conjunction with relevant accompanying drawings:

[0034] In order to make the object, technical solution and features of the present invention clearer, the present invention will be further described in detail below in conjunction with specific implementation examples and with reference to the accompanying drawings. The basic framework of the ED-GAN-based counterfeit domain name training data generation of the present invention is as follows: figure 1 shown. Each module is described as follows:

[0035] (1) Encoding module of real counterfeit domain name set

[0036] The real counterfeit domain name data set is obtained by web crawling. After being encoded by the encoder, the character-level counterfeit domain name is encoded into a domain name vector, which is used to generate the input of the discriminant network in the adversarial network, which is the re-expression of the character data of the domain name...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a counterfeit domain name training data generation method and system based on an EDGAN. The method comprises the steps: designing an encoder and a decoder of a domain name aredesigned; combining a domain name encoder, a domain name decoder and a GAN neural network, designing a character-level domain name generative adversarial network model to generate similar counterfeitdomain name samples, and predicting and detecting counterfeit domain names; and finally, through multi-classifier parameter performance comparison, carrying out validity check on the generated counterfeit domain name sample data. In order to maximize the utilization of the characteristic that GAN can directly sample and learn samples, data is not subjected to complex processing and transformation(such as a convolution layer and a pooling layer), but is directly input into a GAN original model for learning and training, so that the real characteristics of the data can be kept; the structure ofthe domain name encoder and the domain name decoder has the characteristics of simplifying and being close to the original data, so that the real characteristics of the data can be kept to the maximum extent.

Description

technical field [0001] The invention belongs to the field of deep learning and information security, in particular to an ED-GAN-based method and system for generating counterfeit domain name training data, and belongs to counterfeit domain name protection technology. Background technique [0002] With the rapid development of Internet applications, the interests carried by the Internet are increasing, and attacks against the domain name system of network communication are also becoming more frequent, which has caused a great impact on network security. Among them, phishing domain name attack has become one of the important issues threatening the safe operation of the Internet due to its low attack cost, wide range of harm, and diversified profit means. It lures users to visit the domain name by registering a domain name similar to a legitimate domain name (such as facebo0k.com, gooqle.com, etc.) instead of the domain name of the target website to publish fake advertisements,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H04L29/06H04L29/12G06N3/08G06N3/04
CPCH04L63/1483H04L63/1441G06N3/08H04L61/4511G06N3/045
Inventor 朱怡宁振虎
Owner BEIJING UNIV OF TECH