A phishing website URL detection method based on depth learning

A phishing website and deep learning technology, applied in biological neural network models, special data processing applications, instruments, etc., can solve problems that cannot fully reflect the characteristics of phishing website URLs

Active Publication Date: 2018-12-28
SOUTHEAST UNIV
View PDF4 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method mainly includes processes such as URL character embedding representation, CNN-LSTM classification model, and model training. It can effectively capture the association and semantic information before and after characters in the URL character sequence, and effectively solve the problem that traditional phishing website detection methods based on URL features cannot fully reflect phishing. The problem of website URL characteristics, and apply convolutional neural network and long-term short-term memory network model to phishing website detection, improve detection accuracy and reduce detection false negative rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A phishing website URL detection method based on depth learning
  • A phishing website URL detection method based on depth learning
  • A phishing website URL detection method based on depth learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] In the following, the present invention will be further clarified with reference to specific examples. It should be understood that these examples are only used to illustrate the present invention and not to limit the scope of the present invention. After reading the present invention, those skilled in the art will understand various equivalent forms of the present invention. All the modifications fall within the scope defined by the appended claims of this application.

[0014] The specific implementation steps of this method are as follows:

[0015] Step 1, URL character embedding representation. URL character embedding means to quantize and encode URL string sequence as the input of convolutional neural network CNN. To this end, we must first determine all the alphabetic characters, numeric characters, and special characters that may appear in the URL, and construct character mapping rules. According to the ASCCI code table and the actual situation of URL characters, a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a phishing website URL detection method based on depth learning, which can detect the phishing website on the Internet in real time only according to the website URL. At first,the URL string sequence is encoded into one-Hot two-dimensional sparse matrix, then transformed into dense character embedding matrix and input to a convolution neural network, local depth features are extracted, and then the input and the output of convolution neural network are input to long-term and short-term memory network, the correlation of URL sequences is captured, and finally a softmaxmodel is accessed to classify URLs. The invention can avoid redundant feature engineering, extract local depth correlation feature through convolution neural network, learn long-range dependency in URL through long-term and short-term memory network, and quickly and accurately detect phishing website URL.

Description

Technical field [0001] The invention relates to a method for detecting URL of a phishing website based on deep learning. The method extracts relevant characteristics of URL string sequence, uses a deep learning method to improve classification accuracy, can detect phishing websites on the Internet in real time, and belongs to the technical field of cyberspace security. Background technique [0002] In recent years, with the rapid development of the Internet, the security deficiencies of the Internet architecture have become increasingly apparent, and various security issues such as phishing, cybercrime, and privacy leakage have become more prominent. Without cyber security, there can be no national security. Cyberspace security has become a problem that all countries in the world must face and solve together. Among various network security issues, phishing is a criminal act of stealing personal information of website users through social engineering or other complex technical mea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/04
CPCG06N3/045
Inventor 杨鹏曾朋李幼平张长江郑斌
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products