
A multi-stage phishing website detection method and system

A phishing website detection technology, applied in the fields of network security and information technology, that addresses problems such as class imbalance, doubtful generalizability, and the difficulty classification algorithms have in obtaining good detection results, and that speeds up detection and improves efficiency.

Active Publication Date: 2018-12-07
CHINA INTERNET NETWORK INFORMATION CENTER

AI Technical Summary

Problems solved by technology

For a model trained with the machine learning algorithms described above to perform well on the real Internet, the training samples must cover the full variety of Internet pages. However, most existing anti-phishing research validates its algorithms on relatively small sample sets, some containing only dozens of samples, so the generalizability of the results is doubtful.
In addition, even if the sample set is large enough to cover all kinds of samples, in proportions that match the real Internet, phishing detection is an extremely imbalanced problem: among the billion-level number of websites in the world, only hundreds of thousands each year are phishing websites. Directly applying existing pattern classification algorithms therefore rarely achieves good detection results.
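The effect of this imbalance can be illustrated with a short base-rate calculation. The sketch below is not taken from the patent: the detection rate, the false-alarm rate, and the exact site counts are assumed values chosen only to show the arithmetic.

```python
def precision(prevalence: float, tpr: float, fpr: float) -> float:
    """Fraction of sites flagged as phishing that really are phishing."""
    true_positives = tpr * prevalence
    false_positives = fpr * (1.0 - prevalence)
    return true_positives / (true_positives + false_positives)


# Assumed figures: 300,000 phishing sites among 1,000,000,000 websites.
raw_prevalence = 300_000 / 1_000_000_000

# Assumed classifier quality: 99% detection rate, 1% false-alarm rate.
raw_precision = precision(raw_prevalence, tpr=0.99, fpr=0.01)

# Same assumed classifier on a hypothetical pool where 10% of sites are phishing.
balanced_precision = precision(0.10, tpr=0.99, fpr=0.01)

print(f"precision on the raw Internet: {raw_precision:.3f}")
print(f"precision on a 10% pool:       {balanced_precision:.3f}")
```

Under these assumptions, fewer than 3 in 100 alerts on the raw population would be real phishing sites, while the same classifier on the more balanced pool would be right more than 9 times in 10.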



Examples


Embodiment Construction

[0034] A necessary prerequisite for a phishing detection method based on pattern classification to achieve good results is that the training samples are rich enough, that is, that they cover all kinds of Web pages. However, phishing website detection in the real Internet environment is an extremely class-imbalanced problem, as shown in Figure 1, where the black spot in the center indicates phishing websites and the gray circle indicates non-phishing websites.

[0035] None of the existing phishing detection methods and strategies based on statistical learning takes this fact into account, nor do they adequately explain the coverage and rationality of the test data sets they construct. To address this situation, the present invention designs a layered detection strategy, namely multi-stage phishing detection. The core of this strategy is to rationally design the filtering rules of each layer to achieve the purpose of improving de...



Abstract

The present invention discloses a multi-stage phishing website detection method and system that combines fast filtering with accurate filtering. Multiple stages of fast filtering keep the number of potential phishing websites within a relatively small range; an accurate determination model is then trained by analyzing the statistical features of positive and negative samples within that small range. The method comprises the following steps: selecting a range of to-be-detected websites, performing fast filtering on it, and excluding obvious non-phishing websites; and performing accurate determination on the websites that remain after the fast filtering to determine whether they are phishing websites. The system comprises: a fast filtering module, configured to select a range of to-be-detected websites, perform fast filtering, and exclude obvious non-phishing websites; and an accurate determination module, configured to perform accurate determination on the to-be-detected websites that remain after the fast filtering.
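The two modules named in the abstract can be sketched as a small pipeline. This is only an illustration of the overall shape: the `Site` fields, the two filter rules (`is_whitelisted`, `is_long_established`), and the stand-in decision rule in `accurate_determination` are all hypothetical, since this record does not state the patent's actual filtering rules or trained model.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Site:
    domain: str
    age_days: int          # hypothetical feature
    has_login_form: bool   # hypothetical feature


# A fast filter returns True when a site is an obvious non-phishing site.
FastFilter = Callable[[Site], bool]


def is_whitelisted(site: Site) -> bool:
    # Hypothetical rule: a small allowlist of known-good domains.
    return site.domain in {"example.com", "example.org"}


def is_long_established(site: Site) -> bool:
    # Hypothetical rule: domains older than about ten years are excluded.
    return site.age_days > 3650


def fast_filter(sites: Iterable[Site], filters: List[FastFilter]) -> List[Site]:
    """Apply each cheap filter in turn, dropping obvious non-phishing sites."""
    remaining = list(sites)
    for is_obviously_benign in filters:
        remaining = [s for s in remaining if not is_obviously_benign(s)]
    return remaining


def accurate_determination(site: Site) -> bool:
    # Stand-in for the trained model; a real system would score
    # statistical features learned from positive and negative samples.
    return site.has_login_form and site.age_days < 30


def detect(sites: Iterable[Site], filters: List[FastFilter]) -> List[Site]:
    """Fast filtering first, then accurate determination on what remains."""
    candidates = fast_filter(sites, filters)
    return [s for s in candidates if accurate_determination(s)]


sites = [
    Site("example.com", 5000, True),
    Site("old-shop.test", 4000, True),
    Site("new-login.test", 3, True),
    Site("new-blog.test", 10, False),
]
filters = [is_whitelisted, is_long_established]
flagged = detect(sites, filters)
print([s.domain for s in flagged])  # ['new-login.test']
```

Ordering the stages this way means the more expensive determination step only runs on the candidates that survive the cheap filters, which is the source of the speed-up the abstract describes.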

Description

Technical field

[0001] The invention relates to the field of information technology, specifically network security technology, and in particular to a multi-stage phishing website detection method and system.

Background art

[0002] Today, the Internet has become an important part of people's social life. With the continuing spread of the Internet and the rising level of its applications, Internet phishing fraud has gradually become, alongside traditional information security threats such as Trojan horses, viruses, and botnets, one of the top attack vectors for cybercriminals.

[0003] Internet phishing ("phishing") is a coined word in common use worldwide, formed by replacing the "f" of "fishing" with "ph", the first two letters of "phreak" (a person who steals phone line access). It is a form of cybercrime that combines social engineering (that is, deception) with network communication technology. The purpose of Internet phishing is ...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F21/55
CPC: G06F21/55
Inventors: 耿光刚, 李晓东
Owner: CHINA INTERNET NETWORK INFORMATION CENTER