Multi-stage phishing website detection method and detection system based on supervised learning

A supervised learning, phishing website technology, applied in the field of digital information transmission, can solve the problems of black and white list lag, long running time, inability to detect phishing websites, etc., to reduce costs, ensure accuracy, and shorten detection time. Effect

Active Publication Date: 2019-03-22
HANGZHOU ANHENG INFORMATION TECH CO LTD +1
View PDF9 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem solved by the present invention is that the phishing website detection in the prior art has the defects that the detection of the page content characteristics is not comprehensive enough, the detection accuracy is low, the black and white list detection has hysteresis, and the new phishing website cannot be detected, and the running time is long. , and then provides an optimized multi-level phishing website detection method and detection system based on supervised learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described in further detail below in conjunction with the examples, but the protection scope of the present invention is not limited thereto.

[0034] The invention relates to a multi-level phishing website detection method based on supervised learning. Aiming at the problem that black and white lists cannot detect new phishing websites, the method of machine learning is used for heuristic detection of data in URL and page content detection; For problems of incompleteness and low accuracy, select features about URL and page content to improve accuracy; for problems that take a long time to detect, use a hierarchical method to reduce the amount of data for three-level detection and reduce the time for detection. The invention regularly updates the blacklist of phishing websites, and utilizes the method of machine learning to independently detect the URL and page content characteristics of the website to be tested, with high detection accuracy...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a multi-stage phishing website detection method and detection system based on supervised learning. A first detection layer performs phishing website judgment based on a blacklist database or a white list database, the to-be-detected website is directly output if the website is matched with the blacklist database or the white list database; or a second detection layer extracts features of the to-be-detected website URL and constructs a classifier model to perform detection according to the URL features of the known phishing website; if the website is detected as a suspicious website, a third detection layer downloads the page of the to-be-detected website to acquire the page content features, and the classifier model is constructed according to the known phishing website content features so as to perform the detection; an output end outputs the result that to-be-detected website is the phishing website or the normal website, and adds the data to the blacklist database and the white list database. The primary black-white list judges the known website and reduces the detection cost; the secondary URL detects and distinguish the clear phishing website or the normal website; and the third-stage page content detection identifies the suspicious website of the secondary detection, the judgment result is precise; an identification result is accurate, and the detection time is short.

Description

technical field [0001] The present invention relates to the transmission of digital information, such as the technical field of telegraph communication, and in particular to a multi-level phishing website detection method and detection system based on supervised learning with high detection accuracy and short running time. Background technique [0002] Internet phishing fraud, referred to as phishing, means that attackers trick users into clicking and visiting fake and counterfeit phishing websites by sending deceptive spam emails, instant messaging messages, etc. or credit card and other detailed information, the attacked user may lose personal private information, or suffer serious economic losses, causing extremely bad effects. This type of attack has become one of the biggest security threats to the current Internet. The main characteristics of current phishing websites are that the relevant pages are hidden more and more deeply, the URL structure is more and more compl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06G06K9/62
CPCH04L63/1483G06F18/214
Inventor 谷勇浩范渊崔兆林刘博彭渝董效宇郭振洋李凯悦林明峰金丽慧李凯
Owner HANGZHOU ANHENG INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products