System and method for distinguishing phishing websites

A technology for phishing websites and websites, applied in the field of network security, can solve the problems of inaccurate identification of new phishing websites, high false positive rate and imperfect identification technology, and achieve the effect of reducing dimensions, improving training efficiency, and improving accuracy.

Active Publication Date: 2014-01-29
SHENZHEN INST OF ADVANCED TECH
View PDF2 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] However, there are still some problems in the current research methods and technologies for detecting phishing websites: 1) Manual reporting and identification requires personal experience, and the efficiency is relatively low; 2) Blacklist-based detection technology can only identify phishing websites in the blacklist phi

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for distinguishing phishing websites
  • System and method for distinguishing phishing websites
  • System and method for distinguishing phishing websites

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.

[0038] see figure 1 , the first embodiment of the present invention provides a phishing website identification system 100, which includes a page crawling module 10, a feature extraction module 20, a web page relationship modeling module 30, a decision tree classification module 40, and an identification module 50; Get the page source code that module 10 is used for crawling website, and extract the Chinese text of website and the internal / external link quantity of website; Described feature extraction module 20 is connected with described page crawling module 10, is used for extracting described The page feature words of the website, the ratio of the number of internal / external links and the ranking information; the web page relationship modeling module 30 is connected with the feature extraction module 20, and is used to obtain the w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a system for distinguishing phishing websites. The system comprises a page crawling module, a feature extracting module, a webpage relationship modeling module, a decision tree classification module and a distinguishing module. The page crawling module crawls page source codes of the websites and extracts Chinese texts and internal/external link numbers of the websites. The feature extracting module extracts page feature words, the internal/external link numbers and ranking information of the websites. The webpage relationship modeling module acquires relationships between the websites and black/white lists according to the page feature words. The decision tree classification module utilizes decision trees for training and creating decision tree classification models. The distinguishing module stores the decision tree classification models and distinguishing whether unknown websites are phishing websites or not according to the decision tree classification models. By the system, accuracy of classification distinguishing can be effectively improved, and limitation that detecting techniques based on black lists can only identify phishing websites in the black lists can be overcome. The invention further provides a method for distinguishing the phishing websites.

Description

technical field [0001] The invention relates to the technical field of network security, in particular to a phishing website identification system and method. Background technique [0002] With the rapid development of the Internet and the deepening of the informatization process, people's work, study and lifestyle have become more and more closely integrated with the Internet. Instant messaging, e-mail, e-commerce, online games, online office, etc. are closely related to daily life. However, following the development of informatization, information security issues have become increasingly prominent, and cybercrimes emerge in endlessly. Phishing is one of the most serious forms of Internet crime, and it has become more frequent in recent years. The so-called "phishing website" refers to criminals using various means to counterfeit the address and page content of the real website, or using loopholes in the server program of the real website to insert dangerous HTML codes in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/55G06F17/30
CPCH04L63/1483
Inventor 张巍姜青山
Owner SHENZHEN INST OF ADVANCED TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products