Phishing webpage detection method based on Hungary matching algorithm

A technology of phishing webpages and matching algorithms, which is applied in the intersecting field of information security and information acquisition, can solve the problems of low calculation accuracy of webpage similarity, affect the accuracy and recall rate of phishing webpage detection, and deceive phishing webpage creators, etc., and achieve information The effect of reducing the amount, facilitating rapid positioning, and improving the detection speed

Inactive Publication Date: 2010-09-08
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 68 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method expands the characteristics of phishing webpages and further improves the detection accuracy of phishing webpages to a certain extent. However, this method still only uses the information of a single webpage when extracting the characteristics of phishing webpages, so it is easy to be detected by phishing webpage creators. cheat
[

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phishing webpage detection method based on Hungary matching algorithm
  • Phishing webpage detection method based on Hungary matching algorithm

Examples

Experimental program
Comparison scheme
Effect test

example

[0117] 1. Web page rendering feature extraction

[0118] (1) Realize the extraction of the feature of each text node in the web page, the feature of each node is a 6-dimensional vector, which has the following 6 components:

[0119] ●Text content, obtained through the childNode.nodeValue.trim() function of JavaScript

[0120] ●Foreground color

[0121] ●Background color

[0122] ●Font size

[0123] ●Font name

[0124] ●The position of the text node in the web page

[0125] (2) Realize the extraction of the feature of each picture node in the webpage, the feature of each node has:

[0126] ●The value of the src attribute of the image, which is obtained by calling the imgNode.src method of JavaScript

[0127]●The area of ​​the image, call the image attributes img.width and img.height of JavaScript, and multiply them to get the area value

[0128] ●Color histogram

[0129] ●Two-dimensional Haar wavelet transform

[0130] ●The position of the picture node in the web page ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A phishing webpage detection method based on Hungary matching algorithm is characterized by firstly extracting the text feature signatures, image feature signatures and general webpage feature signatures of the rendered webpages and more comprehensively depicting the features after access to webpages; and then computing the optimal matching of bipartite graphs by Hungary algorithm to search for the matched feature pairs among different webpage signatures and more objectively measuring the similarity among the webpages on the basis, thereby improving the phishing webpage detection efficiency. The method is also characterized by determining the inside weights of the text features, image features and global image features by utilizing the area under curve and determining the relative weightsamong the text similarity, image similarity and global image similarity during webpage similarity computation by utilizing logarithmic regression analysis. The precision and the recall rate are greatly improved in the method provided by the invention.

Description

technical field [0001] The invention relates to a method for detecting phishing websites, which mainly uses a Hungarian matching algorithm to analyze and identify phishing webpages from the perspective of similarity detection, and belongs to the intersecting field of information security and information acquisition. Background technique [0002] "Phishing website" is an online fraud that has become extremely rampant with the popularity of the Internet and the increase in online transactions. "Phishing websites" are fraudulent websites made by criminals. "Phishing websites" are usually almost identical to bank websites or other well-known websites, thereby luring website users to submit sensitive information (such as: user name, password, account ID, ATM PIN or credit card details, etc.). The most typical phishing attack process is as follows: first, lure users to a carefully designed phishing website that is very similar to the target organization's website, and then obtain...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 张卫丰贡亮张迎周周国强陆柳敏许碧娣田先桃李涛贤曾兵彭寅陆柳青
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products