Phishing webpage detection method based on spatial layout and visual features

A phishing webpage detection technology based on layout features, applied in the field of information security. It addresses problems such as similarity detection failing because relative position is not considered, and achieves the effects of improving detection speed, reducing time complexity, and shortening detection time.

Status: Inactive
Publication Date: 2011-08-31
NANJING UNIV OF POSTS & TELECOMM
3 Cites, 35 Cited by

AI Technical Summary

Problems solved by technology

According to the Gestalt principles of visual perception, relative position plays a major role in human vision, especially the relative positional relationship between multiple shapes. A change in that relationship inevitably produces a visual difference. Because the algorithm does not consider relative position, similarity detection may fail. As a result, this method can only detect web pages that are visually similar to real web pages.
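
To illustrate why relative position matters, the following minimal sketch compares two layouts that contain blocks of identical size but arranged differently. The `Rect` type, the five-way `relation` scheme, and the `relation_agreement` score are assumptions made for illustration; they are not taken from the patent.

```python
from itertools import combinations
from typing import List, NamedTuple


class Rect(NamedTuple):
    """Axis-aligned block: top-left corner (x, y) plus width and height, in pixels."""
    x: float
    y: float
    w: float
    h: float


def relation(a: Rect, b: Rect) -> str:
    """Coarse position of b relative to a (hypothetical five-way scheme)."""
    if b.x >= a.x + a.w:
        return "right"
    if b.x + b.w <= a.x:
        return "left"
    if b.y >= a.y + a.h:
        return "below"
    if b.y + b.h <= a.y:
        return "above"
    return "overlap"


def relation_signature(blocks: List[Rect]) -> List[str]:
    """Relative-position label for every unordered pair of blocks, in list order."""
    return [relation(a, b) for a, b in combinations(blocks, 2)]


def relation_agreement(p: List[Rect], q: List[Rect]) -> float:
    """Fraction of block pairs whose relative position is the same in both layouts."""
    sp, sq = relation_signature(p), relation_signature(q)
    if len(sp) != len(sq) or not sp:
        return 0.0
    return sum(x == y for x, y in zip(sp, sq)) / len(sp)


# Same three blocks (logo, form, banner); the banner moves from right to left.
original = [Rect(0, 0, 100, 50), Rect(0, 60, 100, 200), Rect(120, 0, 200, 260)]
swapped = [Rect(220, 0, 100, 50), Rect(220, 60, 100, 200), Rect(0, 0, 200, 260)]

print(relation_signature(original))           # ['below', 'right', 'right']
print(relation_signature(swapped))            # ['below', 'left', 'left']
print(relation_agreement(original, swapped))  # one of three pairs agrees
```

A comparison based only on block sizes would treat these two layouts as identical, while the pairwise relations differ for two of the three pairs.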

Examples

Embodiment Construction

[0039] The technical solution of the present invention is mainly divided into three parts:

[0040] 1. Layout feature extraction part.

[0041] The layout features here are the rectangular boundaries of all visible information on the webpage, such as the boundary of a piece of text, the boundary of a picture, or the boundary of a group of visually close elements. The main work of the layout feature extraction module is to combine the browser kernel with a document object model (DOM) tree analysis tool to extract all rectangular blocks of appropriate size from the web page.

[0042] The function of this module is therefore to traverse the document object model tree of a web page, analyze the page's HTML, cascading style sheets (CSS), and JavaScript source code in combination with the layout rendering engine in the browser kernel, and obtain, for the tag information represented by each node, the display posit...
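
The traversal described in [0041] and [0042] can be sketched roughly as follows. This is a minimal sketch that assumes a rendering engine has already attached a bounding box to each rendered node. The `Node` class, the `extract_blocks` function, and the `min_area` and `max_area` thresholds are hypothetical, since the source only says blocks must be of "appropriate size".

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


@dataclass
class Node:
    """Hypothetical DOM node; box is None when the node is not rendered."""
    tag: str
    box: Optional[Box] = None
    children: List["Node"] = field(default_factory=list)


def walk(node: Node) -> Iterator[Node]:
    """Depth-first, pre-order traversal of the tree."""
    yield node
    for child in node.children:
        yield from walk(child)


def extract_blocks(root: Node, min_area: int = 400,
                   max_area: int = 500_000) -> List[Tuple[str, Box]]:
    """Collect (tag, box) for rendered nodes whose area lies within the assumed bounds."""
    blocks = []
    for node in walk(root):
        if node.box is None:
            continue  # invisible node, e.g. a script element
        _, _, w, h = node.box
        if min_area <= w * h <= max_area:
            blocks.append((node.tag, node.box))
    return blocks


page = Node("body", (0, 0, 1024, 768), [
    Node("img", (10, 10, 200, 60)),
    Node("form", (10, 100, 300, 200), [
        Node("input", (20, 120, 10, 10)),  # too small under the assumed threshold
        Node("script"),                    # not rendered
    ]),
])

print(extract_blocks(page))
# [('img', (10, 10, 200, 60)), ('form', (10, 100, 300, 200))]
```

In this example the `body` box is dropped as too large and the `input` box as too small, which leaves only the picture and the form as layout features.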

Abstract

The phishing webpage detection method based on spatial layout and visual features relates to a design that is based on webpage visual layout features and combines a spatial database with picture feature similarity comparison. The method mainly solves the problem of rapid phishing webpage detection from the perspective of webpage layout and visual similarity. The system is composed of six modules. The uppermost layer is a user interface module, which is mainly responsible for acquiring user input and feeding the result back to the user. The intermediate layer is a control module, which is responsible for dispatching all the function modules to complete the phishing webpage detection. The core of the system is four function modules: the layout feature extraction module, the spatial database module, the machine learning matching module, and the picture feature extraction and comparison module. As shown by a large amount of experimental data, the method builds a phishing webpage detection system with high speed and high precision. The data processing capacity is greatly increased and the webpage detection time is shortened while a high accuracy rate is maintained.
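
As a rough illustration of layout-based matching, the sketch below scores a suspect page against stored layouts of known pages. It is a simplification under stated assumptions: intersection over union (IoU) is swapped in as the block-matching criterion, and a linear scan over a dictionary stands in for the spatial database. Neither is confirmed by the source, and the names `iou`, `layout_similarity`, `best_match`, and the `threshold` parameter are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x, y, width, height)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned rectangles."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    iw = min(ax + aw, bx + bw) - max(ax, bx)
    ih = min(ay + ah, by + bh) - max(ay, by)
    if iw <= 0 or ih <= 0:
        return 0.0  # rectangles do not overlap
    inter = iw * ih
    return inter / (aw * ah + bw * bh - inter)


def layout_similarity(suspect: List[Box], reference: List[Box],
                      threshold: float = 0.5) -> float:
    """Share of blocks in the suspect layout that overlap some reference block."""
    if not suspect or not reference:
        return 0.0
    matched = sum(1 for s in suspect
                  if any(iou(s, r) >= threshold for r in reference))
    return matched / max(len(suspect), len(reference))


def best_match(suspect: List[Box],
               known: Dict[str, List[Box]]) -> Optional[Tuple[str, float]]:
    """Linear scan over stored layouts; returns (name, score) of the closest one."""
    if not known:
        return None
    scores = ((name, layout_similarity(suspect, layout))
              for name, layout in known.items())
    return max(scores, key=lambda pair: pair[1])


known_layouts = {
    "bank": [(0, 0, 100, 50), (0, 60, 100, 200)],
    "shop": [(0, 0, 400, 40), (300, 300, 50, 50)],
}
suspect_layout = [(2, 1, 100, 50), (0, 62, 100, 198)]

print(best_match(suspect_layout, known_layouts))  # ('bank', 1.0)
```

A real spatial database would typically use an index so that only nearby rectangles are compared, rather than scanning every stored layout as this sketch does.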

Description

Technical field

[0001] The invention relates to a method for detecting phishing pages, which mainly matches and identifies phishing webpages from the perspective of the visual similarity of webpage layout, and belongs to the field of information security.

Background art

[0002] Phishing is a form of online fraud that has become extremely rampant with the popularity of the Internet and the increase in online transactions. Phishing websites are fraudulent websites made by criminals. They are usually almost identical to banking websites or other well-known websites, thereby luring users to submit sensitive information (such as user names, passwords, bank account numbers, or credit card details) on the phishing website [Zhang2007].

[0003] Figure 1 shows the architecture of a phishing website. The most typical phishing attack process is as follows: first, lure users to a carefully designed phishing website that is very similar to th...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L29/06, H04L12/26, G06F17/30
Inventor: 张卫丰, 曾兵, 张迎周, 周国强, 许碧欢, 陆柳敏
Owner NANJING UNIV OF POSTS & TELECOMM