Multi-feature fusion phishing webpage detection method

A multi-feature fusion and phishing webpage technology, which is applied in the direction of network data retrieval, other database retrieval, special data processing applications, etc., can solve problems such as time-consuming, labor-intensive, labor-intensive, and difficulty in obtaining marked data

Active Publication Date: 2017-05-31
CHONGQING UNIV OF POSTS & TELECOMM
View PDF7 Cites 53 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing phishing webpage detection technology is mainly based on blacklist mechanism and machine learning technology, and there are still some problems in the model construction: (1) Blacklist-based detection mechanism is the main means of browser security protection at present, but blacklist-based The establishment of the list database mainly relies on heuristic learning, manual reporting, and manual verification to determine that it takes a certain amount of labor costs and is easily affected by subjective factors. Secondly, the update of the blacklist database also has a certain lag, and it is difficult to meet large-scale The need for timely detection under phishing; (2) Most of the existing phishing web page detection methods based on machine learning use supervised learning methods to build classification models. Supervised learning methods can only use labeled data for training. In order to ensure the generalization of learning Due to the short life cycle of phishing websites, it is difficult and time-consuming to obtain labeled data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-feature fusion phishing webpage detection method
  • Multi-feature fusion phishing webpage detection method
  • Multi-feature fusion phishing webpage detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The specific implementation of the present invention will be further explained in detail below in conjunction with the accompanying drawings.

[0053] The phishing web page detection system of the present invention is divided into three layers: a user access layer, a phishing web page detection layer and a data storage layer. The user access layer includes: access terminal; the phishing web page detection layer includes: information acquisition module, model training module and model detection module; the data storage layer includes: blacklist library. When a user accesses the Internet, the access layer intercepts the URL requested by the user and sends it to the phishing web page detection layer. The phishing webpage detection layer compares the received URL with the blacklist library of the data layer storage layer, and if it exists, sends a warning message to the access layer to remind the user of the threat; otherwise, the URL is handed over to the phishing webpage ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a multi-feature fusion phishing webpage detection method, which comprises two parts such as a training process and a detection process. The multi-feature fusion phishing webpage detection method integrates three views of phishing webpage characteristics by combining a semi-supervised learning tri-training method, and mainly solves a problem that the existing phishing webpage detection methods mostly need to perform classification model training by using supervised learning through a large amount of annotation data. The method provided by the invention mainly combines a coordinated training algorithm, starts from webpage URL characteristics, webpage information characteristics and webpage search information characteristics, applies the idea of multiple views and multiple classifiers to phishing webpage detection, and achieves the purposes of reducing the total numbers of manual annotation training samples and timely recognizing a phishing webpage through coordinated training and learning of different classifiers.

Description

technical field [0001] The invention relates to a method for detecting phishing webpages, which mainly matches and recognizes phishing webpages from three characteristic views of URL, page and search information combined with a tri-training semi-supervised learning method, and belongs to the intersection field of information security and data mining. Background technique [0002] Today, with the rapid development of the Internet, the rise of e-commerce and the popularity of Internet payment, Internet commerce has become an indispensable part of more and more people's life and work. However, just under the background of the rapid development of Internet payment, the security situation of Internet payment is also becoming more and more serious. Among them, phishing attack, referred to as phishing, is an online identity forgery attack with the most serious harm and the highest success rate among various forms of Internet fraud. Criminals mainly use visual effects to imitate sp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06G06F17/30
CPCG06F16/95H04L63/1483
Inventor 徐光侠宋洋洋高郭威刘宴兵刘俊齐锦郑爽王天羿
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products