Phishing website detection method and system based on adaptive heterogeneous multi-classification model

A phishing website, multi-classification technology, applied in transmission systems, character and pattern recognition, instruments, etc., can solve the problems of poor timeliness of detection methods, incomplete feature coverage, and unsatisfactory detection technology robustness and generalization performance. , to achieve the effect of improving accuracy and stability, high availability and stability, and superior generalization performance

Active Publication Date: 2018-12-07
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1
View PDF7 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] Among the above technologies, the detection method based on the black and white list is poor in timeliness and the scope of the list is also insufficient; the detection technology based on visual similarity has complex algorithms and takes a long time to detect, and cannot be applied to massive URLs (UniformResoure Locator: Uniform Resource Locator) online real-time detection; Bayesian algorithm-base

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phishing website detection method and system based on adaptive heterogeneous multi-classification model
  • Phishing website detection method and system based on adaptive heterogeneous multi-classification model
  • Phishing website detection method and system based on adaptive heterogeneous multi-classification model

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0060] The technical solutions of the present invention will be described in detail below with reference to the accompanying drawings and embodiments. The examples are only used to explain the present invention, not to limit the scope of the present invention.

[0061] like figure 1 As shown, the present invention provides a phishing website detection method based on an adaptive heterogeneous multi-classification model (AHMC), which includes the learning of the adaptive heterogeneous multi-classification model and the detection of phishing websites. step.

[0062] Step 1, select a phishing website of the same type, for example, a counterfeit phishing website of the same type as a bank, as a sample set D, |D|=n, where n represents the number of samples in D. The samples were classified into training and test sets using leave-one-out cross-validation.

[0063] The jth training sample set is: D j ={(x 1 ,y 1 ),(x 2 ,y 2 ),…,(x m ,y m )}(1≤j≤n, 1

[0064] The corr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a phishing website detection method and system based on an adaptive heterogeneous multi-classification model. The method is characterized by for a multiple-base classification algorithm, through linear addition, constructing the adaptive heterogeneous multi-classification model; training the multi-classification model, wherein a model input is the input of each base classification algorithm and an output is a sample label, and each base classification algorithm extracts a corresponding characteristic from a sample record and is taken as the input; and using a machine learning algorithm to solve a model parameter, adopting a test set to test and optimize, and finally acquiring the detection model of the type of a phishing website. The system comprises a domain name morpheme characteristic classifier, a subject index characteristic classifier, a content similarity characteristic classifier, a structural style characteristic classifier, a visual rule characteristicclassifier, a linear addition training module, an integrated classifier, a training data set management module, and a detection and alarm module. In the invention, the phishing website can be detectedin real time, and the accuracy and the stability of phishing website detection are increased.

Description

technical field [0001] The invention relates to the field of computer network security, in particular to a method and system for detecting phishing websites based on an adaptive heterogeneous multi-classification model. Background technique [0002] With the vigorous development of Internet technology, network security issues emerge in endlessly. Phishing is a typical online fraud. It uses the Internet as a carrier to deceive users to obtain sensitive information of users by disguising themselves as reputable and legitimate websites. loss. How to quickly and accurately detect phishing websites has become a research hotspot in Web (Global Wide Area Network) information security. Currently public phishing website detection technologies mainly include the following methods: [0003] (1) Detection technology based on the black and white list mechanism: as a practical core technology, the black and white list has the advantages of high efficiency and accuracy. Through the det...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L29/06G06K9/62
CPCH04L63/1416H04L63/1483G06F18/24
Inventor 臧天宁强倩杜飞周渊
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products