Website classifying method

A classification method and website technology, applied in the field of network security, can solve problems such as poor website classification effect, and achieve the effect of increasing convenience, versatility, and strong stability

Inactive Publication Date: 2014-02-26
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF1 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is to provide a website classification method to solve the problem of poor website classification effect in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Website classifying method
  • Website classifying method
  • Website classifying method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0036] Such as figure 1 As shown, the embodiment of the present invention relates to a website classification method based on a self-encoding deep learning model, comprising the following steps:

[0037] Step S101, obtain the multi-dimensional attributes of the website, and use the set to represent the multi-dimensional attributes:

[0038] This step specifically includes the following steps:

[0039] Step S1011, performing HTML (HyperText Markup Language, Hypertext Markup Language) processing on the homepage of the website, extracting the HTML title, HTML text and CSS (Cascading Style Sheets, Cascading Style Sheets) theme color of the homepage;

[0040] Step S1012, perform wor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a website classifying method. The website classifying method comprises the following steps: obtaining multidimensional attributes of a website and representing the multidimensional attributes by utilizing a set; carrying out self-coding characteristic learning for the set that represents the multidimensional attributes; carrying out website clustering learning by utilizing a self-coding learning result to obtain a support vector machine (SVM) used for carrying out website classifying; a step S104: while classifying any unmarked website, firstly carrying out a step S101 and a step S102 to obtain the self-coding learning result corresponding to the web site; and then, inputting the structure into the SVM obtained in the step S103, and finally carrying out website classifying to obtain the category of the website. The website classifying method disclosed by the invention can efficiently and accurately classify the website according to the industry category, and also can quickly detect a fishing webpage with malicious characteristics. A way of multidimensional attribute description is adopted, so that convenience and universality of the system are increased; and moreover, the system has extremely strong stability.

Description

technical field [0001] The invention relates to the technical field of network security, in particular to a website classification method. Background technique [0002] With the vigorous development of the Internet industry, network security incidents such as phishing fraud, Trojan hidden links and privacy leaks occur frequently, causing serious property and mental harm to network users. How to quickly and intelligently identify phishing websites and provide appropriate privacy protection levels for different types of websites has become a hot research topic in the current security field. This requires an intelligent and accurate website classification technology to deal with massive Internet websites. [0003] At present, the domestic and foreign research on website classification technology is not very extensive, and the website feature description used in the analysis is relatively simple. Overall, the main research directions are as follows: (1) Based on webpage text. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/02
CPCG06F16/958G06N3/08
Inventor 胡俊王明华云晓春李佳贺敏纪玉春何能强高胜朱天
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products