Illegal website identification system and method based on critical path

An identification system and critical path technology, which is applied in the field of illegal website identification system based on critical path, can solve the problems of insufficient detection of unknown illegal websites, insufficient accuracy of website clustering methods, inability to quickly identify illegal websites, etc.
CN106776958AInactive Publication Date: 2017-05-31THE THIRD RES INST OF MIN OF PUBLIC SECURITY

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
THE THIRD RES INST OF MIN OF PUBLIC SECURITY
Publication Date
2017-05-31
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to an illegal website identification system and method based on a critical path. A system framework is divided into four layers, namely a user layer, an application service layer, a technical support layer and a data storage layer. The user layer provides a main account of the system; the application service layer provides a main function module of the system; the technical support layer comprises a relevant tool used in the system development process and a core algorithm program; the data storage layer provides data used in the system. According to the detailed functional division, the function module of the illegal website identification system based on the critical path comprises data preprocessing, website similarity calculation, website clustering, illegal website critical path extraction and illegal website identification. The system develops a similarity calculation program based on the Path through taking URL self characteristics as research start points, can accurately calculate the similarity among websites and obtain the effective URL critical path based on the website similarity and a Fast Unfolding clustering algorithm, and finally can discover illegal websites in unknown websites through the URL critical path.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present invention relates to the field of website identification and classification, in particular to the technical field of illegal website identification, and specifically refers to an illegal website identification system and method based on a critical path. Background technique

[0002] Identifying illegal websites is an important task in the field of network security, and the accuracy and timeliness of the identification method have higher requirements. At present, the existing website clustering research mostly starts from the perspective of user access behavior, and obtains the data of user access to the website from the Web log, including the user's access path, access frequency, access time, and access hobbies, etc., and establishes a user transaction matrix. Then cluster user groups and websites. However, this indirect website clustering method is not accurate enough to achieve fast identification of illegal websites. In the professiona...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More