Method and apparatus for classifying websites

A website classification method and apparatus, applied in the field of network communication, that address the problems of difficult domain name maintenance, low implementation efficiency, and low accuracy.

Active Publication Date: 2015-08-05
BEIJINGNETENTSEC

AI Technical Summary

Problems solved by technology

[0002] With the rapid development of the World Wide Web, users place ever higher demands on web page access control, which increases the demand for web page classification. At present, web page domain names are classified mainly by comparing the host field of each website. A common application scenario: a user is allowed to visit only the 163 website and no others. The implementation checks whether the host field contains ".163.com"; if it does, access is allowed, and otherwise it is denied. However, the 163 website also includes some *.126.com and *.netease.com domain names, so this approach suffers from low implementation efficiency and low accuracy, and it also makes domain names hard to maintain.
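
The limitation described above can be illustrated with a minimal Python sketch. The function names `naive_host_allowed` and `related_host_allowed` and the hand-written set of related domains are assumptions made for illustration; they do not come from the patent text.

```python
from typing import Iterable


def naive_host_allowed(host: str, allowed_suffix: str = ".163.com") -> bool:
    """Substring comparison as described in the text: allow the request
    only if the host field contains the configured suffix."""
    return allowed_suffix in host


def related_host_allowed(host: str, related_domains: Iterable[str]) -> bool:
    """Hypothetical alternative: allow the host if it equals, or is a
    subdomain of, any domain known to belong to the same website."""
    host = host.lower().rstrip(".")
    return any(host == d or host.endswith("." + d) for d in related_domains)


# Hand-maintained list for illustration only; keeping such a list up to
# date by hand is the maintenance burden the text mentions.
NETEASE_DOMAINS = {"163.com", "126.com", "netease.com"}
```

With these definitions, `naive_host_allowed("mail.126.com")` is false even though the host belongs to the 163 website, while `related_host_allowed("mail.126.com", NETEASE_DOMAINS)` is true. The substring check also accepts a host such as `www.163.com.evil.example`, which is one way it can lose accuracy.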

Examples

Embodiment Construction

[0029] In the embodiment of the present invention, the first-level domain name of the web page is added to the list of domain names to be analyzed, and the cross-domain policy file of the first-level domain name is parsed to obtain one or more first domain names. The obtained first domain names are added to the list of domain names to be analyzed, and the first-level domain name is classified according to the preset classification standard. The cross-domain policy files of the obtained first domain names are then parsed in turn, in order of the levels of the first domain names, to obtain one or more second domain names. The obtained second domain names are added to the list of domain names to be analyzed according to their levels, and the obtained first domain names are classified according to the preset classification standard. After classifying the cu...
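
One possible reading of the procedure above is a level-bounded, breadth-first walk over the list of domain names to be analyzed. The sketch below follows that reading under stated assumptions: `build_classification_table`, `fetch_policy_domains`, `classify`, and `max_level` are hypothetical names, the first-level domain name is treated as level 1, and the way a policy file is fetched and parsed is left to the caller.

```python
from collections import deque
from typing import Callable, Dict, Iterable


def build_classification_table(
    first_level_domain: str,
    fetch_policy_domains: Callable[[str], Iterable[str]],
    classify: Callable[[str], str],
    max_level: int,
) -> Dict[str, str]:
    """Return a mapping of domain name -> category.

    fetch_policy_domains(domain) is assumed to return the domain names
    listed in that domain's cross-domain policy file.
    classify(domain) is assumed to apply the preset classification standard.
    """
    table: Dict[str, str] = {}
    # The "list of domain names to be analyzed", kept in level order.
    to_analyze = deque([(first_level_domain, 1)])
    seen = {first_level_domain}

    while to_analyze:
        domain, level = to_analyze.popleft()
        if level > max_level:
            # Domain names deeper than the preset level are left alone.
            continue
        # Parse this domain's policy file and queue what it lists.
        for found in fetch_policy_domains(domain):
            if found not in seen:
                seen.add(found)
                to_analyze.append((found, level + 1))
        # Classify the current domain name.
        table[domain] = classify(domain)
    return table
```

For example, with a stub `fetch_policy_domains` that returns `["126.com", "netease.com"]` for `"163.com"` and a `max_level` of 2, the resulting table contains all three domain names, and anything listed only by `126.com` or `netease.com` is skipped.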

Abstract

The present invention discloses a method for classifying websites, comprising: adding a top-level domain name of a website to a list of domain names to be analyzed; parsing a cross-domain policy file of the top-level domain name to obtain one or more first domain names; adding the first domain names to the list of domain names to be analyzed and classifying the top-level domain name; parsing the cross-domain policy files of the obtained first domain names one by one to obtain one or more second domain names; adding the second domain names to the list of domain names to be analyzed and classifying the obtained first domain names; and, after the current domain names are classified, continuing to parse cross-domain policy files and classify the domain names in the list for as long as their level does not exceed the preset level, while leaving unprocessed any domain name whose level exceeds the preset level, so as to obtain a domain name classification relation table. The present invention further discloses an apparatus for classifying websites.
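
The parsing step can be sketched as follows, assuming the cross-domain policy file uses the Adobe `crossdomain.xml` layout, in which `allow-access-from` elements carry a `domain` attribute. The abstract does not name the file format, so this layout, the function name `parse_policy_domains`, and the choice to strip a leading `*.` wildcard are all assumptions.

```python
import xml.etree.ElementTree as ET
from typing import List


def parse_policy_domains(policy_xml: str) -> List[str]:
    """Extract the distinct domain names listed in a cross-domain policy.

    Assumes <allow-access-from domain="..."/> entries. A leading "*."
    wildcard is stripped, and a bare "*" (allow everything) is ignored
    because it names no specific domain.
    """
    root = ET.fromstring(policy_xml)
    domains: List[str] = []
    for node in root.iter("allow-access-from"):
        domain = node.get("domain", "").strip().lower()
        if domain.startswith("*."):
            domain = domain[2:]
        if domain and domain != "*" and domain not in domains:
            domains.append(domain)
    return domains
```

A function of this shape could supply the domain names obtained from each policy file; downloading the file itself is outside the scope of the sketch.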

Description

Technical field

[0001] The invention relates to the technical field of network communication, and in particular to a web page classification method and device.

Background art

[0002] With the rapid development of the World Wide Web, users place ever higher demands on web page access control, which increases the demand for web page classification. At present, web page domain names are classified mainly by comparing the host field of each website. A common application scenario: a user is allowed to visit only the 163 website and no others. The implementation checks whether the host field contains ".163.com"; if it does, access is allowed, and otherwise it is denied. However, the 163 website also includes some *.126.com and *.netease.com domain names, so there are problems of low implementation efficiency and low accuracy, and it is also not conducive to the maintena...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 张磊
Owner: BEIJINGNETENTSEC