Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for classifying web pages

A web page classification and web page technology, applied in the field of network communication, can solve the problems of low accuracy, unfavorable domain name maintenance, and low implementation efficiency.

Active Publication Date: 2018-04-27
BEIJINGNETENTSEC
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] With the rapid development of the World Wide Web, users have higher and higher requirements for web page access control, resulting in an increasing demand for web page classification. However, the classification of web page domain names is mainly realized by comparing the host field of each website. The common Application scenarios such as: the user requests to only visit the 163 website, and other websites cannot be accessed; the implementation method is to compare whether the host field contains ".163.com", if it is included, it can be accessed; if it is not included, it cannot be accessed, but , the 163 website also includes some *.126.com and *.netease.com domain names, so there are problems of low implementation efficiency and low accuracy, and it is also not conducive to the maintenance of domain names

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for classifying web pages
  • Method and device for classifying web pages
  • Method and device for classifying web pages

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In the embodiment of the present invention, the first-level domain name of the webpage is added to the list of domain names to be analyzed, and the cross-domain policy file of the first-level domain name is analyzed to obtain one or more first domain names, and the obtained Add the first domain name to the list of domain names to be analyzed, and classify the first-level domain names according to the preset classification standards; analyze the obtained cross-domain policy files of the first domain name according to the levels of the obtained first domain name in order to obtain One or more second domain names, adding the obtained second domain names to the list of domain names to be analyzed according to the levels of the obtained second domain names, and classifying the obtained first domain names according to the preset classification standards until the obtained first domain names are classified according to the preset classification standards. After classifying the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for classifying webpages. The first-level domain name of the webpage is added to the list of domain names to be analyzed, the cross-domain policy file of the first-level domain name is analyzed to obtain one or more first domain names, and the obtained first domain names are added to all Describe the list of domain names to be analyzed, and classify the first-level domain names; sequentially analyze the obtained cross-domain policy files of the first domain names to obtain one or more second domain names, and add the obtained second domain names to the list of domain names to be analyzed , and classify the obtained first domain name until the current domain name is classified, and when it is determined that the number of domain names in the list of domain names to be analyzed is not less than the agreed The processed domain names are analyzed for cross-domain policy files and domain names are classified, and the domain names whose levels obtained by analysis are greater than the agreed levels are not processed, and the domain name classification relationship table is obtained. The invention also discloses a web page classification device.

Description

technical field [0001] The invention relates to the technical field of network communication, in particular to a web page classification method and device. Background technique [0002] With the rapid development of the World Wide Web, users have higher and higher requirements for web page access control, resulting in an increasing demand for web page classification. However, the classification of web page domain names is mainly realized by comparing the host field of each website. The common Application scenarios such as: the user requests to only visit the 163 website, and other websites cannot be accessed; the implementation method is to compare whether the host field contains ".163.com", if it is included, it can be accessed; if it is not included, it cannot be accessed, but , the 163 website also includes some domain names of *.126.com and *.netease.com, so there are problems of low implementation efficiency and low accuracy, and it is also not conducive to the maintena...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 张磊
Owner BEIJINGNETENTSEC