Webpage classification method and device

A webpage classification method and device, applied in the field of communication technology, address the problem that prior-art webpage classification methods cannot balance complexity against classification effect, and achieve simple implementation together with accurate classification results.

Status: Inactive
Publication Date: 2019-07-05
NEW H3C SECURITY TECH CO LTD
6 Cites, 4 Cited by

AI Technical Summary

Problems solved by technology

[0004] In view of this, the embodiments of the present application provide a web page classification method and device to solve the problem that prior-art web page classification methods cannot achieve both low complexity and a good classification effect.

Examples

Detailed Description of Embodiments

[0071] The technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, rather than all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.

[0072] To solve the prior-art problem that complexity and classification effect cannot both be achieved, an embodiment of the present application provides a web page classification method, which can be executed by a server for classifying web pages. As shown in Figure 1, the method includes:

[0073] S101. Determine a webpage representative word of the webpage to be classified according to the text content in the webpage ...
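
Step S101 might be sketched as follows. The source text is truncated at this point and does not state how representative words are chosen, so this minimal sketch assumes they are simply the most frequent non-stop-word tokens in the extracted text. The names `TextAndLinkExtractor`, `extract`, `representative_words`, and `STOP_WORDS` are hypothetical and do not come from the application. Links are collected as well, because the abstract states that pages behind the links also contribute text.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical stop-word list (assumption); a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on"}


class TextAndLinkExtractor(HTMLParser):
    """Collects visible text and href targets from an HTML document."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.text_parts.append(data)


def extract(html):
    """Return (visible_text, links) for one HTML document."""
    parser = TextAndLinkExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.text_parts), parser.links


def representative_words(texts, top_k=5):
    """Assumed selection rule: the top_k most frequent non-stop-word tokens."""
    counts = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS and len(t) > 1)
    return [word for word, _ in counts.most_common(top_k)]


page = (
    "<html><body><h1>Football news</h1>"
    "<p>Football scores and football news.</p>"
    '<a href="/scores">More</a>'
    "<script>var ignored = 1;</script></body></html>"
)
text, links = extract(page)
words = representative_words([text], top_k=2)
```

To include linked pages, the text extracted from each fetched link would be appended to the `texts` list before calling `representative_words`; fetching is left out here so the sketch stays self-contained.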

Abstract

The embodiment of the invention provides a webpage classification method and device, relates to the technical field of communication, and can solve the problem that prior-art webpage classification methods cannot balance complexity against classification effect. The scheme of the embodiment comprises: determining webpage representative words of the to-be-classified webpage according to the text content in the to-be-classified webpage and the text content in the associated pages of the to-be-classified webpage, where an associated page of the to-be-classified webpage is a page corresponding to a link in the to-be-classified webpage; then generating a word vector matrix corresponding to the webpage representative words of the to-be-classified webpage, inputting that word vector matrix into a webpage classification model, and determining the type of the to-be-classified webpage according to the output of the webpage classification model.
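
The last two stages of the scheme (building a word vector matrix and feeding it to a classification model) can be illustrated with a small sketch. The visible text does not specify the embedding source or the model architecture, so everything here is an assumption: `TOY_EMBEDDINGS` holds made-up three-dimensional vectors, and `CentroidClassifier` is a nearest-centroid stand-in used only to show the data flow, not the model described in the application.

```python
import math

# Toy 3-dimensional embeddings; illustrative values only, not from the source.
TOY_EMBEDDINGS = {
    "football": [0.9, 0.1, 0.0],
    "news": [0.7, 0.2, 0.1],
    "video": [0.1, 0.9, 0.0],
    "stream": [0.0, 0.8, 0.2],
    "forum": [0.1, 0.1, 0.9],
    "thread": [0.0, 0.2, 0.8],
}
DIM = 3


def word_vector_matrix(words, embeddings=TOY_EMBEDDINGS, dim=DIM):
    """One row per representative word; unknown words map to a zero vector."""
    return [list(embeddings.get(word, [0.0] * dim)) for word in words]


def mean_vector(matrix, dim=DIM):
    """Average the rows of the matrix into a single vector."""
    if not matrix:
        return [0.0] * dim
    return [sum(row[i] for row in matrix) / len(matrix) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


class CentroidClassifier:
    """Stand-in for the webpage classification model (an assumption)."""

    def __init__(self, centroids):
        self.centroids = centroids  # maps label -> centroid vector

    def predict(self, matrix):
        pooled = mean_vector(matrix)
        return max(
            self.centroids,
            key=lambda label: cosine(pooled, self.centroids[label]),
        )


# Hypothetical class centroids for three page types named in the background.
model = CentroidClassifier({
    "news": [0.8, 0.15, 0.05],
    "video": [0.05, 0.85, 0.1],
    "forum": [0.05, 0.15, 0.85],
})

matrix = word_vector_matrix(["football", "news", "unknownword"])
page_type = model.predict(matrix)
```

In practice the embeddings would come from a trained word-vector model and the classifier would be trained on labeled pages; the sketch only shows how representative words become a matrix and how a model output maps to a page type.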

Description

Technical field

[0001] This application relates to the field of communication technology, and in particular to a method and device for web page classification.

Background

[0002] In network security monitoring and analysis, it is necessary to monitor which webpages a user visits and the category of each visited webpage, for example whether it is a news webpage, a video webpage, or a forum webpage, so that the user's behavior can be analyzed based on this information.

[0003] The traditional webpage classification method is to manually classify known webpages in advance (for example, manually classify 1000 commonly used webpages) and save the classification results in a database. Later, when a user is observed visiting a webpage, that webpage is compared with the webpages stored in the database to determine its category. However, this method requires ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/954, G06F16/906
Inventor: 孙尚勇
Owner: NEW H3C SECURITY TECH CO LTD