URL classification method and system
A classification method and classification system technology, applied in the field of information classification, can solve the problem of low accuracy and achieve the effect of high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] figure 1 It shows a flow chart of a method for classifying URLs provided by Embodiment 1 of the present invention, which is described in detail as follows in conjunction with the accompanying drawings:
[0049] In this embodiment, at first the URL to be classified is searched in the URL classification library, and when the URL to be classified cannot find the corresponding category in the URL classification library, the web page corresponding to the URL is analyzed to extract and express the content of the web page feature phrases, and perform lexical analysis on the feature phrases to obtain classification marks expressing user behavior, and classify according to the URL and the classification marks to update the URL classification library.
[0050] Step S101 , judging whether there is classification information of the URL to be classified in the preset URL classification library.
[0051] URL category information is set in the URL category library. Classification ma...
Embodiment 2
[0073] figure 2 It shows a flow chart of a method for classifying URLs provided by Embodiment 2 of the present invention, which is described in detail as follows in conjunction with the accompanying drawings:
[0074] Step S201, intercepting the character string of the URL to be classified.
[0075] The feature character string is a representative character string in the URL and can represent a type of URL. For example, the URL is "bbs.phicomm.com / article / title?s=123", and the corresponding feature string is: "phicomm.com / article". The present invention does not limit the specific feature string interception method. Generally speaking, the feature string includes at least the main part of the domain name and the fields in the upper-level directory.
[0076] Step S202, querying the URL classification library according to the feature string, to determine whether there is classification information of the URL to be classified in the URL classification library.
[0077] Step ...
Embodiment 3
[0084] image 3 A structural block diagram of a URL classification system provided by Embodiment 3 of the present invention is shown, and is described in detail as follows in conjunction with the accompanying drawings:
[0085] The URL classification system includes:
[0086] A judging module 31, configured to judge whether there is classification information of the URL to be classified in the preset URL classification library;
[0087] Feature phrase acquisition module 32, for when there is no classification information of the URL to be classified in the URL classification library, from the webpage corresponding to the URL to be classified, obtain the feature phrase that expresses the content of the webpage;
[0088] Classification mark generation module 33, is used for carrying out lexical analysis to described feature phrase, to generate the classification mark that expresses user's behavior;
[0089] The classification module 34 is configured to generate corresponding cl...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


