Method and system for classifying website

A classification method and website technology, applied in the field of information classification, to achieve the effect of improving classification efficiency, improving accuracy and improving classification speed

Inactive Publication Date: 2007-09-19
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF0 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The technical problem to be solved by the present invention is to provide a website classification method and system to solve the problem of how to determine the website category more accurately and quickly and realize accurate classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for classifying website
  • Method and system for classifying website
  • Method and system for classifying website

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0035] The embodiment of the present invention mines the log information of the search engine, extracts the frequent query words that the user enters a website from the search engine, and completes the classification of the website through an automated process based on the frequent query words.

[0036] Referring to FIG. 1 , it is a flow chart of the steps of the website classification method according to the embodiment of the present invention.

[0037] Step 101, acquire user query words. The query word is the text information entered by the user in the input box of the search engine, that is, the aforementioned search word. There are many ways to obtain user query words, but one of the more common and convenient methods is to o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A website classification method and system are disclosed for determining the website category more rapidly and accurately, the method comprises the steps of: setting feature vectors for each website, in which each single dimension of the feature vector is a different user query word, the value of which is equal to the emergence times of correspondent query word, classifying the website. The query words are more representative than the common words in the webpage, and can benefit the classification of websites and improve the classification accuracy because the emergence times of the query words is counted by the click times of the user which represents the close relationship between the clicked website and the query word, in addition, the generated feature vector is very short which can improve the classification efficiency greatly.

Description

technical field [0001] The invention relates to information classification technology, in particular to a website classification method and system. Background technique [0002] Among the websites that provide search engine services, it is necessary to classify other websites in order to provide more complete services. For example, websites can be classified into pornographic websites and normal websites; if a certain website is classified into the category of pornographic websites, further measures can be taken against the website. Or classify the content into military websites, financial websites, news websites, etc.; the content classification of the websites can be applied to category-based search engine services. In addition, in the website navigation service, it is also necessary to classify multiple websites, and divide each website into the most suitable category to provide convenience for users to inquire. [0003] As for how to determine the category of a website...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 张阔张智敏
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products