Automatic webpage classification method and system

A technology for automatic classification and web pages, applied in the Internet field, can solve problems such as low accuracy, low efficiency, and huge data volume

Inactive Publication Date: 2010-08-25
SHANGHAI FUGE INFORMATION SCI & TECH
View PDF0 Cites 58 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The technical problem to be solved by the present invention is to provide a method and system for automatically classifying webpages in order to overcome the defects of the prior art such as low accuracy, low efficiency, and inapplicability to situations with a huge amount of data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic webpage classification method and system
  • Automatic webpage classification method and system
  • Automatic webpage classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic webpage classification method and a system. The method comprises the following steps: S1: searching website webpages related to a client webpage, and capturing titles and variable data information of the webpages therefrom to form a webpage list set; S2: processing characters in the webpage list set to form a classification keyword list set; S3: statistically gathering the frequency of occurrence of classification keywords in the classification keyword list set on the webpage to enable each webpage to correspond to one classification keyword frequency vector; and S4: establishing a statistical model, calculating the classification keyword frequency vector distance between each target webpage and the client webpage to obtain the correlation degree between each target webpage and the client webpage, and automatically classifying the target webpage on the basis of the correlation degree. The invention can automatically find the optimum webpage based on unbiased estimation and thereby a great amount of high-quality potential webpage can be recommended to clients for link exchange.

Description

Web page automatic classification method and system technical field The invention relates to Internet fields such as search engine marketing, network link exchange and automatic webpage classification, and in particular to a method and system for automatic webpage classification, which uses statistical methods to perform automatic search, content analysis and correlation classification on webpages. Background technique Exchanging links with related web pages can increase website traffic, increase website popularity, and improve search engine rankings. It is the most commonly used technical means in search engine marketing (SearchEngineMarketing, SEM). However, how to obtain high-quality links that are highly relevant to the content of the customer's web page is a difficult problem in this technical means at present. The current search engine optimization (Search Engine Optimization, SEO) technology uses manual search, third-party recommendation and other artificial means t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 魏亮丁力韩雪岭郭为张薇
Owner SHANGHAI FUGE INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products