Automatic webpage classification method and system
An automatic webpage classification technology, applied in the Internet field, that addresses problems such as low accuracy, low efficiency, and inapplicability to huge data volumes.
Status: Inactive; Publication Date: 2010-08-25
SHANGHAI FUGE INFORMATION SCI & TECH
0 Cites, 58 Cited by
AI Technical Summary
Problems solved by technology
The technical problem to be solved by the present invention is to provide a method and system for automatically classifying webpages that overcomes the defects of the prior art, such as low accuracy, low efficiency, and inapplicability to situations with a huge amount of data.
Abstract
The invention discloses an automatic webpage classification method and system. The method comprises the following steps: S1: searching for website webpages related to a client webpage, and capturing the titles and variable data information of those webpages to form a webpage list set; S2: processing the characters in the webpage list set to form a classification keyword list set; S3: counting how often each classification keyword in the classification keyword list set occurs on each webpage, so that each webpage corresponds to one classification keyword frequency vector; and S4: establishing a statistical model, calculating the classification keyword frequency vector distance between each target webpage and the client webpage to obtain the correlation degree between each target webpage and the client webpage, and automatically classifying the target webpages on the basis of the correlation degree. The invention can automatically find the optimum webpages based on unbiased estimation, so that a large number of high-quality candidate webpages can be recommended to clients for link exchange.
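Steps S2 to S4 can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the statistical model, the distance measure, or how correlation degree maps to classes, so the Euclidean distance, the fixed `threshold`, the stopword list, and all function names below are assumptions made only for illustration. Step S1 (searching and capturing pages) is omitted, and page titles and texts are passed in directly.

```python
import math
import re
from collections import Counter

# Hypothetical stopword list; the source does not describe how characters are processed in S2.
STOPWORDS = frozenset({"the", "a", "an", "and", "of", "for", "to", "in"})


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def keyword_list(titles):
    """S2 (assumed form): derive a sorted classification keyword list from page titles."""
    words = set()
    for title in titles:
        words.update(w for w in tokenize(title) if w not in STOPWORDS)
    return sorted(words)


def frequency_vector(page_text, keywords):
    """S3: relative frequency of each classification keyword on one page."""
    counts = Counter(tokenize(page_text))
    total = sum(counts[k] for k in keywords)
    if total == 0:
        return [0.0] * len(keywords)
    return [counts[k] / total for k in keywords]


def distance(u, v):
    """Euclidean distance between two frequency vectors (an assumed choice of metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def classify(client_text, targets, keywords, threshold=0.5):
    """S4 (assumed form): label each target page by its distance to the client page."""
    client_vec = frequency_vector(client_text, keywords)
    result = {}
    for url, text in targets.items():
        d = distance(client_vec, frequency_vector(text, keywords))
        # A smaller distance is treated here as a higher correlation degree.
        result[url] = ("related" if d <= threshold else "unrelated", d)
    return result


if __name__ == "__main__":
    keywords = keyword_list(["Cheap flights", "Hotel deals", "Yarn patterns"])
    targets = {
        "page-a": "flights hotel flights deals",
        "page-b": "knitting yarn patterns",
    }
    for url, (label, d) in classify("cheap flights and hotel deals flights", targets, keywords).items():
        print(url, label, round(d, 3))
```

In this sketch a target page whose keyword distribution resembles the client page's gets a small distance and is labelled as related; a real system would replace the fixed threshold with whatever statistical model the full description defines.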
Description
Technical field
The invention relates to Internet fields such as search engine marketing, network link exchange and automatic webpage classification, and in particular to a method and system for automatic webpage classification that uses statistical methods to perform automatic search, content analysis and correlation classification on webpages.
Background art
Exchanging links with related web pages can increase website traffic, raise website popularity, and improve search engine rankings. It is the most commonly used technique in search engine marketing (SEM). However, obtaining high-quality links that are highly relevant to the content of the customer's web page remains a difficult problem for this technique. The current search engine optimization (SEO) technology uses manual search, third-party recommendation and other artificial means t...
Application Information
Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 魏亮, 丁力, 韩雪岭, 郭为, 张薇
Owner: SHANGHAI FUGE INFORMATION SCI & TECH