Method for classifying Chinese webpages based on keyword frequency analysis
Patent Information
- Authority / Receiving Office
- CN · China
- Current Assignee / Owner
- HUAIHAI INST OF TECH
- Publication Date
- 2009-12-02
- Estimated Expiration
- Not applicable · inactive patent
Abstract
Description
technical field
[0001] The present invention is aimed at the research of the keyword frequency analysis of Chinese webpage and the webpage classification method based on the keyword frequency analysis, and mainly studies how to filter and extract the content of the Chinese webpage through technical means, word segmentation and frequency analysis of webpage keywords, It also studies how to classify webpages by weighted Chinese webpage keywords, involving technical fields such as automatic webpage acquisition, Chinese webpage preprocessing, Chinese word segmentation and keyword frequency analysis, and fuzzy classification of Chinese webpages. Background technique
[0002] With the rapid development of Internet technology and Web technology, the number of web pages on the Internet is constantly increasing. The increase of network information greatly facilitates people to obtain information, but the excessive amount of information also brings a lot of difficulties for people to ...