Student browsed webpage classification method
A technology for browsing webpages and classification methods, applied in network data navigation, network data retrieval, instruments, etc., can solve problems such as reducing classification accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0076] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.
[0077] Step 1: Crawl the URL, URL description content, URL primary classification and URL secondary classification from the navigation website, and save them in the URL collection, build a four-category corpus, and express the URL description content text in the corpus as uni-gram and In the form of bi-gram, use TF-IDF as the weight of the text feature, and use the naive Bayesian classification algorithm to obtain the classifier, specifically as figure 2 Shown:
[0078] Step 1.1: Define textual stop word set SWORD={sword 1 ,sword 2 ,...,sword num}, among them, sword swi is the swi-th stop word, and nun is the total number of stop words; define the Naive Bayesian smoothing parameter Alpha, where Alpha∈(0,1); define four categories of the corpus, namely entertainment and leisure, computer network, and life Service and Cultural Education, G ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com