Webpage classification algorithm based on distributed computation
A technology of distributed computing and web page classification, applied in computing, special data processing applications, instruments, etc., that addresses the low efficiency of classification algorithms and improves classification accuracy.
Embodiment Construction
[0035] The process of the web page classification algorithm is shown in Figure 1. The algorithm comprises two processes: establishing the classification model and classifying web pages. Establishing the classification model mainly includes: preprocessing the web pages in the training set; calculating the TFIDF of the category feature words from the web page data; calculating the association relationships between the feature words; and calculating the position information of the feature words in the document. Of these, TFIDF is the weight calculation method used in the traditional naive Bayesian classification model, while the association relationships and position information are the calculations added by the present invention. The web page classification process includes: preprocessing the web pages; calculating the posterior probability of each category according to the classification model; and establishing and updating the dynamic lexicon. F...
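The baseline part of this pipeline (preprocessing, TFIDF weighting of category feature words, and choosing the category with the highest posterior probability) can be illustrated with a minimal, single-machine sketch. It is not the patented method: the function names `tokenize`, `train`, and `classify` are hypothetical, the smoothed IDF and add-one smoothing are assumptions, and the association, position, dynamic lexicon, and distributed computation steps are left out because the text above does not give their formulas.

```python
import math
from collections import Counter, defaultdict


def tokenize(page):
    # Stand-in for the preprocessing step (assumption): lowercase, split on whitespace.
    return page.lower().split()


def train(pages):
    """Build a TFIDF-weighted naive Bayes model from (text, category) pairs."""
    docs = [(tokenize(text), cat) for text, cat in pages]
    docs = [(tokens, cat) for tokens, cat in docs if tokens]  # skip empty pages
    n_docs = len(docs)

    # Document frequency of each word over the whole training set.
    df = Counter()
    for tokens, _ in docs:
        df.update(set(tokens))
    # Smoothed inverse document frequency (an assumed variant).
    idf = {w: math.log((1 + n_docs) / (1 + df[w])) + 1.0 for w in df}

    # Sum the TFIDF weight of every feature word within each category.
    weights = defaultdict(Counter)
    cat_counts = Counter()
    for tokens, cat in docs:
        cat_counts[cat] += 1
        for word, count in Counter(tokens).items():
            weights[cat][word] += (count / len(tokens)) * idf[word]

    return {
        "weights": weights,
        "cat_counts": cat_counts,
        "vocab": set(df),
        "n_docs": n_docs,
    }


def classify(model, page):
    """Return (best_category, log_posteriors) for one page."""
    tokens = [t for t in tokenize(page) if t in model["vocab"]]
    vocab_size = len(model["vocab"])
    scores = {}
    for cat, n_cat in model["cat_counts"].items():
        total = sum(model["weights"][cat].values())
        log_p = math.log(n_cat / model["n_docs"])  # log prior of the category
        for t in tokens:
            # Add-one smoothing over the TFIDF weights (assumption).
            log_p += math.log((model["weights"][cat][t] + 1.0) / (total + vocab_size))
        scores[cat] = log_p
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    training_pages = [
        ("football match goal team", "sports"),
        ("team wins league match", "sports"),
        ("stock market shares price", "finance"),
        ("bank raises interest rate price", "finance"),
    ]
    model = train(training_pages)
    label, _ = classify(model, "the team scored a goal in the match")
    print(label)
```

Words that never appeared in training are ignored at classification time, and scores are kept as log probabilities so that long pages do not underflow. The extra terms the description attributes to the invention would presumably adjust the per-word weights, but how they do so is not stated in the text above.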