System for automatic classification analysis for website based on website content
An automatic classification and website technology, applied in the direction of network data retrieval, network data indexing, special data processing applications, etc., can solve the problems of slow update speed, low efficiency, high maintenance cost, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0039] The present invention will be further described below in conjunction with the accompanying drawings.
[0040] Such as figure 1 As shown, the number of links to the industry benchmark website is judged, and if it is greater than a certain threshold, the homepage data is captured, otherwise, the next-level link data is captured; the captured data is preprocessed and the text content of the web page is analyzed. Then determine the effective node of the container, if not, it is judged to be noise and deleted, otherwise the node block word segmentation is processed; the importance of the feature word category is calculated, and the feature word category discrimination is obtained through the calculation of the website category feature thesaurus, combined with the importance and The degree of differentiation is used to obtain the weight set of characteristic keywords; the set of website category characteristic keywords is further obtained to establish the website category tem...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com