Method for judging affiliation of Internet website through clustering algorithm
A clustering algorithm and Internet technology, applied in computing, computer components, network data retrieval, etc., can solve problems such as wrong determination of attribution, inability to determine the attribution of websites, etc., and achieve the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0042] Specific implementation mode 1, such as figure 1 As shown, a method for determining the attribution of an Internet website through a clustering algorithm according to the present invention is characterized in that it comprises the following steps:
[0043] Step a, input the website collection of the unit to be determined to belong to, and the basic data is the website URL;
[0044] Step b, extracting the basic information of the website;
[0045] Step c, quantifying all the information extracted in step b;
[0046]Step d, map various eigenvalues to the [0, 1] interval under the same dimension; use the normalize function of the sklearn module to realize the normalized eigenvector FN website ;
[0047] FN website =[FN ip ,FN domain ,FN title ,FN keywords ,FN copyright ,FN recordID ,FN recordENTITY ];
[0048] Step e, using the unsupervised clustering algorithm DBSCAN to cluster the data set, so that the websites belonging to the same unit are clustered under...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com