A method for detecting spam
A technology of spam web pages and detection methods, applied in the fields of natural language processing, information retrieval, and data mining, can solve serious problems, high time complexity, and the influence of noise points on clustering, etc. Persuasion and representation, the effect of maintaining cultural health
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] Below in conjunction with accompanying drawing, the present invention will be further described:
[0026] The invention provides a method for detecting garbage web pages, such as figure 1 Shown is the overall flow diagram of the method of the present invention, including:
[0027] Step S101: Carry out the K-Means algorithm on the data set, store all objects n in the data set D, and the expression form of D is shown in formula (1).
[0028] D={x i |x i =(x i1 ,x i2 ,...,x id ),i=1,2,…,n} (1)
[0029] In formula (1), x i =(x i1 ,x i2 ,...,x id ) is a d-dimensional vector representing d different attributes of the i-th data, where i is the sample size. The data set D used in this embodiment is from the WEBSPAM-UK2007 data set, and the characteristic attributes are provided by the WebSpam Challenge platform, and its link is http: / / webspam.lip6.fr / wiki / pmwiki.php.
[0030] Step S201: Perform IPR calculation on the data set D, and sort the IPR values from high t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



