Method for ranking web pages on basis of hyperlink source analysis
A web page ranking technology based on hyperlink analysis, applied in the field of information retrieval, that addresses problems such as web page cheating
Examples
Example 1
[0082] Example 1: Comparison of the present invention with four existing algorithms in suppressing web page cheating, based on a synthetic network
[0083] The experimental data is a synthetic scale-free network. The network is generated using the BA model (Barabási-Albert model). The model parameters are shown in Table 1. The generated network contains 100 nodes and 1098 edges, and the network diameter is 4.
[0084] Table 1. Parameter settings of the BA model
[0085] Initial number of nodes: 5
Probability that an edge exists between the initial nodes: 0.3
Average node degree: 10
Total number of nodes in the network: 100
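As a rough illustration of how a network with the Table 1 parameters might be generated, the following is a minimal sketch of Barabási-Albert style preferential attachment in plain Python. The function name `ba_graph`, the choice of m = 5 new edges per node (taken as half of the average degree 10), and the `degree + 1` weighting are all assumptions made for this sketch; the source does not state them, and the sketch is not expected to reproduce the reported edge count or diameter.

```python
import random


def ba_graph(n_total=100, n_init=5, p_init=0.3, m=5, seed=0):
    """Sketch of a BA-style undirected graph (hypothetical helper).

    Returns (edges, degree): a set of (u, v) pairs with u < v,
    and a list with the degree of every node.
    """
    rng = random.Random(seed)
    edges = set()

    # Seed graph: each pair of initial nodes is linked with probability p_init.
    for i in range(n_init):
        for j in range(i + 1, n_init):
            if rng.random() < p_init:
                edges.add((i, j))

    degree = [0] * n_total
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1

    # Growth: each new node attaches to m distinct existing nodes,
    # chosen with probability proportional to (degree + 1).
    # The "+ 1" is an assumption so that isolated seed nodes stay reachable.
    for new in range(n_init, n_total):
        weights = [degree[v] + 1 for v in range(new)]
        targets = set()
        while len(targets) < min(m, new):
            targets.add(rng.choices(range(new), weights=weights)[0])
        for t in targets:
            edges.add((t, new))
            degree[t] += 1
            degree[new] += 1

    return edges, degree


edges, degree = ba_graph()
print(len(degree), "nodes,", len(edges), "undirected edges")
```

High-degree nodes accumulate links faster than low-degree ones, which is what gives the resulting network its scale-free character.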
[0086] The experiment uses the following two common cheating methods to test how well each algorithm suppresses cheating:
[0087] (1) Link exchange cheating: Several nodes in the network are designated as cheating nodes, and these nodes add links to each other t...
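The link exchange setup described in (1) could be simulated along the following lines. This is only a sketch: the adjacency representation (a dict mapping each node to the set of nodes it links to) and the name `add_link_exchange` are hypothetical, and the source's full description of this cheating method is truncated, so only the "cheating nodes link to each other" step is modeled.

```python
def add_link_exchange(out_links, cheaters):
    """Return a copy of the graph in which all cheating nodes link to each other.

    out_links: dict mapping node -> set of nodes it points to (assumed format).
    cheaters:  iterable of nodes designated as cheating nodes.
    """
    # Copy so the original (clean) network is left untouched for comparison.
    result = {u: set(vs) for u, vs in out_links.items()}
    cheaters = list(cheaters)
    for u in cheaters:
        result.setdefault(u, set())
        for v in cheaters:
            if u != v:
                result[u].add(v)  # directed link u -> v
    return result


clean = {0: {1}, 1: {2}, 2: set(), 3: {0}}
spammed = add_link_exchange(clean, cheaters=[1, 2, 3])
print(spammed)
```

Keeping the clean network unchanged makes it straightforward to rank both versions and compare how far the cheating nodes move.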
Example 2
[0100] Example 2: Comparison of the present invention with four existing algorithms in suppressing web page cheating, based on real network data
[0101] The experimental data is the WEBSPAM-UK2007 data set provided by Yahoo Labs. The data set contains the web pages and links of a total of 114,529 websites. Volunteers have labeled some of the websites as "non-cheating" or "cheating" at the host level. The specific information is shown in Table 3. This experiment uses the host-level network: if a page in one website points to a page in another website, there is a directed edge between the two website hosts. Because the TrustRank, DiffusionRank and AIR algorithms all require seed sets, some of the manually labeled "non-cheating" websites are used as seed sets for these algorithms. The remaining "non-cheating" sites and sites with domain names such as gov, ac, mod, nhs, sch, etc. together constitute the collection of auth...
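The host-level construction described above (a directed edge between two hosts whenever a page on one points to a page on the other) can be sketched as follows. The input format, a list of (source URL, target URL) pairs, and the function name `host_edges` are assumptions for illustration, as are the example URLs; the actual data set is distributed in its own format, which is not described here.

```python
from urllib.parse import urlsplit


def host_edges(page_links):
    """Collapse page-level links into a set of directed host-level edges.

    page_links: iterable of (source_url, target_url) pairs (assumed format).
    Links within the same host are dropped, since only links between
    different websites create an edge.
    """
    edges = set()
    for src, dst in page_links:
        src_host = urlsplit(src).hostname
        dst_host = urlsplit(dst).hostname
        if src_host and dst_host and src_host != dst_host:
            edges.add((src_host, dst_host))
    return edges


# Hypothetical page-level links.
links = [
    ("http://a.example.co.uk/index.html", "http://b.example.co.uk/page1"),
    ("http://a.example.co.uk/about.html", "http://b.example.co.uk/page2"),
    ("http://a.example.co.uk/index.html", "http://a.example.co.uk/about.html"),
    ("http://b.example.co.uk/page1", "http://c.example.ac.uk/"),
]
print(sorted(host_edges(links)))
```

Using a set means that many page-level links between the same pair of hosts collapse into a single directed edge.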
