Webpage searching result sequencing method based on content reference
A sorting method and web search technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as result interference and achieve the effect of avoiding interference
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0050] In the specific implementation plan, we used the Google search engine as a relevant webpage query tool to obtain 100 pending webpages. Use the jericho-html-2.5 toolkit to extract the text of the webpage and convert the webpage into a plain text format. Using the Sogou Internet Corpus as a large-scale Internet corpus, a list of invalid citation blocks is generated. Next, we describe the specific steps of the algorithm for an actual query "cross star" as follows:
[0051] Preparation: Divide the Sogou Internet Corpus into chunks, find the 50 chunks with the most occurrences, and generate a list of invalid reference chunks.
[0052] 1. Call the Google search engine to search for "cross star" and get the first 100 pages returned by it. These pages serve as relevant documents for the query term. We do not use the page ranking information given by Google, but use this algorithm to recalculate the ranking output for these 100 pages.
[0053] 2. Call the jericho-html-2.5 to...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com