Check patentability & draft patents in minutes with Patsnap Eureka AI!

An Optimal Method for Result Cache Replacement

A technology of result caching and optimization method, applied in the field of computer network, can solve problems such as having a large distance, not considering the influence of the probability of the search request reappearing in the search result, and not considering the semantic characteristics of the search request, etc.

Active Publication Date: 2016-12-07
PEKING UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] These methods are very meaningful for improving the hit rate of the result cache. However, most of them only consider the characteristics of the search request itself, and do not consider the impact of the quality of the search results on the probability of the search request reappearing, nor the semantic characteristics of the search request. There is still a large distance between the hit rate of the result cache and the best hit rate that can be obtained

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Optimal Method for Result Cache Replacement
  • An Optimal Method for Result Cache Replacement
  • An Optimal Method for Result Cache Replacement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention is applicable to the result cache, and its specific implementation method is to apply the two improvements of F1 and F2 to the original result cache replacement strategy, thereby improving the original result cache replacement strategy to improve the hit rate of the result cache the goal of.

[0050] The method process of the present invention is as figure 1 As shown, this embodiment includes the following steps:

[0051] Step 1: Initialize the result cache, that is, initialize a result cache space, including the initialization replacement method, the number of stored result items and other environment variables.

[0052] The second step: extract and preprocess the data required in this method, including:

[0053] ●Read the system search log (Query Log).

[0054] ● Obtain the total number of occurrences of all pages and the total number of user clicks.

[0055] ● Extract the URL of each different page in the system log, and use the URL as the K...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an optimization method for result cache replacement and belongs to the technical field of computer networks. The method comprises the following steps: (1) taking a user ID (Identity) in a system searching log as a key and a request sent by a user as a key value to establish a hash table h2; (2) splicing all the requests sent by the same user ID to obtain a character string; carrying out a topic model analysis on a document set composed of the spliced character strings of all users to obtain a topic set M and the relevancy; (3) calculating appearing times of searching requests related to a certain topic and a specific value of the appearing times of all the requests in the same time period; calculating a weight contribution value of the request; (4) when obtaining one request q, finding a matched result cache and returning; updating the h2; if no matched result cache exists, finding the corresponding key value in the h2 to be spliced with the q; calculating the weight contribution value of the q; and (5) substituting a result item with a lower weight in the result cache and adding a new cache item. According to the optimization method for the result cache replacement, the efficiency of the cache hit ratio can be greatly improved.

Description

technical field [0001] The invention relates to a method for improving the hit rate of result cache in a search engine and a full-text retrieval system, specifically a method for improving the hit rate of result cache by using the result quality and semantic features of search requests, and belongs to the technical field of computer networks. Background technique [0002] Modern search engines have to process hundreds of millions to billions of search requests every day, so fast processing of search requests is a crucial issue. Result caching is a very important performance component in modern search engines. Therefore, how to effectively improve the efficiency of result caching is a problem that has attracted much attention in academia and industry. [0003] As a kind of cache, the result cache satisfies the basic assumption of the cache mechanism, that is, the principle of locality of search requests, that is, search requests that have appeared before may appear again in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F12/127G06F16/9535
Inventor 宗朗陈薇王腾蛟杨冬青
Owner PEKING UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More