Selection method of distributed information retrieval sets based on historical click data

A click data, information retrieval technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as unsatisfactory correlation

Inactive Publication Date: 2014-07-16
ZHEJIANG UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Usually users only care about the top results returned by search engines, but the query results returned by current search engines are not ideally related to user needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Selection method of distributed information retrieval sets based on historical click data
  • Selection method of distributed information retrieval sets based on historical click data
  • Selection method of distributed information retrieval sets based on historical click data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] Such as figure 2 As shown, the implementation steps of the distributed information retrieval collection selection method based on historical click data in the embodiment of the present invention are as follows:

[0044] 1) The search proxy server preprocesses the query log, and extracts historical queries and their corresponding click data;

[0045] 2) The retrieval proxy server calculates the correlation between historical queries and various information collections stored on the information retrieval server based on the click data;

[0046] 3) The retrieval proxy server obtains the new query sent by the user, and calculates the comprehensive similarity between the new query and each historical query;

[0047] 4) The retrieval proxy server selects multiple historical queries most similar to the new query according to the comprehensive similarity, and calculates the correlation between the new query and each information collection according to the selected historical ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a selection method of distributed information retrieval sets based on historical click data, wherein the method comprises the steps: 1), a retrieval proxy server performs preprocessing to a query log to extract historical query and click data; 2, the retrieval proxy server computes correlation degree between the historical query and each information set according to the click data; 3), the retrieval proxy server computes comprehensive similarity between the new query and each historical query; 4), the retrieval proxy server selects the most similar historical query according to the comprehensive similarity and computes correlation degree between the new query and each information set according to the historical query and the selected correlation degree between the historical query and each information set;5), the retrieval proxy server selects a plurality of information sets, sends a retrieval request and combines the result returned by the retrieval proxy server to output to a user sending new query. The method has the advantages of high retrieval result accuracy, low network bandwidth consumption, fast response speed and economic and efficient retrieval.

Description

technical field [0001] The invention relates to a distributed information retrieval technology, in particular to a collection selection method for retrieval information in a distributed information retrieval system. Background technique [0002] With the rapid development of computer technology, communication technology, and network technology and the increasing popularity of Internet applications, the number of electronic documents is increasing day by day, making electronic documents a huge information base. The explosive growth of information on the World Wide Web has also made the Web a huge information repository. How to manage these ultra-large-scale data, prevent users from being submerged in huge databases and quickly find the information they need. At present, there are mainly two solutions: one is centralized, that is, a single high-performance server is used to manage massive data in a unified manner, and to provide users with unified services. This solution has ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 陈岭刘颖
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products