Search result clustering method

A technology for search results and clustering, which is applied in the field of information retrieval and can solve the problems that clustering methods and systems have not yet appeared.

Inactive Publication Date: 2005-04-27
孙斌
View PDF0 Cites 48 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such clustering methods and systems have not yet appeared

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Search result clustering method
  • Search result clustering method
  • Search result clustering method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The above technical solutions will be further described below in conjunction with the accompanying drawings and embodiments.

[0027] The first step of the document retrieval system is to index the acquired document collection and generate a data structure suitable for computer search operations, so as to effectively find relevant documents according to user queries. Document collections generally include various forms of electronic documents, such as web pages (HTML documents) posted on Internet sites and data files in other formats. Large-scale document retrieval systems usually use an inverted index, that is, use keywords to index each document containing the keyword, and record information such as the frequency and location of the keyword in the document.

[0028] In the field of information retrieval, "keywords" generally refer to terms used for document indexing and retrieval, including the characteristic terms in documents, namely "index terms" and the characteri...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The search result clustering process includes the following steps: pre-recording one or several sorts relative to the key word(s) included in the indexed document; and classifying the documents of the search result based on the sorts relative to the key word(s) included search request. The said sorts may be any document classifying marks or key words, and each sort may have one set weight. The documents in the search result is set in the sort set of corresponding inquiry key words, and the grade of the clustering sort may be calculated based on the included document grade. The clustering process may be completed in high efficiency, and is suitable for clustering of search result in large scale document searching system. In addition, the grading of clustering sorts makes it possible to exhibit documents with higher grade to the user first.

Description

technical field [0001] The invention relates to the technical field of information retrieval, in particular to a method for automatically clustering retrieved results, such as a method for clustering user query results in an online document retrieval system or a network search engine. Background technique [0002] At present, the search results returned by computer or computer network-based document retrieval systems for user queries usually include a list of document representations (such as titles, abstracts) or document links. The degree of relevance is sorted from high to low. The user further finds and selects actually relevant or useful documents in this list. For a very large document library, such as a webpage library collected by an Internet search engine, the search results returned by the system to the user are usually hundreds or even thousands of document links. It is a great burden for users to find useful information in a large number of returned results, an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F17/30707G06F16/353
Inventor 孙斌
Owner 孙斌
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products