Retrieved result clustering system in coal mine search engine

A search engine and clustering technology, which is used in network data retrieval, network data indexing, and other database retrieval directions.

Inactive Publication Date: 2014-06-25
HENAN POLYTECHNIC UNIV
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Based on the above analysis, for the retrieval results, the traditional display method of only providing a list of documents sorted by relevance shows certain shortcomings, and it is urgent to carry out in-depth analysis and processing of the retrieval results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Retrieved result clustering system in coal mine search engine
  • Retrieved result clustering system in coal mine search engine
  • Retrieved result clustering system in coal mine search engine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The search result clustering system in the coal mine search engine includes a search result clustering and category label extraction device. The device includes a search engine server, a text search result clustering module and a category label extraction module. The coal mine search engine server processes query requests submitted by users , the initial search results generated are returned to the user after the text search result clustering module; in the text search result clustering module, the following methods are used for data analysis:

[0020] (1) Initialization: express the retrieval result document set as Among them, A represents the document-feature word matrix corresponding to the document set, m is the number of documents, n represents the number of feature words, and w ij Indicates the weight of the j-th feature word in the i-th document, i and j are natural numbers, 1≤i≤m, 1≤j≤n.

[0021] (2) Dimensionality reduction: decompose the matrix A into the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a retrieved result clustering system in a coal mine search engine. The search result clustering system in the coal mine search engine comprises a retrieved result clustering and category label drawing device. The retrieved result clustering and category label drawing device comprises a search engine server, a text retrieved result clustering module and a category label drawing module. The coal mine search engine server processes inquire requests submitted by a user, and a generated initial retrieved result passes through the text retrieved result clustering module and then returns to the user. By the adoption of the retrieved result clustering system, the clustering speed of text sets can be effectively increased, and subjectivity and randomness caused when a similarity calculation method is selected can also be avoided. When data objects are combined into clusters, the similarity relation of the data objects is measured by calculating mutual information loses generated when the data objects are combined, and retrieved result documents can be grouped in a high-quality mode on the basis of the similarity relation.

Description

technical field [0001] The invention belongs to the field of coal mine safety. Background technique [0002] In the field of coal mines, the explosive growth of Internet information has brought certain challenges to the use and management of information. It has become an extremely urgent need to accurately and quickly discover the coal mine field information that users need from such a large amount of data that is complex and disorderly. Therefore, information retrieval technology has been deeply researched and widely used in the field of coal mines. [0003] Search engine is one of the tool applications frequently used by coal mine users. In a typical interaction process between a user and a Web search engine, the user expresses a specific information requirement as a query and submits it to the Web search engine; after the server processes the retrieval request, it returns a list of retrieval results. Among these results, some may be relevant to the user's search intent...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35G06F16/951G06F40/30G06Q50/02
Inventor 刘永利赵珊王建芳雒芬赵建贵
Owner HENAN POLYTECHNIC UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products