Graph-index-based graph database keyword vicinity searching method

A proximity search and keyword technology, applied in the field of database keyword search, can solve problems such as performance degradation, inefficient index structure, failure, etc., to achieve the effect of improving efficiency and eliminating adverse effects

Inactive Publication Date: 2011-08-24
WUHAN UNIV
View PDF0 Cites 40 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, a Steiner graph is not a reasonable query answer, because it contains too much information, and compared with the minimum connection tree, users cannot accurately identify useful information from it. In addition, there are too many repeated information among Steiner graphs. None of the answers make sense
Secondly, there are still some technical problems that have not been solved, mainly including: the r-radius subgraph may be very large if there is no upper limit on the size, according to our investigation of actual data, when the subgraph is too large, the entire The method fails, and the query speed is slower; the simple graph index structure from keywords to subgraphs is relatively inefficient, because the query process also needs to know the nodes containing keywords, and the mapping relationship between nodes and subgraphs And other information, multiple index queries caused an increase in the number of disk I / O, which directly led to performance degradation (see Document 5)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Graph-index-based graph database keyword vicinity searching method
  • Graph-index-based graph database keyword vicinity searching method
  • Graph-index-based graph database keyword vicinity searching method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] 1. Theoretical basis

[0057] The process of keyword proximity search on the graph is: Given a user’s keyword query {k 1 , k 2 ,...k l}, so that all the keywords that contain the query k i The set of nodes with (1≤i≤l) is M i , called the keyword k i set of matching nodes; using a certain heuristic algorithm, starting from the matching node set of each keyword, iteratively explores the nodes in the graph, if a visited node is connected to the matching nodes of all keywords by a known path , then generate a minimum connection tree with this node as the root node, if it is not covered by other generated answer trees, then save it as an answer tree; when k answer trees have been found , and the convergence condition is met, that is, no answer tree better than the current best k answer trees will be found, then the search is terminated.

[0058] Existing related methods generally adopt two ways to optimize this process: improve the efficiency by improving the heuristi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of database keyword search, in particular to a graph-index-based graph database keyword vicinity searching method. The method comprises an off-line index constructing step and an on-line searching step, wherein the off-line index constructing step comprises the following steps of: defining a d-distance sub-graph for a weighted graph G=(V, E, w), generating d-distance sub-graphs which take certain nodes in the graphs as centers by an incomplete Dijkstra algorithm, storing the acquired largest d-distance sub-graph in a sub-graph base, constructing indexes from the nodes to the sub-graphs, designing a d-distance graph index structure, and storing the d-distance graph indexes in an interpreted mode; and the on-line searching step comprises the following steps of: searching in a combined sub-graph by a searching algorithm, and outputting a top-k answer. By zooming out a search target from an integral graph to a group of sub-graphs of limited sizes, the searching efficiency can be greatly improved.

Description

technical field [0001] The invention relates to the technical field of database keyword search, in particular to a graph index-based graph database keyword proximity search method. Background technique [0002] Keyword proximity search is a mainstream technology for keyword search of structured and semi-structured data in databases. Different from the traditional keyword search technology applied to unstructured data, such as the method of search engines such as Google, this technology does not only find a single document or object that contains all the given keywords, but searches in the database A structure that includes all keywords, and these structures are composed of some objects that are structurally related to each other, and as a whole can meet the information needs of users. It does not require users to master structured query languages ​​and complex database schemas, but also fully explores the potential value of structured information in the data in the database...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 钟鸣刘梦赤桑雷汪帅
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products