Space keyword query method and device based on geohash and Patricia Trie

A query method and keyword technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as retrieval efficiency constraints, and achieve the effect of avoiding multi-path query problems and low storage space overhead

Active Publication Date: 2018-11-20
KUNMING UNIV OF SCI & TECH
View PDF9 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The variant of R-Tree optimizes R-Tree based on the principles of minimum area, minimum overlapping area, minimum perimeter and high storage utilization, but the retrieval efficiency is still restricted by the multi-path query problem of R-Tree.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Space keyword query method and device based on geohash and Patricia Trie
  • Space keyword query method and device based on geohash and Patricia Trie
  • Space keyword query method and device based on geohash and Patricia Trie

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0041] Embodiment 1: as Figure 1-Figure 6 As shown, a spatial keyword query method based on geohash and Patricia Trie, including:

[0042] In the data preprocessing step, all the position points in the data set D are encoded into a string geoStr of length n by the geohash spatial encoding method, and geoStr is used as the sorting object to sort each row of data in the data set D in lexicographical order and generate a serial number id , each row of data is called a record r, and a data set composed of one or more rows of records r is called a record set R;

[0043] The step of building a spatial keyword index is to construct a Patricia Trie for geoStr, and the leaf nodes of the Patricia Trie point to the inverted index constructed according to the keywords in the field. The list elements of the inverted index are keywords and their corresponding id lists, and the spatial key is obtained Word index structure; wherein, the field refers to the character string formed from the r...

Embodiment 2

[0103] Embodiment 2: a kind of spatial keyword query method based on geohash and Patricia Trie, comprising:

[0104]In the data preprocessing step, all the position points in the data set D are encoded into a string geoStr of length n by the geohash spatial encoding method, and geoStr is used as the sorting object to sort each row of data in the data set D in lexicographical order and generate a serial number id , each row of data is called a record r, and a data set composed of one or more rows of records r is called a record set R;

[0105] The step of building a spatial keyword index is to construct a Patricia Trie for geoStr, and the leaf nodes of the Patricia Trie point to the inverted index constructed according to the keywords in the field. The list elements of the inverted index are keywords and their corresponding id lists, and the spatial key is obtained Word index structure; wherein, the field refers to the character string formed from the root node to the leaf node...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a space keyword query method and device based on geohash and Patricia Trie. The method comprises a data preprocessing step of encoding all position points in a data set D intoa character string geoStr with the length of n through a geohash spatial coding method and sorting and generating numbers id for each row of data in the data set D by a lexicographic order with geoStr as a sorting object, a space keyword index establishment step of constructing Patricia Trie for geoStr and obtaining a space keyword index structure, wherein a leaf node of Patricia Trie points at an inverted index constructed according to a keyword in a domain, a list element the inverted index is a keyword and a corresponding id list, and the domain refers to a character string formed from a root node to the leaf node, a space keyword query step of retrieving a space keyword index structure, obtaining an id satisfying a query condition, obtaining an id candidate set through filtering, verifying the candidate set and returning a position point satisfying a space query condition. According to the method and the device, the keyword query in any space range can be efficiently supported.

Description

technical field [0001] The invention relates to a spatial keyword query method and device based on geohash and Patricia Trie, belonging to the field of spatial keyword query (Spatial Keyword query), location-based service (Location-Based Service, LBS) and other fields. Background technique [0002] In recent years, with the popularization of GPS mobile devices and the development of social networks, a large number of location-based services have emerged, such as: social services (WeChat, Momo, Twitter, Foursquare, etc.), food and housing services (Ctrip, Qunar, Met Group takeaway, etc.), navigation services (Gaode Map, Tencent Map, Google Map, etc.), entertainment services (City Hot Fight, Wikitude, etc.). These services generate a large amount of data with location attributes, that is, spatial text. How to quickly obtain the required information from these data is the core issue in the field of spatial text query. According to application scenarios, spatial text query can ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 沈兵林贾连印李孟娟李晓武丁家满左喻灏胡俊涛雷妍
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products