Modelling method and system based on position top-k keyword query under sliding window

A technology of sliding window and modeling method, which is applied in the computer field, can solve problems such as the lack of update rate of the system, and achieve the effect of improving query speed, high accuracy, and high arrival rate

Active Publication Date: 2017-12-08
SHENZHEN UNIV
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, neither system has a good update rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Modelling method and system based on position top-k keyword query under sliding window
  • Modelling method and system based on position top-k keyword query under sliding window
  • Modelling method and system based on position top-k keyword query under sliding window

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention is described in further detail now in conjunction with accompanying drawing. These drawings are all simplified schematic diagrams, which only illustrate the basic structure of the present invention in a schematic manner, so they only show the configurations related to the present invention.

[0050] 1. Problem definition

[0051] Let D be a two-dimensional Euclidean space, W be a sliding window, and S be a collection of geographic text information in D and W. Each geographic text information is expressed as o=(pos, text), where pos is a position point in D, and text is text information. A LkTQ q consists of a tuple (loc,k), where loc represents the query location point, and k represents the number of result keywords that can be specified by the user. Finally, k keywords with the highest position-aware word frequency scores in the information in W are returned.

[0052] The position-aware word frequency score of a word t in a sliding window W is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a modelling method and system based on position top-k keyword query under a sliding window. The method comprises the following steps that 1, a geographical range covered by a quadtree and a node-splitting rule are determined; 2, a data flow is received, and data is inserted into nodes; 3, nodes which conform to the node-slitting rule in step 1 are split, and the intact quadtree is generated constantly by data inserting; 4, the word frequency of each leaf node is counted, and an reverse index of each leaf node is stored; 5, MG aggregated abstract information of each child node of each nonleaf node is stored; 6, the size of the sliding window needs to be maintained in the data inserting processes in step 4 and 5, a data item provided with the oldest timestamp is deleted, the newest data is added, and the index structure of the quadtree is adjusted. By means of the method, the cost is effectively reduced, the query speed is effectively improved, and geographical text data flows with high arrival rates can be processed.

Description

technical field [0001] The invention belongs to the field of computers, and in particular relates to a modeling method, in particular to a modeling method based on position top-k keyword query under a sliding window. In addition, the invention also relates to a modeling system based on position top-k keyword query under a sliding window. Background technique [0002] With the proliferation of social media, cloud storage and location-based services, the number of messages containing text and geographic information (for example, geotagged tweets) has soared. Such news, which can be modeled as geotext data streams, can often provide first-hand information for a variety of local events of different types and sizes, including news stories in a region, urban disasters, local business promotions, and events of public concern in the city. hot topics etc. [0003] The data streams of location-based social media have the following properties: (1) Bursty nature—if users do not discov...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/319G06F16/322G06F16/3323
Inventor 毛睿李荣华陆敏华王毅罗秋明商烁刘刚
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products