Marine space-time big data parallel KNN query processing method based on PID

A processing method and marine data technology, applied in the direction of digital data processing, special data processing applications, database indexing, etc., can solve the problems of affecting query processing efficiency, low indexing efficiency, low iterative processing efficiency, etc., to reduce data scanning time, improved query processing, and the effects of faster query processing

Active Publication Date: 2021-06-22
NORTHEASTERN UNIV
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional KNN query processing method usually adopts a centralized data processing method, which is not suitable for the processing of marine big data. However, the existing KNN query processing algorithms in a distributed environment, such as ParallelCircularTrip, etc., are mostly based on the MapReduce framework. MapReduce is Disk-based processing framework, less efficient for iterative processing
At the same time, these existing algorithms usually use index structures such as grid index and R-tree index, and their indexing efficiency is not high.
When performing KNN query processing, the query radius growth step is fixed, resulting in too many queries and subsequent calculations are too large, which affects the efficiency of query processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Marine space-time big data parallel KNN query processing method based on PID
  • Marine space-time big data parallel KNN query processing method based on PID
  • Marine space-time big data parallel KNN query processing method based on PID

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0042] In this embodiment, a Spark cluster composed of five IBM X3650 M4 servers is set up as the test operation environment of the method of the present invention, wherein one server is used as a Master node, and the remaining servers are used as Worker nodes. The memory configuration, network card configuration, hard disk configuration, and CPU configuration of each node are the same, as shown in Table 1.

[0043] Table 1 Server configuration

[0044] configuration Specification CPU Intel(R) Xeon(R) CPU 2.00GHz Memory 32GB DDR RAM hard disk 3.5 inches 7200rpm 2TB network card 1Gb / s adaptive Ethernet card

[0045] In this embo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a marine space-time big data parallel KNN query processing method based on PID, and relates to the technical field of space-time big data management. According to the method, a PID controller technology which is widely used in the industry is introduced for the first time, and variable step size searching processing based on a feedback mechanism is achieved. The method comprises the following steps: firstly, pre-processing collected marine data, dividing the data by adopting a grid division method, and indexing the pre-processed marine data by adopting a grid indexing technology on the basis; coding each grid unit in a row sorting mode; judging which rows and columns are within the radius range of a circle by using a row sorting grid index, so as to directly judge whether the rows and the columns intersect with the circle or not; when KNN query is carried out, the adjustability of a PID system is utilized, the search range is dynamically adjusted through negative feedback, dynamic prediction of the query radius in KNN query processing is achieved, the number of KNN query times is reduced, and therefore the KNN query processing speed is increased.

Description

technical field [0001] The invention relates to the technical field of spatiotemporal big data management, in particular to a PID-based parallel KNN query processing method for marine spatiotemporal big data. Background technique [0002] Since the 21st century, with the rapid development of information technology and the rapid development of ocean observation technologies such as ocean remote sensing and ocean buoys, the scale of ocean data has exploded and has become a very important type of big data. The ocean field has entered the era of big data. KNN query usually refers to given a spatial data set and a query point, and returns the k results closest to the query point that meet the query conditions. As a very important spatial query operation, KNN query is widely used in space application systems, and also has important applications in marine application systems such as marine detection, marine rescue, and marine information privacy protection. How to perform efficien...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/215G06F16/2455G06F16/27G06K9/62G06N3/04
CPCG06F16/2228G06F16/215G06F16/2455G06F16/27G06N3/049G06N3/045G06F18/213G06F18/24147Y02A90/10
Inventor 乔百友马玲郝元卿胡兵孙永佼吴刚韩东红
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products