Unlock instant, AI-driven research and patent intelligence for your innovation.

Querying method using MapReduce-based IVkNN algorithm

An algorithm and key-value pair technology, applied in the field of cloud computing and big data, can solve the problems of unsatisfactory index time consumption, inability to adapt to MapReduce parallel processing, poor scalability, etc., and achieve the effect of improving the efficiency of algorithm indexing

Inactive Publication Date: 2018-05-11
DALIAN UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because the existing indexing algorithms cannot adapt to the parallel processing of MapReduce, the time consumption of index construction is not ideal, and the scalability is not good.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Querying method using MapReduce-based IVkNN algorithm
  • Querying method using MapReduce-based IVkNN algorithm
  • Querying method using MapReduce-based IVkNN algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] Embodiment 1: a kind of large-scale neighbor query method based on MapReduce and inverted Thiessen polygon, wherein: MapReduce is existing programming model, is used for the parallel operation of large-scale data set, and described method comprises the steps:

[0030] S1. Construct a MapReduce-based inverted Voronoi index (Tyson polygon index, InvertedVoronoiIndex, IVI);

[0031] S2. Use the inverted Voronoi index to partition the data sets R and S to obtain VC partitions. The two partitions are because the Voronoi diagram needs to be combined with two partitions when the Voronoi diagram needs to be established later, so it is carried out in this step. Partitioning of the two datasets;

[0032] S3. Use MapReduce-based IVKNN to perform distributed kNN (neighborhood algorithm) query. IVKNN is an inverted Voronoi index using IVI.

[0033] Wherein: the steps of constructing the inverted Voronoi index based on MapReduce are as follows:

[0034] Given two data sets R and S ...

Embodiment 2

[0048] Embodiment 2: This embodiment can be implemented as an independent technical solution or as a further illustration of each solution in Embodiment 1. This embodiment provides a large-scale neighbor query method based on MapReduce and inverted Thiessen polygons. This method It is an efficient algorithm based on MapReduce that uses Voronoi diagrams to process kNN queries. It can also solve the future development trend of wireless, networked, and mobile medical call systems. This embodiment also improves on the deficiencies in the prior art, and has good efficiency and scalability. In order to achieve the above object, the execution steps of the technical solution adopted in this embodiment are as follows: Carry out the establishment of the large-scale neighbor query index of MapReduce and inverted Thiessen polygon; MapReduce is a kind of current popular programming frame based on cloud platform, it can Process and generate large data sets, which leverage shared-nothing clu...

Embodiment 3

[0057] Example 3: With the rapid development of social security services today, people's material living standards are improving day by day, and the demand for medical services has become more humane and personalized. At the same time, more and more people need more convenient and perfect medical services.

[0058]With the rapid growth of mobile communication and location-based service-related technologies, technologies such as cloud computing, big data, Internet of Things, mobile computing, and spatial positioning have gradually matured, such as GPS, cameras, and Bluetooth data. This makes the storage and processing of various spatial data or objects face great challenges. Therefore, with the development of informatization, applications such as electronic medical records, nursing call center systems, and large-scale medical databases in the medical service industry are also developing rapidly, playing a greater role in improving work efficiency, improving medical services, an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a querying method using a MapReduce-based IVkNN algorithm, belongs to the field of cloud computing and big data, and aims at solving the problem of improving the indexing efficiency of existing data querying methods. According to the technical scheme, a file is loaded into a distributed cache through a master node, Mappers read key-value pairs formed by information of eachpartition where Ri belongs to set R and Sj belongs to set S, a map function generates new key-value pairs, and for each object, r belongs to set R, wherein the key is the id of the partition. The method has the advantages that the algorithm index efficiency can be improved, and MapReduce is no longer affected by space and time.

Description

[0001] This application is a divisional application with application number 2017109759745, application date 2017-10-19, and invention name "Large-scale neighbor query method based on MapReduce and inverted Thiessen polygons". technical field [0002] The invention belongs to the fields of cloud computing and big data, and relates to a MapReduce index that can effectively improve query efficiency in a distributed environment. Background technique [0003] MapReduce is a popular cloud-based programming framework that can process and generate large data sets, and it utilizes shared-nothing clusters to support data-intensive applications. The specific processing steps are: in the distributed cache system, when a MapReduce task processes a key / value pair, a set of intermediate key / value pairs is generated in the map function, and all intermediate values ​​are merged according to the same intermediate key, Each map is independent of other operations, i.e. all maps can be executed ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2237G06F16/2462G06F16/27
Inventor 季长清汪祖民秦静吴锐
Owner DALIAN UNIV