Multi-dimensional sequencing and searching method for mass data

A search method and mass data technology, applied in the field of data search, to achieve the effect of improving query speed, accuracy and time consumption

Inactive Publication Date: 2017-09-15
NORTHEASTERN UNIV
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Tian Xue et al. optimized the ciphertext retrieval framework MRSE, and proposed a new ciphertext index structure: MRSE-SS, which introduced the similar query tree structure into the ciphertext index framework to improve the efficiency of multi-keyword sorting and retrieval, and proposed A dynamic clustering algorithm DK-MEDOIDS, the clustering process changes dynamically with the increase of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-dimensional sequencing and searching method for mass data
  • Multi-dimensional sequencing and searching method for mass data
  • Multi-dimensional sequencing and searching method for mass data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be described in detail below with reference to the accompanying drawings.

[0035] The invention provides a multi-dimensional sorting and searching method for massive data, and proposes a dynamic interval clustering algorithm DIK-MEDOIDS in the environment of cloud storage. When initializing a document set, the method takes the vector difference between the largest and smallest documents in the document set, and equals the is divided into k slots, the size of the slot is the diameter of the hypersphere, the document closest to the middle vector of the slot is set as the center of the hypersphere, the size of each document slot depends on the number of document sets, and with the number of documents Added, the slots are dynamically divided. At the same time, a new similarity query tree is used to organize clusters in different fields. By controlling the number of sub-node hyperspheres in the superior hypersphere, the structure is dynamically ad...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-dimensional sequencing and searching method for mass data. The method comprises the following steps: clustering files according to field relevance of files in database to obtain search trees having similar clustering structures; clustering the clustering structures in different fields and forming similar search trees; obtaining search vectors submitted by users and presenting vectors as search supraspheres; and obtaining a suprasphere having most intersections with the search supraspheres by searching the positioning relationship of supraspheres represented by nodes of similar search trees and search supraspheres, searching in the next layer of nodes of supraspheres till nodes of leaves, searching neighboring nodes on the left and right and according to correlation proportions, returning most-related k file lists and file vectors in the nodes. The multi-dimensional sequencing and searching method for mass data has the following beneficial effects: a DIK-MEDOIDS algorithm in the environment of big data has obvious advantages; and search speed of data is increased and accuracy of data is improved.

Description

technical field [0001] The invention relates to the technical field of data search, in particular to a multi-dimensional sorting search method for massive data. Background technique [0002] The information privacy of the wireless body area network includes various physiological parameters of the user, and a large amount of private data is stored in the cloud server. The ciphertext retrieval technology is an effective method to solve the privacy security problem in the cloud environment. In the current data protection technology, encryption algorithms can better protect data, but encryption and decryption calculations will have a great impact on system efficiency; the data splitting and reloading strategy is more efficient, but it does not affect the structure and physical properties of the cloud platform. Hierarchical dependencies are too large. Therefore, finding a balance between data availability and security is the most critical issue in cloud storage platform applicat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/322G06F16/334G06F16/338G06F16/353
Inventor 赵志滨顾佳良姚兰高福祥
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products