Multi-dimensional index structure under cloud environment, construction method thereof and similarity query method

The index structure applies cloud environment technology in the field of computer information retrieval. It addresses the lack of support for efficient similarity queries in existing approaches, and it reduces index storage space, improves query efficiency, and lowers resource consumption.

Status: Inactive
Publication Date: 2012-12-19
NANJING UNIV OF POSTS & TELECOMM
2 Cites, 15 Cited by

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to overcome the deficiency that the existing cloud environment index structure does not support efficient simila

Detailed Description of the Embodiments

[0026] The technical solution of the present invention is described in detail below with reference to the accompanying drawings:

[0027] The idea of the present invention is to bring the VA-File method, used in traditional distributed computing environments, into the cloud environment. The original data in each storage node is quantized and compressed, the resulting sets of approximate vectors are clustered separately on each node, and the clustering result is used as the local index. The cluster center information of all local indexes, together with the address of the storage node where each cluster center resides, is published to the entire overlay network through the overlay network interface. When a similarity query is performed, only the cluster whose center is nearest to the approximate vector of the query vector needs to be searched, rather than all approximate vectors, which greatly narrows the query scope and improves the efficiency...
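The local-index idea in [0027] can be illustrated with a minimal sketch. It is not the patented implementation: the uniform grid over [0, 1), the `bits` parameter, the use of plain k-means seeded with the first `k` approximations, and all function names are assumptions chosen for illustration.

```python
from typing import Dict, List, Sequence, Tuple

Approx = Tuple[int, ...]


def quantize(vec: Sequence[float], bits: int = 2) -> Approx:
    """VA-File style approximation: map each coordinate in [0, 1) to a cell id.

    Assumption: a uniform grid with 2**bits cells per dimension.
    """
    cells = 1 << bits
    return tuple(min(int(x * cells), cells - 1) for x in vec)


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point: Sequence[float], centers: List[Tuple[float, ...]]) -> int:
    """Index of the center closest to `point`."""
    return min(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))


def kmeans(points: List[Approx], k: int, rounds: int = 10) -> List[Tuple[float, ...]]:
    """Plain Lloyd iterations (assumed clustering method), seeded with the first k points."""
    centers = [tuple(float(c) for c in p) for p in points[:k]]
    for _ in range(rounds):
        groups: List[List[Approx]] = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Recompute each center as the mean of its group; keep it if the group is empty.
        centers = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return centers


def build_local_index(vectors: List[Sequence[float]], k: int, bits: int = 2):
    """Quantize a node's vectors, cluster the approximations, return (centers, clusters)."""
    approxs = [quantize(v, bits) for v in vectors]
    centers = kmeans(approxs, k)
    clusters: Dict[int, List[int]] = {i: [] for i in range(k)}
    for row, a in enumerate(approxs):
        clusters[nearest(a, centers)].append(row)
    return centers, clusters


def candidate_rows(query: Sequence[float], centers, clusters, bits: int = 2) -> List[int]:
    """Return only the rows in the cluster whose center is nearest to the query's approximation."""
    return clusters[nearest(quantize(query, bits), centers)]


vectors = [(0.05, 0.10), (0.90, 0.95), (0.10, 0.05), (0.95, 0.90)]
centers, clusters = build_local_index(vectors, k=2)
rows = candidate_rows((0.92, 0.88), centers, clusters)
```

In this sketch the query touches only the rows of one cluster; a full similarity query would still need to compare the query against the original vectors behind those candidate rows.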

Abstract

The invention discloses a multi-dimensional index structure for a cloud environment, a method for constructing it, and a similarity query method. The index structure comprises a global index and local indexes, one located at each storage node; the cloud environment uses an overlay network to organize the storage nodes. Each local index is the clustering result obtained by clustering the approximate vectors of all vector data in the storage node where that local index resides. The global index consists of the cluster center information of all local indexes, published to the entire overlay network, together with the addresses of the storage nodes where the cluster centers reside. The index structure reduces index storage space and resource consumption, and effectively supports multi-dimensional data indexing and similarity queries in the cloud environment. Because the clustering information obtained from the approximate vectors serves as the local index, a local query uses the cluster center information to search only the corresponding cluster instead of scanning all approximate vectors, which improves query efficiency.
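The global index described in the abstract can be pictured as a set of (cluster center, storage node address) entries. The sketch below is a hedged illustration only: `GlobalEntry`, `GlobalIndex`, `publish`, and `route` are hypothetical names, the node addresses are made up, and an in-memory list stands in for publication to the overlay network, whose actual protocol is not specified in this excerpt.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class GlobalEntry:
    center: Tuple[float, ...]  # cluster center in approximation space
    node_address: str          # storage node that holds this cluster
    cluster_id: int            # id of the cluster inside that node's local index


class GlobalIndex:
    """In-memory stand-in (assumption) for entries published to the overlay network."""

    def __init__(self) -> None:
        self.entries: List[GlobalEntry] = []

    def publish(self, node_address: str, centers: Sequence[Sequence[float]]) -> None:
        """Record every cluster center of one node's local index with the node's address."""
        for cluster_id, center in enumerate(centers):
            self.entries.append(GlobalEntry(tuple(center), node_address, cluster_id))

    def route(self, query_approx: Sequence[float]) -> GlobalEntry:
        """Pick the entry whose center is nearest to the query's approximate vector."""
        return min(
            self.entries,
            key=lambda e: sum((q - c) ** 2 for q, c in zip(query_approx, e.center)),
        )


index = GlobalIndex()
index.publish("node-a", [(0.0, 0.0), (3.0, 3.0)])  # hypothetical node addresses
index.publish("node-b", [(0.0, 3.0)])
hit = index.route((1, 3))
```

Here `hit` names the node and cluster to contact, so the query is forwarded to a single storage node rather than broadcast to all of them.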

Description

Technical field

[0001] The invention relates to multi-dimensional indexing in a cloud computing environment, and in particular to a multi-dimensional indexing method for cloud environments that supports similarity queries, together with a method for constructing the index. It belongs to the technical field of computer information retrieval.

Background

[0002] With the growing popularity of the Internet and the rapid development of information technology, Internet data is expanding rapidly, and storing and managing massive data has become a challenging problem that urgently needs to be solved. Cloud computing emerged in response, bringing new service models to users and enterprises. Several cloud computing systems are already in successful use, such as Amazon's Elastic Compute Cloud (EC2), IBM's Blue Cloud, and Google's cloud computing platform. These cloud computing systems contain a large number of computer nodes, store massive amounts of data, and s...

Application Information

IPC(8): G06F17/30, H04L29/08
Inventors: 程春玲, 孙春菊, 张登银, 徐小龙
Owner: NANJING UNIV OF POSTS & TELECOMM