Measurement space data similarity query method and device based on SQL
A technology for data similarity and space measurement, applied in the field of data processing, can solve problems such as mismatching index structure types, and achieve the effect of improving applicability and performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0054] This embodiment provides a SQL-based method for querying the similarity of metric space data, such as figure 1 As shown, the method may include:
[0055] Step 101, perform partition processing on the data set to obtain multiple partitions; wherein, each partition contains: data object, reference point;
[0056] Due to the variety of data types and the huge amount of data in the metric space, in order to improve the query efficiency, the preprocessing method of dividing all the data in the database can be used to divide the data into multiple partitions. All the data in the metric space constitute a data set, and each partition after the data set is divided contains a partition serial number for identifying the partition, and each partition also contains at least one reference point and at least one data object.
[0057] Specifically, such as figure 2 As shown, multiple objects can be arbitrarily selected as reference points in the data set, and the data space can be ...
Embodiment 2
[0078] This embodiment provides a SQL-based method for querying the similarity of metric space data, such as image 3 As shown, the method may include:
[0079] Step 201, in the data set, determine multiple reference points;
[0080] There are many ways to determine the reference point in the data set. For example, according to the number of data objects remaining in the data set except for the reference point, the data objects can be equally divided into multiple reference points according to the number; The distance to the reference point, divide the data objects whose distance to the reference point is within the preset range to each reference point.
[0081] Since the distribution of reference points and data objects is irregular, the selection of reference points affects the number of data objects in each partition and the distance from each data object to the reference point. Therefore, the quality of reference point selection directly affects the performance of simila...
Embodiment 3
[0122] This embodiment provides a SQL-based method for querying the similarity of metric space data, such as Figure 4 As shown, the method may include:
[0123] Step 301, perform partition processing on the data set to obtain multiple partitions; wherein, each partition contains: data object, reference point;
[0124] Due to the variety of data types and the huge amount of data in the metric space, in order to improve the query efficiency, the preprocessing method of dividing all the data in the database can be used to divide the data into multiple partitions. All the data in the metric space constitute a data set, and each partition after the data set is divided contains a partition serial number for identifying the partition, and each partition also contains at least one reference point and at least one data object.
[0125] Specifically, such as figure 2 As shown, multiple objects can be arbitrarily selected as reference points in the data set, and the data space can be...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com