Method for estimating two-dimensional predicate selectivity using wavelet-compressed histograms
A histogram-based selectivity estimation technique, applied in the field of estimating the distribution of stored data. It addresses problems such as optimization errors and corrected results that deviate from actual results, and achieves reduced data loss, low storage and construction costs, and accurate estimation.
Embodiment Construction
[0036] As shown in Fig. 1, the method of the present invention has two stages. The first stage collects statistics on the data in the database and stores them as statistical information for later query optimization. The second stage estimates the selectivity when a user issues a query.
[0037] The specific steps of the first stage are as follows:
[0038] Step 1: Data sampling
[0039] Sampling draws a subset from the whole population such that the subset describes the characteristics of the whole. Random sampling is performed on the relation for which the two-dimensional statistical information is to be created, and the values of the two attributes involved in that statistical information are read from each sampled row. These value pairs form the two-dimensional data set on which the statistical information is built.
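The sampling step can be illustrated with a minimal Python sketch. It is not the implementation described in the invention: the function name `sample_two_attributes`, the row representation (a list of dictionaries), the attribute names, and the sample size are all assumptions made for illustration, and uniform sampling without replacement is only one possible way to realize "random sampling".

```python
import random


def sample_two_attributes(rows, attr_x, attr_y, sample_size, seed=None):
    """Draw a uniform random sample of rows (without replacement) and
    project each sampled row onto the two attributes of interest.

    rows        -- hypothetical in-memory relation: a list of dicts
    attr_x/y    -- names of the two attributes covered by the statistics
    sample_size -- requested number of rows; capped at len(rows)
    seed        -- optional seed so the sketch is reproducible
    """
    rng = random.Random(seed)
    k = min(sample_size, len(rows))
    sampled_rows = rng.sample(rows, k)
    # The resulting (x, y) pairs are the two-dimensional data set.
    return [(row[attr_x], row[attr_y]) for row in sampled_rows]


# Hypothetical relation with three attributes; only two are sampled.
rows = [
    {"age": 20 + i % 50, "salary": 1000 * (i % 7), "name": f"u{i}"}
    for i in range(1000)
]
pairs = sample_two_attributes(rows, "age", "salary", 100, seed=42)
```

Each element of `pairs` is one point of the two-dimensional data set. A real database would sample pages or rows on disk rather than an in-memory list, which this sketch does not attempt to model.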
[0040] Step 2: Extract the most common values (MCV, Most Common Value)
[0041] First, fix a dimension order for the two-dimensional attributes of the statistica...