A data distribution method and device suitable for distributed databases
A data distribution and database technology, applied in the field of distributed databases, can solve problems such as increasing computing workload, increasing I/O dispersion, and increasing system burden, so as to reduce system burden, reduce disk I/O, reduce The effect of budgeted workload
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] figure 1 The flow chart of the data distribution method applicable to distributed databases provided by Embodiment 1 of the present invention, this embodiment is applicable to the situation of distributing data to distributed database nodes, and this method can be implemented by a data sub-device applicable to distributed databases To execute, the device can be realized by software / hardware, and can be integrated into the DBMS.
[0041] see figure 1 , the data distribution method applicable to distributed databases, including:
[0042] S110. When there is a data distribution task, split the data according to the minimum storage unit.
[0043] The DBMS determines that there is a data distribution task through detection, and based on the data distribution task, splits the data to be distributed according to the smallest storage unit. For example, the smallest storage unit is a row, reads a row of data, and divides the data into each row split.
[0044] S120. Calculate...
Embodiment 2
[0049] figure 2 The second embodiment of the present invention provides a flow chart of a data distribution method suitable for distributed databases. The embodiment of the present invention is based on the above-mentioned embodiments. Further, when distributing data to nodes according to the distribution target, the following steps are further included: Materialize the data result set corresponding to the distribution task.
[0050] see figure 2 , the data distribution method applicable to distributed databases, including:
[0051] S210. When there is a data distribution task, split the data according to the smallest storage unit.
[0052] S220. Calculate the distribution target of the split minimum storage unit data, and distribute the data to the nodes according to the distribution target.
[0053] S230. Materialize the data result set corresponding to the distribution task.
[0054] Materialization can be used to pre-calculate and save the results of time-consuming o...
Embodiment 3
[0057] figure 2 The second embodiment of the present invention provides a flow chart of a data distribution method suitable for distributed databases. The embodiment of the present invention is based on the above-mentioned embodiments. Further, the data result set corresponding to the materialized distribution task is specifically optimized. It is: when querying the result set, obtain the distribution tasks in turn according to threads with a fixed size.
[0058] see image 3 , the data distribution method applicable to distributed databases, including:
[0059] S310. When there is a data distribution task, split the data according to the minimum storage unit.
[0060] S320. Calculate the distribution target of the split minimum storage unit data, and distribute the data to the nodes according to the distribution target.
[0061] S330. When querying the result set, obtain distribution tasks in turn according to threads with a fixed size.
[0062] When the query result set i...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


