Performance prediction method for concurrent workloads in a distributed database

A workload performance prediction technology, applied in database management systems, database design/maintenance, digital data processing, etc., that addresses problems such as workload performance prediction that does not account for concurrency

Active Publication Date: 2016-04-20
GUIZHOU UNIV +1

AI Technical Summary

Problems solved by technology

Because of the particular characteristics of distributed databases, earlier database performance prediction methods are no longer applicable to the cur


Examples

Embodiment Construction

[0039] The invention is further described below with reference to the accompanying drawings and embodiments, which are not to be taken as limiting the invention.

[0040] Embodiments of the invention:

[0041] 1. Performance prediction

[0042] The object of the invention is to predict the latency of concurrent queries in a distributed database system. The performance of such a system is mainly affected by contention for shared basic resources such as RAM, CPU, disk I/O, and network bandwidth. The invention therefore first selects effective metrics that can be used to predict query latency under a concurrent workload, specifically for a distributed database system.

[0043] The invention focuses on predicting the concurrent query latency of distributed analytical workloads. Analytical queries in distributed database systems mainly involve two aspect...

Abstract

The invention discloses a performance prediction method for concurrent workloads in a distributed database. A linear regression model is established to characterize the interaction between queries in the distributed database and to predict the query latency L under different degrees of concurrency; the database then uses the predicted latency L to distribute tasks selectively. The method comprises three main steps: A, selecting the metrics for the query latency L; B, examining the interactions of query combinations under concurrency and establishing the linear regression model; C, verifying the correctness and validity of the linear regression model through experiments. Repeated experiments show a total average relative error of 14% for query latency, 30% for network delay, and 37% for the number of I/O block reads. These results indicate that the linear regression model predicts the performance of concurrent workloads on the distributed database well, which facilitates subsequent task distribution in the database and can shorten the average waiting time of queries.
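Steps B and C can be illustrated with a minimal sketch. It assumes the model is fitted by ordinary least squares and that the relative error is |observed - predicted| / observed averaged over all measurements; the abstract does not spell out either choice. The function names, the single "concurrency degree" feature, and the sample numbers are hypothetical and are not taken from the patent.

```python
import numpy as np


def _design_matrix(features):
    """Prepend an intercept column of ones to the feature columns."""
    features = np.asarray(features, dtype=float)
    return np.column_stack([np.ones(features.shape[0]), features])


def fit_latency_model(features, latencies):
    """Fit latency ~ intercept + coefficients * features by least squares."""
    X = _design_matrix(features)
    y = np.asarray(latencies, dtype=float)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef  # coef[0] is the intercept


def predict_latency(coef, features):
    """Predict the query latency L for new feature values."""
    return _design_matrix(features) @ coef


def mean_relative_error(observed, predicted):
    """Average of |observed - predicted| / observed."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs(observed - predicted) / observed))


# Made-up training data: concurrency degree vs. measured latency (seconds).
concurrency = [1, 2, 3, 4, 5]
latency = [3.5, 5.0, 6.5, 8.0, 9.5]

coef = fit_latency_model(concurrency, latency)
predicted = predict_latency(coef, [6, 7])
error = mean_relative_error(latency, predict_latency(coef, concurrency))
```

A real model would likely use several features per query combination, such as the metrics selected in step A; the same functions accept a two-dimensional feature array with one column per metric.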

Description

Technical field

[0001] The invention relates to a performance prediction method for database workloads, in particular to a performance prediction method for concurrent workloads in a distributed database.

Background art

[0002] Research on the performance prediction of database workloads already exists. However, the databases studied are limited to single-node databases, that is, databases with only one server, whose performance depends mainly on that server's disk and CPU utilization. As the amount of data generated in research and industry grows, distributed database systems are used to store and manage PB-scale data and to provide high parallelism and scalability. Data in a distributed database is processed through a scatter/gather pattern. For example, one node can split a query into multiple subqueries, many other nodes can execute these subqueries concurrently, and then partial results of ...
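The scatter/gather pattern mentioned in the background can be sketched as follows. This is an illustration only: threads stand in for database nodes, the subquery is a hypothetical partial aggregate, and the merge step reflects how scatter/gather commonly works rather than anything stated in the truncated text above.

```python
from concurrent.futures import ThreadPoolExecutor


def run_subquery(partition):
    """Hypothetical per-node work: return a partial aggregate (sum, count)."""
    return sum(partition), len(partition)


def scatter_gather_average(partitions):
    """Scatter one subquery per partition, then gather and merge the partials."""
    # Scatter: each partition is processed concurrently by its own worker.
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(run_subquery, partitions))
    # Gather: the coordinating node merges the partial results.
    total = sum(partial_sum for partial_sum, _ in partials)
    count = sum(partial_count for _, partial_count in partials)
    return total / count


# Made-up data: three partitions, each held by a different (simulated) node.
partitions = [[1, 2, 3], [4, 5], [6]]
average = scatter_gather_average(partitions)
```

Because the subqueries run at the same time and share CPU, disk, and network, the latency of the whole query depends on what else is running, which is the interaction the prediction method sets out to model.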


Application Information

IPC(8): G06F17/30
CPC: G06F16/21, G06F16/25
Inventors: 李晖, 陈梅
Owner: GUIZHOU UNIV