Incremental sampling algorithm for row mode of window function

Window functions and sampling algorithms, applied to query analysis in database systems, address the unacceptable cost of query processing. The technique reduces query response time, optimizes computation, and improves execution efficiency.

Inactive Publication Date: 2018-11-06
EAST CHINA NORMAL UNIVERSITY

AI Technical Summary

Problems solved by technology

[0004] Window functions are widely used in analytical databases, but the cost of query processing is still unacceptable when the a


Examples


Example Embodiment

[0024] Example

[0025] Figure 1 is a flow chart of a specific implementation of the Row-mode incremental sampling algorithm, the window function optimization strategy of the present invention. As shown in Figure 1, the specific steps of the invention include:

[0026] S101: Partition and sort the data in the table.

[0027] Table 1 shows order data generated by DBGEN, the standard data generator of the TPC-H database benchmark (some columns have been removed for ease of presentation). The first column is the serial number of the order. The second column is the customer ID corresponding to the order. The third column is the status of the order: O means the transaction succeeded, and F means the transaction failed. The fourth column is the price of the order. The fifth column is the date the order was generated. The sixth column is the priority of the order.

[0028] Table 1

...
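Step S101 can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: the excerpt does not state which columns serve as the partition and sort keys, so the sketch assumes partitioning by customer ID and sorting by order date. The function name `partition_and_sort`, the field names, and the sample rows are all hypothetical and are not taken from Table 1.

```python
from collections import defaultdict
from operator import itemgetter


def partition_and_sort(rows, partition_key, order_key):
    """Group rows by partition_key, then sort each group by order_key."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[row[partition_key]].append(row)
    # Sorting inside each partition gives every row a well-defined position,
    # which is what a ROWS-mode window frame is defined against.
    return {key: sorted(group, key=itemgetter(order_key))
            for key, group in partitions.items()}


# Hypothetical rows, loosely modeled on the columns described for Table 1.
orders = [
    {"orderkey": 1, "custkey": 10, "totalprice": 120.0, "orderdate": "1996-03-01"},
    {"orderkey": 2, "custkey": 20, "totalprice": 80.0, "orderdate": "1995-07-15"},
    {"orderkey": 3, "custkey": 10, "totalprice": 45.5, "orderdate": "1994-11-20"},
    {"orderkey": 4, "custkey": 20, "totalprice": 300.0, "orderdate": "1997-01-05"},
    {"orderkey": 5, "custkey": 10, "totalprice": 60.0, "orderdate": "1995-02-09"},
]

# ISO-formatted date strings sort chronologically under plain string comparison.
partitions = partition_and_sort(orders, "custkey", "orderdate")
```

After this step, each partition is an ordered list, so a window frame for any row can be described by a pair of row offsets within its partition.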



Abstract

The invention discloses an incremental sampling algorithm for the Row mode of window functions. Using the ideas of random sampling and incremental sampling, the method samples the data inside the window while computing each row, estimates the window function value of the original data from the statistics of the sample, and returns a confidence interval to the user. This optimizes the computation of the window function and avoids processing the whole data set. The method balances efficiency and accuracy and performs well in big data applications.
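The idea of estimating a window function value from a sample and returning a confidence interval can be sketched as follows. This is a sketch under stated assumptions, not the patented algorithm: it assumes the aggregate is AVG, uses a normal-approximation interval with a finite population correction, and draws a fresh sample for every row rather than reusing samples incrementally, since the excerpt does not describe how the incremental reuse works. The function name `approx_window_avg` and all of its parameters are hypothetical.

```python
import math
import random
from statistics import mean, stdev


def approx_window_avg(values, preceding, following, sample_size, z=1.96, seed=0):
    """Estimate AVG over ROWS BETWEEN preceding PRECEDING AND following FOLLOWING.

    Returns one (estimate, half_width) pair per row; the approximate
    confidence interval is estimate +/- half_width.
    """
    if sample_size < 2:
        raise ValueError("sample_size must be at least 2 to estimate a variance")
    rng = random.Random(seed)
    n = len(values)
    results = []
    for i in range(n):
        # Clip the frame to the bounds of the partition.
        window = values[max(0, i - preceding):min(n, i + following + 1)]
        if len(window) <= sample_size:
            # Small frame: compute exactly, so the interval has zero width.
            results.append((mean(window), 0.0))
            continue
        sample = rng.sample(window, sample_size)  # sampling without replacement
        estimate = mean(sample)
        # Finite population correction for sampling without replacement.
        fpc = math.sqrt((len(window) - sample_size) / (len(window) - 1))
        half_width = z * stdev(sample) / math.sqrt(sample_size) * fpc
        results.append((estimate, half_width))
    return results


# Example call on synthetic data: frames of up to 401 rows, 50 sampled per row.
estimates = approx_window_avg(list(range(1000)), 200, 200, sample_size=50)
```

The default `z=1.96` corresponds to a 95% interval under the normal approximation; a larger `sample_size` narrows the interval at the cost of touching more rows per frame.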

Description

Technical field

[0001] The invention belongs to the field of query analysis in database systems. It targets the window functions introduced in SQL:2003 and uses approximate computation to improve their query efficiency and usability.

Background

[0002] As data accumulates, how to effectively analyze and use the knowledge it contains has become a focus across industries. Traditional databases concentrate on routine transaction processing, also called on-line transaction processing (OLTP), such as basic insert, delete, query, and update operations. In today's big data context, however, traditional databases process large volumes of data inefficiently and struggle to support the growing number of data analysis and processing tasks.

[0003] From a technical point of view, the main...


Application Information

IPC(8): G06F17/30
Inventor 王晓玲, 屈稳稳, 宋光旋
Owner EAST CHINA NORMAL UNIVERSITY