Method and system for generating dependencies in application-driven data generation supporting left outer join

A data generation, application-driven technology, applied in the field of database technology and data synthesis, can solve the problems of low efficiency, difficult to guarantee the authenticity of the generated results, difficult to meet the cardinality constraints, etc., to improve computing efficiency, efficient real-time generation performance, and achieve generation performance effect

Active Publication Date: 2022-04-05
EAST CHINA NORMAL UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] 1. It is difficult to meet the cardinality constraints in the data generation process, that is, to control the size of the intermediate result set, and it is often necessary to maintain a large and complex intermediate state
[0008] 2. The authenticity of the generated results is difficult to guarantee, and the generation efficiency is relatively low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for generating dependencies in application-driven data generation supporting left outer join
  • Method and system for generating dependencies in application-driven data generation supporting left outer join
  • Method and system for generating dependencies in application-driven data generation supporting left outer join

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0062] Primary key status statistics: According to the given query template and data table structure, first calculate the query subtree with the left outer join as the root node from the bottom up, and obtain the query result of the subtree. According to this result, you can get the query subtree in the data table Whether the primary key is removed during the query process, that is, the retention status of the primary key. Use a bitmap to describe the retention status of each column of the primary key, use T to indicate retention, and F to indicate discarding, and combine tuples with the same retention status to count the proportion and content of the whole, and the table is as follows figure 1 Shown in T1

[0063] Calculate the constraints of the left outer join: first compress the data. For example, assuming that the left outer join only considers the status columns C1 and C2, the status column C3 can be compressed, and the data of the same items in the first two columns ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for generating dependencies in application-driven data generation that supports left outer joins, comprising the following steps: analyzing the state of the primary key, and calculating the query substring of the primary key column in the left outer join for a given database structure and original query statement The query result of the tree, mark the retained state of the primary key, and make statistics on the retained state of the primary key; calculate the primary key filtering ratio, calculate the primary key filtering ratio of each state according to the given left outer join constraint; fill the foreign key, randomly according to the filtering ratio Foreign keys are populated without duplication, and memory usage is compressed by overflowing files during processing. The invention also discloses a system for realizing the above method. Compared with the prior art, the present invention has the advantages of feature extraction, authenticity constraint and data compression.

Description

technical field [0001] The invention belongs to the field of database technology and data synthesis technology, and in particular relates to a method and system for generating dependency relationships in application-driven data generation supporting left outer joins. Background technique [0002] At present, with the development of the network and the popularization of personal terminals, the amount of data shows an obvious growth trend. At the same time, the demand for data analysis on e-commerce and social media network platforms is also increasing. In these new scenarios and new requirements, re-evaluating the performance of the database has become a new challenge that needs to be faced. In database performance evaluation, load generation is an important link. Whether it is for a new database component or a new data load, it is of great significance to simulate a database instance with the required load characteristics. [0003] An important requirement in generating t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2453G06F16/2455
CPCG06F16/2453G06F16/2455
Inventor 翁思扬王清帅张蓉
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products