Performance benchmark test system and method for big data stream processing framework

A benchmark testing and data stream technology, applied in fields such as digital transmission systems, transmission systems, and data exchange networks. It addresses problems in existing benchmarks such as the lack of support for dynamically changing data sources and an overly simple application structure.

Active Publication Date: 2018-10-19
INST OF SOFTWARE - CHINESE ACAD OF SCI
0 Cites, 17 Cited by

AI Technical Summary

Problems solved by technology

However, existing work suffers from problems such as application test sets whose coverage is too small.
[0008] To sum up, existing performance benchmarks for stream processing frameworks have three deficiencies, in their application test sets, streaming data sources, and performance indicators. First, they do not support dynamically changing data sources. Second, the applications are structurally too simple, so their coverage of stream processing features is low. Third, most performance indicators consider only delay and throughput and do not cover other indicators such as back pressure.


Embodiment Construction

[0032] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0033] The present invention proposes a performance benchmark testing system and method for a big data stream processing framework. Its core idea is to test the framework under typical stream processing scenarios by constructing scenarios and applications that cover the characteristics of the stream processing mode and combining them with varying parameters. While the applications run, it collects performance indicator data from the stream processing framework, visualizes the data as charts, and uses them to discover and analyze the framework's performance problems and their causes.
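As an illustration of what a "varying parameter" on the data source side might look like, the following is a minimal sketch of a load schedule whose event rate changes over time. It is not the patent's generator: the sinusoidal rate shape and the names `sine_rate` and `generate_timestamps` are assumptions chosen for the example.

```python
import math
from typing import Callable, List


def sine_rate(base: float, amplitude: float, period_s: float) -> Callable[[float], float]:
    """Return a rate function (events/second) that oscillates around `base`.

    The sinusoidal shape is an assumption for illustration only.
    """
    def rate(t: float) -> float:
        return max(0.0, base + amplitude * math.sin(2 * math.pi * t / period_s))
    return rate


def generate_timestamps(rate: Callable[[float], float],
                        duration_s: float,
                        step_s: float = 1.0) -> List[float]:
    """Produce event timestamps for a time-varying rate.

    For each step, the rate is sampled at the start of the step and that
    many events are spread evenly across the step.
    """
    timestamps: List[float] = []
    t = 0.0
    while t < duration_s:
        n = int(round(rate(t) * step_s))
        for i in range(n):
            timestamps.append(t + i * step_s / n)
        t += step_s
    return timestamps


# Hypothetical schedule: 100 events/s on average, swinging by 50, every 4 s.
schedule = generate_timestamps(sine_rate(100, 50, 4), duration_s=4)
```

A real load generator would replay such a schedule against the framework's input (for example a message queue) and attach the emit time to each event so that delay can be computed downstream.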

[0034] The stream processing mode has its own distinctive characteristics, which are divided here into two aspects: data characteristics and computing characteristics. Data characteristics mainly refer to the characteristics of the flow data to be pro...

Abstract

The invention relates to a performance benchmark test system and method for a big data stream processing framework. The system comprises a streaming load generator, a streaming scenario and application constructor, a performance data collection tool and a performance data analysis tool. The invention selects an application that conforms to the streaming processing mode calculation feature, generates a load that conforms to the streaming processing mode data feature, tests the performance of the big data stream processing framework under typical scenarios and applications, and collects performance indicators such as back pressure, throughput, delay, system resources, and node data during operation. Finally, the bottlenecks of the stream processing framework are diagnosed by analysis and statistics of collected data.
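The analysis step can be pictured with a small sketch that turns collected per-event records into throughput and delay figures. This is an assumption-laden illustration, not the patent's analysis tool: the `Record` fields, the `summarize` and `backlog_at` helpers, and the use of "emitted but not yet processed" events as a rough back-pressure proxy are all hypothetical, since the excerpt does not say how back pressure is measured.

```python
import math
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List


@dataclass
class Record:
    emitted_at: float    # time the load generator emitted the event (seconds)
    processed_at: float  # time the framework finished processing it (seconds)


def summarize(records: List[Record]) -> Dict[str, float]:
    """Compute throughput and delay statistics from collected records."""
    if not records:
        raise ValueError("no records collected")
    delays = sorted(r.processed_at - r.emitted_at for r in records)
    span = max(r.processed_at for r in records) - min(r.emitted_at for r in records)
    # Nearest-rank 95th percentile.
    p95_index = max(0, math.ceil(0.95 * len(delays)) - 1)
    return {
        "throughput_eps": len(records) / span if span > 0 else float("inf"),
        "mean_delay_s": mean(delays),
        "p95_delay_s": delays[p95_index],
    }


def backlog_at(records: List[Record], t: float) -> int:
    """Count events emitted by time t but not yet processed.

    Used here as a crude stand-in for back pressure (an assumption).
    """
    return sum(1 for r in records if r.emitted_at <= t < r.processed_at)
```

Sampling `backlog_at` at regular intervals gives a time series that can be charted alongside throughput and delay; a backlog that keeps growing suggests the framework cannot keep up with the input rate.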

Description

Technical field

[0001] The invention relates to a performance benchmarking system and method for a big data stream processing framework, in particular to the performance of the framework in typical streaming scenarios and applications, and belongs to the field of software technology.

Background

[0002] With the advent of the Internet era and the continuous development of technologies such as the mobile Internet, social networks, and e-commerce, data has grown explosively. Big data has become a focus of attention in science and technology, in business, and even in government.

[0003] In general, data can be divided into bounded data and unbounded data. Bounded data, also called batch data, refers to fixed, finite data stored on persistent media, whose volume does not change during computation. Generally speaking, the batch big data processing framework (hereinafter referred to as the batch processing framework) receive...

Application Information

IPC(8): H04L12/26, H04L12/807, H04L12/803, H04L47/27
CPC: H04L43/045, H04L43/08, H04L43/0852, H04L43/0888, H04L43/0894, H04L47/125, H04L47/27
Inventor: 黄涛, 许利杰, 魏峻, 王伟, 郑莹莹, 刘重瑞, 胡家煊
Owner: INST OF SOFTWARE - CHINESE ACAD OF SCI