Streaming data self-adaption persistence method and system based on mixed storage

A streaming data and hybrid storage technology, applied in special data processing applications, electrical digital data processing, data processing input/output process, etc., can solve the problems of fixed window, unstable streaming data rate, etc.

Active Publication Date: 2016-08-24
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF2 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The purpose of the present invention is to overcome the problem of fixed windows in the existing streaming data persistence, and propose a hybrid storage-based streaming data adaptive persistence method, which can calculate the The size of the persistent window, and then dynamically adapt to the situation of unstable streaming data rate, ensures the balance between ease of use and consistency of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Streaming data self-adaption persistence method and system based on mixed storage
  • Streaming data self-adaption persistence method and system based on mixed storage
  • Streaming data self-adaption persistence method and system based on mixed storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] The present invention will be described in detail below in conjunction with specific embodiments and accompanying drawings.

[0063] This example uses Java language as the programming language and Hazelcast as the streaming data processing platform. The application scenario is to process the data stream flowing into the Hazelcast processing system (the data stream can be the intermediate state and calculation results in stream processing, or It is the streaming data itself) asynchronously persisted to the non-volatile storage medium.

[0064] Such as figure 1 As shown, the streaming data self-adaptive persistence system based on hybrid storage proposed by the present invention can be divided into four parts: monitor, analyzer, decision maker, and executor. persistent storage of data, and the whole set of methods is applied to streaming data processing systems.

[0065] Such as figure 2 As shown, firstly, the monitor will collect the status feature information of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a streaming data self-adaption persistence method and system based on mixed storage. The method includes the steps that state feature information of a streaming data processing system is collected in real time; a multiple linear regression model based on machine learning is established, and model parameters are estimated according to the collected state feature information; the optimal persistent window size of the streaming data processing system under the current state is calculated and obtained according to the state feature information of the current streaming data processing system and the established regression model; the streaming data processing system changes the current persistent window according to the obtained persistent window size, and the middle state or the calculation result in the streaming data processing process is stored in a solid state disk; when data capacity in the solid state disk reaches a certain degree, data in the solid state disk are stored in an ordinary hard disk. By means of the method and system, the persistent window size at the moment can be calculated according to the current and historical state information, accordingly the situation that the streaming data rate is unstable is dynamically adapted, and balance between usability and consistency of the system is guaranteed.

Description

technical field [0001] The invention relates to a method and system for self-adaptive persistence of streaming data based on hybrid storage, and belongs to the technical field of software. Background technique [0002] With the rapid development of cloud computing, Internet of Things, social media and other information technologies and application models, streaming data has gradually become a key data form in information systems. In typical application fields, streaming big data has the characteristics of real-time, volatile, sudden, disordered and infinite (see literature: Sun Dawei, Zhang Guangyan, Zheng Weimin. Big data streaming computing: key technologies and System example [J]. Journal of Software, 2014, 25(4): 839-862.). In fields such as the Internet of Things, we need to quickly store data streams in non-volatile media, and in the financial field with high reliability requirements; we need to store the intermediate state and calculation results of nodes and save th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F3/06
CPCG06F3/061G06F3/0685G06F16/24568
Inventor 黄涛钟华魏峻王伟唐震段世凯
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products