An offline real-time data processing method and system based on a big data framework
A real-time processing and big data technology, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. It addresses the inability to process real-time data and the inability of the Storm platform to process offline data, and it achieves strong scalability and fault tolerance together with efficient and complete data processing.
Embodiment Construction
[0047] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.
[0048] As shown in Figures 1, 3, and 4, a method for offline real-time processing of data based on a big data framework comprises the following steps:
[0049] Step 1. Offline real-time data collection
[0050] Build the Hadoop and Storm platform environments on the machines and install MySQL, Flume, and other related software. Configure the Flume configuration files to transmit data in Avro mode. Each machine runs one Flume agent; an agent contains multiple Sources and Sinks, with a Channel connecting them.
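The agent structure described in this step can be pictured with a small in-memory model. The sketch below is illustrative only and is not Flume's API (Flume itself is a Java service configured through properties files). The class name `Agent`, its methods, the channel capacity, and the round-robin hand-off between sinks are all assumptions made for the example.

```python
import queue
from typing import Callable, Iterable, List


class Agent:
    """Toy stand-in for a Flume agent: sources -> channel -> sinks.

    Hypothetical model for illustration; not part of Flume.
    """

    def __init__(self, capacity: int = 100) -> None:
        # The channel is a bounded buffer sitting between sources and sinks.
        self.channel: "queue.Queue[str]" = queue.Queue(maxsize=capacity)
        self.sources: List[Iterable[str]] = []
        self.sinks: List[Callable[[str], None]] = []

    def add_source(self, events: Iterable[str]) -> None:
        self.sources.append(events)

    def add_sink(self, deliver: Callable[[str], None]) -> None:
        self.sinks.append(deliver)

    def run_once(self) -> int:
        """Move every pending event through the channel; return the count."""
        if not self.sinks:
            raise RuntimeError("at least one sink is required")
        # Sources write into the channel (raises queue.Full past capacity).
        for source in self.sources:
            for event in source:
                self.channel.put_nowait(event)
        # Sinks read from the channel; each event goes to exactly one sink,
        # assigned round-robin (an assumption of this sketch).
        delivered = 0
        while not self.channel.empty():
            event = self.channel.get_nowait()
            self.sinks[delivered % len(self.sinks)](event)
            delivered += 1
        return delivered


if __name__ == "__main__":
    agent = Agent()
    sink_a: List[str] = []
    sink_b: List[str] = []
    agent.add_source(["event-1", "event-2"])
    agent.add_source(["event-3"])
    agent.add_sink(sink_a.append)
    agent.add_sink(sink_b.append)
    print(agent.run_once(), sink_a, sink_b)
```

The point of the model is the decoupling: sources never call sinks directly, so either side can be added or replaced without touching the other, and the channel absorbs differences in their speeds.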
[0051] Step 2. Data storage and caching
[0052] The Source collects data from the data source and passes it to the Channel; the Sink takes data from the Channel and outputs it. The output offline data is uploaded to the HDFS distributed...


