Multi-dimensional processing method and system for massive data
A technology for massive data and processing systems, applied in the field of big data processing, it can solve problems such as data delay and analysis system lag, and achieve the effect of real-time reflection of business changes
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0023] Such as figure 1 As shown, the multi-dimensional processing method of this massive data, the method includes the following steps:
[0024] (1) Real-time data reception: access to raw data through Kafka;
[0025] Kafka is a high-throughput distributed publish-subscribe messaging system that can handle all action streaming data in consumer-scale websites. Such actions (web browsing, searching and other user actions) are a key factor in many social functions on the modern web. These data are usually addressed by processing logs and log aggregation due to throughput requirements. This is a viable solution for systems like Hadoop that log data and analyze offline, but require real-time processing constraints. The purpose of Kafka is to unify online and offline message processing through Hadoop's parallel loading mechanism, and to provide real-time consumption through cluster machines.
[0026] (2) Real-time data processing: the original data is processed in real time thr...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

