Processing method for reading Kafka data based on Spark Streaming
A processing method and data technology, applied in the direction of electrical digital data processing, special data processing applications, relational databases, etc., can solve problems such as data loss, cache data loss, cache impossible recovery, etc., to achieve data loss guarantee and prevent data loss Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
[0022] The processing method for reading Kafka data based on Spark Streaming provided by the present invention uses a relational database to create two database tables, which are respectively a scheduling table (control) and a failure record number table (fai lure). The scheduling table stores scheduling information, including scheduling number id, start time, end time, status, creation time and other information. The failure record table stores specific failure data record details, including failure record id, offset, topic (topic), Kafka node list and other information. Among them, the scheduling number id in the scheduling table is the primary foreign key relationship with the id of the failure record table.
[0023] In the process of connecting SparkStreaming to Kafka to read and process data, firstly, the createDirectStream method of SparkStreamin...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


