Dynamic updating method and system for data stream distribution process configuration
A technology of dynamic update and data flow, applied in the field of data processing, can solve the problem of low efficiency of dynamic update of configuration, and achieve the effect of improving the efficiency of dynamic update
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] First of all, it should be explained that the present invention figure 1 The illustrated embodiment is preferably applicable to scheduling processes.
[0051] figure 1 A schematic flowchart of a method for dynamically updating data stream distribution process configuration provided by an embodiment of the present invention, as shown in figure 1 As shown, the method includes:
[0052] S101: Obtain first configuration data of a target data stream distribution process at the current moment, and acquire second configuration data for the target data stream distribution process at the current moment, wherein the target data stream distribution process includes: kafkastream.
[0053] The data stream distribution process includes but is not limited to kafka stream, and can also include Storm Stream, SparkStreaming, Flink Stream, etc. when the first configuration data is updated, the data stream distribution process needs to be restarted. In the embodiment of the present inve...
Embodiment 2
[0069] In order to avoid the overload of the server caused by simultaneous dynamic updates of multiple kafka streams, Embodiment 2 of the present invention uses the following method on the basis of Embodiment 1 to generate a backup data stream distribution process corresponding to the target data stream distribution process in the cache :
[0070] The target data flow distribution processes kafka stream-1, kafka stream-2, kafka stream-3, and kafkastream-4 are all deployed on the same server A. And each kafka stream corresponds to a data processing process. At this point, kafka stream-1, kafka stream-2, kafka stream-3, and kafka stream-4 all need to perform dynamic update operations, and kafka stream-1, kafka stream-2, kafka stream-3, and kafka stream-4 first send The first zookeeper competes for the zookeeper lock. For example, if kafka stream-1 competes successfully first, then kafka stream-1 generates a backup data stream distribution process corresponding to the target dat...
Embodiment 3
[0074] In order to balance the load between servers, Embodiment 3 of the present invention uses the following method on the basis of Embodiment 1 to realize generating a backup data stream distribution process corresponding to the target data stream distribution process in the cache:
[0075]All target data flow distribution processes deployed on each server request to the second zookeeper, so that the second zookeeper generates coordination instructions according to the remaining computing power of each server, and the coordination instructions are used to allocate and generate backup data for each server The number of tasks of the flow distribution process; each server generates a backup data flow distribution process corresponding to the target data flow distribution process in the cache according to the coordination instruction sent by the second zookeeper after receiving the coordination instruction sent by each server.
[0076] Exemplarily, the target data stream distribu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com