Batch processing data fragmentation method and system, computer equipment and storage medium
A data sharding and batch processing technology, applied in computing, electrical digital data processing, digital data information retrieval, etc. It addresses problems such as poor batch task execution efficiency and the limitations of stand-alone deployment, with the effects of reducing development access costs and development difficulty and improving execution efficiency.
Examples
Embodiment 1
[0058] As shown in Figure 1 and Figure 3, an embodiment of the present invention provides a batch data fragmentation method, comprising the following steps:
[0059] S1. When the client starts, register the service with Zookeeper.
[0060] Preferably, registering the service with Zookeeper when the client starts specifically includes:
[0061] Initializing the Zookeeper connection; establishing a connection with Zookeeper through an open source framework (the Curator framework); and monitoring the real-time status of the custom persistent node in Zookeeper, so that the broadcast Zookeeper sends for each new registration is received promptly.
[0062] Specifically, the open source framework encapsulates a complete set of underlying APIs for connecting to Zookeeper, which greatly simplifies the connection process and keeps the connection stable, and it internally encapsulates a heartbeat detection mechanism.
[0063] S2. Receive the broadcast sent by Zookeeper for the n...
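The registration and monitoring behavior described in S1 can be illustrated with a minimal sketch. It does not use Zookeeper or the Curator framework: `InMemoryRegistry`, its methods, and the `client-` node naming are hypothetical stand-ins chosen for illustration, and a real deployment would rely on the framework's own connection and watch facilities.

```python
from typing import Callable, List


class InMemoryRegistry:
    """Hypothetical stand-in for a Zookeeper persistent node and its children."""

    def __init__(self) -> None:
        self._nodes: List[str] = []
        self._watchers: List[Callable[[List[str]], None]] = []
        self._sequence = 0

    def watch(self, callback: Callable[[List[str]], None]) -> None:
        # Assumption: a watcher is called with the full node list on each change.
        self._watchers.append(callback)

    def register(self) -> str:
        # Create a sequentially numbered node, then broadcast the new registration.
        node = f"client-{self._sequence:010d}"
        self._sequence += 1
        self._nodes.append(node)
        snapshot = self.list_nodes()
        for callback in list(self._watchers):
            callback(snapshot)
        return node

    def list_nodes(self) -> List[str]:
        # Zero-padded sequence numbers make lexical order equal creation order.
        return sorted(self._nodes)


registry = InMemoryRegistry()
broadcasts: List[List[str]] = []
registry.watch(broadcasts.append)   # monitor the node for new registrations
first = registry.register()         # first client starts and registers
second = registry.register()        # second client starts and registers
```

After both registrations, `broadcasts` holds one node-list snapshot per registration, which is the information a client needs before it queries its position in the list.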
Embodiment 2
[0079] As shown in Figure 2 and Figure 3, an embodiment of the present invention also provides another batch data fragmentation method, comprising the following steps:
[0080] S10. Receive the registration service sent by the SDK when the client starts.
[0081] S11. Update the nodes in the node list according to the registration service.
[0082] S12. After the node is created, broadcast a new-registration message.
[0083] S13. Accept the node list query request that the SDK sends after receiving the broadcast, so that the SDK can calculate the fragmentation coefficient corresponding to the client from the current ranking position of the client's node in the node list and the total number of registrations.
[0084] Zookeeper can update the nodes in the node list by receiving the registration service sent by the SDK when the client starts, and broadcast the newly registered message after the node is created. After receiving ...
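As a sketch of the calculation the SDK performs after querying the node list, the example below derives a coefficient from the ranking position and the total number of registrations. The text does not specify the coefficient's representation or how the client applies it, so the `(index, total)` pair, the zero-based position, and the modulo-based record selection are all assumptions made for illustration.

```python
from typing import Iterable, List, NamedTuple, Sequence


class ShardCoefficient(NamedTuple):
    index: int  # assumed zero-based ranking position of the client's node
    total: int  # total number of registered nodes


def compute_coefficient(node_list: Sequence[str], own_node: str) -> ShardCoefficient:
    """Rank the nodes and locate this client's node among them."""
    ordered = sorted(node_list)
    return ShardCoefficient(ordered.index(own_node), len(ordered))


def select_records(record_ids: Iterable[int], coeff: ShardCoefficient) -> List[int]:
    """Assumed usage: keep the records whose id modulo total equals index."""
    return [rid for rid in record_ids if rid % coeff.total == coeff.index]


nodes = ["node-0000000002", "node-0000000000", "node-0000000001"]
coeff = compute_coefficient(nodes, "node-0000000001")
mine = select_records(range(10), coeff)
```

Under these assumptions every record id falls into exactly one client's shard, so the clients can process the batch in parallel without overlapping work.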
Embodiment 3
[0091] As shown in Figure 7, an embodiment of the present invention also provides a batch processing data sharding system, including a registration module 1, a receiving module 2, a query module 3, a calculation module 4 and a sending module 5.
[0092] The registration module 1 is used to register the service with Zookeeper when the client starts; the receiving module 2 is used to receive the broadcast sent by Zookeeper for a new registration; the query module 3 is used to query Zookeeper's updated node list and obtain the current ranking position of the corresponding node in the node list and the total number of registrations; the calculation module 4 is used to calculate the fragmentation coefficient corresponding to the client from the current ranking position and the total number of registrations; the sending module 5 is used to send the fragmentation coefficient to the client, enabling the client to query the...
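One way the five modules might cooperate is sketched below, with each module mapped to a method. The `Registry` class is a hypothetical in-memory substitute for Zookeeper, and the class names, method names, and the `(position, total)` form of the coefficient are assumptions rather than anything the text defines.

```python
from typing import Callable, List, Tuple


class Registry:
    """Hypothetical in-memory substitute for Zookeeper's node list and broadcast."""

    def __init__(self) -> None:
        self.nodes: List[str] = []
        self.listeners: List[Callable[[], None]] = []

    def add_node(self, node: str) -> None:
        self.nodes.append(node)
        for listener in list(self.listeners):
            listener()  # broadcast the new registration to every listener


class ShardingSdk:
    def __init__(self, registry: Registry, node: str) -> None:
        self.registry = registry
        self.node = node
        self.coefficient: Tuple[int, int] = (0, 0)  # last value sent to the client

    def register(self) -> None:  # registration module 1
        self.registry.listeners.append(self.on_broadcast)
        self.registry.add_node(self.node)

    def on_broadcast(self) -> None:  # receiving module 2
        position, total = self.query()
        self.send(self.calculate(position, total))

    def query(self) -> Tuple[int, int]:  # query module 3
        ordered = sorted(self.registry.nodes)
        return ordered.index(self.node), len(ordered)

    @staticmethod
    def calculate(position: int, total: int) -> Tuple[int, int]:  # calculation module 4
        return (position, total)  # assumed form of the coefficient

    def send(self, coefficient: Tuple[int, int]) -> None:  # sending module 5
        self.coefficient = coefficient


registry = Registry()
sdk_a = ShardingSdk(registry, "node-0")
sdk_b = ShardingSdk(registry, "node-1")
sdk_a.register()
alone = sdk_a.coefficient  # coefficient while only one client is registered
sdk_b.register()           # the broadcast makes sdk_a recalculate as well
```

The point of the sketch is that a new registration triggers recalculation for every registered client, not only the one that just joined, so the shares adjust as clients are added.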


