spark-streaming intermediate data partition method, device, computer equipment and storage medium
A technology for intermediate data and data partitioning, applied in the field of data processing, which can solve problems such as extended job execution time, unbalanced reduce task load, and low job execution efficiency, to achieve uniform partitioning, improve job execution efficiency, and reduce time and space overhead Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0051] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.
[0052] The method provided by this application can be applied as figure 1shown in the application environment. For a batch job, the map task reads the data and processes them in parallel on the nodes, and then outputs intermediate data in the form of key / value pairs, which are partitioned by the Range partitioner, such as figure 1 Each map data shown is divided into 3 parts. Then each reduce task will obtain the intermediate data of each map task for processing, and finally output the result. The processing flow of the Range partitioner includes sampling, Key cluster up...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com