Data file processing method and device
A data file processing method and device, applied in the field of data processing. It addresses the long processing time, poor timeliness, and processing difficulty associated with large data files, and improves the efficiency of data processing.
Specific Embodiments
[0088] For example, if the HDFS cluster has 10 nodes, the original data file can be split into 10 sub-files. The size of each sub-file, and the upper and lower limits of the key value range it covers, are determined from the remaining storage space of each node and from how the values of a specific key are distributed. For example, suppose the primary key IDs of 10,000 records in a bank transaction flow table lie between 1,000 and 9,999, and 9,000 of those records have IDs from 1,000 to 3,000. Those 9,000 records can then be split into 9 sub-files, and the remaining records, with IDs from 3,000 to 9,999, form one more sub-file. The number of sub-files, the size of each sub-file, and the rule for placing each sub-file on a node with enough remaining space for it are together referred to as the storage strategy. The rule for how the original data file is split is referred to as the split strategy.
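The following is a minimal Python sketch of the two strategies described in paragraph [0088], not the patented implementation. It assumes unique integer keys, measures node capacity in records rather than bytes, and uses hypothetical names (`plan_split`, `assign_to_nodes`, `node0` ... `node9`) and a smaller synthetic key set than the one in the example. The split strategy here cuts the sorted keys into equal-count ranges, so a dense key range yields many narrow sub-files and a sparse range yields one wide sub-file. The storage strategy greedily places each sub-file on the node with the most remaining space.

```python
import heapq


def plan_split(keys, num_subfiles):
    """Split strategy (assumed): equal-count key ranges.

    Returns a list of (low_key, high_key, record_count) tuples, one per
    sub-file, with inclusive bounds. Assumes the keys are unique.
    """
    ordered = sorted(keys)
    n = len(ordered)
    plan = []
    for i in range(num_subfiles):
        start = i * n // num_subfiles
        end = (i + 1) * n // num_subfiles
        if start == end:  # fewer records than sub-files
            continue
        chunk = ordered[start:end]
        plan.append((chunk[0], chunk[-1], len(chunk)))
    return plan


def assign_to_nodes(plan, free_space):
    """Storage strategy (assumed): largest sub-file to the emptiest node.

    free_space maps a node name to its remaining capacity, expressed in
    records for simplicity. Returns {(low_key, high_key): node_name}.
    """
    # Max-heap on remaining space, implemented by negating the capacity.
    heap = [(-space, node) for node, space in free_space.items()]
    heapq.heapify(heap)
    placement = {}
    for low, high, count in sorted(plan, key=lambda p: -p[2]):
        neg_space, node = heapq.heappop(heap)
        if -neg_space < count:
            raise ValueError(f"no node can hold sub-file [{low}, {high}]")
        placement[(low, high)] = node
        heapq.heappush(heap, (neg_space + count, node))
    return placement


# Synthetic skewed data: 900 dense IDs plus 100 sparse IDs.
keys = list(range(1000, 1900)) + list(range(3000, 10000, 70))
plan = plan_split(keys, num_subfiles=10)
nodes = {f"node{i}": 150 for i in range(10)}
placement = assign_to_nodes(plan, nodes)

for low, high, count in plan:
    print(f"IDs {low}-{high}: {count} records -> {placement[(low, high)]}")
```

With this synthetic data the dense IDs fill nine narrow sub-files and the sparse IDs fall into a single wide one, which mirrors the 9-plus-1 split in the example above. A real system would size sub-files in bytes and would need a tie-breaking rule for duplicate keys.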
[0089] It should be noted ...