A file processing method and device
A file processing and file technology, applied in file systems, file access structures, electronic digital data processing, etc., can solve the problems of long time consumption, high memory access pressure on Namenode nodes, and low file reading efficiency, so as to relieve work pressure, The effect of improving processing speed and access efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0048] see figure 1, which shows the first flow chart of the first embodiment of the file processing method of the present invention, the flow may include:
[0049] Step 100: Get at least two files.
[0050] Here, the Namenode node of the Hadoop cluster can be used to obtain the file. For example, the Namenode node of the Hadoop cluster obtains the file through ftp downloading. At this time, the obtained file is stored in the local storage directory of the Hadoop cluster.
[0051] In this step, the source of the obtained file is not limited, for example, the file may be obtained from an interface machine.
[0052] Step 101: Merge the files meeting the file merging conditions to obtain a merged file.
[0053] Specifically, the acquired files are classified; among the files of each category, the files whose capacity is smaller than the capacity threshold are marked as the files to be merged in the corresponding category; the sum of the capacities of the files to be merged in a...
no. 2 example
[0091] see Figure 4 , which shows a schematic diagram of the composition and structure of the file processing device in the embodiment of the present invention, the device includes: an acquisition module 200, a derivation module 201 and a storage module 202;
[0092] Obtaining module 200, configured to obtain at least two files.
[0093] The deriving module 201 is used for merging the files satisfying the file merging condition to obtain the merged file.
[0094] The storage module 202 is configured to store the merged file in the form of BloomMapFile.
[0095] Specifically, the deriving module 201 is configured to classify the acquired files; among the files of each category, mark the files whose capacity is smaller than the capacity threshold as the files to be merged in the corresponding category; When the sum of the capacity of the files to be merged reaches the capacity threshold, use the BloomFilter to merge the files to be merged in the corresponding category to obta...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


