File merging method and device based on Cassandra database
A data file and database technology, applied in the database field, that addresses the heavy disk read/write burden, high IO pressure during merging, and large disk space usage of existing file merging strategies, and achieves the effect of optimizing the file storage structure and IO efficiency.
Examples
Embodiment 1
[0034] In the Cassandra database, when a client writes data, the client program determines which server node the data should be sent to according to the token ranges of the cluster. The server accepts data in parallel on multiple threads, and each thread sorts the data it receives and generates data files smaller than 10 MB. In some implementation scenarios, when the amount of data processed by a single Cassandra machine reaches 4 TB/day, the number of data files that the database process needs to open exceeds 200,000. When data must be retained for 7 days, the number of data files that the process needs to open reaches 1.4 million. The data files therefore need to be merged to reduce their number. This embodiment provides a new dynamic small file merging method that avoids the defects of existing file merging strategies.
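The write path described in [0034] can be illustrated with a minimal Python sketch. It is not the patented method or Cassandra's actual code: the ring layout, node names, and function names (`node_for_token`, `flush_sorted`, `open_files`) are hypothetical, the MD5-based token is an assumption made for illustration (Cassandra's default partitioner is Murmur3), and "10M" is read as 10 MiB.

```python
import bisect
import hashlib

# Hypothetical ring: (upper token bound, node name), sorted by token.
RING = [(2**62, "node-a"), (2**63, "node-b"), (2**64 - 1, "node-c")]


def token_for(key):
    """Map a partition key to a 64-bit token (assumption: MD5-based)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def node_for_token(token, ring=RING):
    """Return the node owning the first range whose upper bound >= token."""
    bounds = [bound for bound, _ in ring]
    idx = bisect.bisect_left(bounds, token)
    # Tokens past the last bound wrap around to the first node.
    return ring[idx % len(ring)][1]


def node_for_key(key, ring=RING):
    """Client-side routing: pick the server node for a partition key."""
    return node_for_token(token_for(key), ring)


def flush_sorted(rows, max_bytes=10 * 1024 * 1024):
    """Sort one thread's batch by key and split it into files under max_bytes.

    rows is a list of (key: str, value: bytes). A single row that is itself
    at least max_bytes still gets its own file in this sketch.
    """
    files, current, size = [], [], 0
    for key, value in sorted(rows):
        row_size = len(key.encode("utf-8")) + len(value)
        if current and size + row_size >= max_bytes:
            files.append(current)
            current, size = [], 0
        current.append((key, value))
        size += row_size
    if current:
        files.append(current)
    return files


def open_files(files_per_day, retention_days):
    """Files the process must keep open if nothing is merged."""
    return files_per_day * retention_days
```

With the figures quoted above, `open_files(200_000, 7)` gives 1,400,000, which is the file count that motivates merging.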
[0035] Cassandra is a NoSQL distributed database that adopts the Log Structured Merge ...
Embodiment 2
[0076] On the basis of the Cassandra database-based file merging method provided in Embodiment 1 above, the present invention also provides a Cassandra database-based file merging device that can be used to implement that method. Figure 5 is a schematic diagram of the device architecture of this embodiment of the present invention. The device for merging files based on a Cassandra database in this embodiment includes one or more processors 21 and a memory 22. Figure 5 takes one processor 21 as an example.
[0077] The processor 21 and the memory 22 can be connected by a bus or by other means; Figure 5 takes connection via a bus as an example.
[0078] Memory 22, as a non-volatile computer-readable storage medium for the Cassandra database-based file merging method, can be used to store non-volatile software programs, non-volatile computer-executable programs and modules, such as the one based on File merging method f...