File merging method and device

A file and trigger condition technology, applied in the field of data communication, can solve the problems of low database read performance and large number of files, etc.

Active Publication Date: 2014-02-19
HUAWEI TECH CO LTD
View PDF4 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the prior art, even through the method of file merging, the number of files stored on the node is still relatively large, and each data read needs to filter data from multiple files, and the read performance of the entire database is still not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File merging method and device
  • File merging method and device
  • File merging method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0086] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0087] Before introducing the various embodiments of the present invention, first briefly introduce a scenario to which the various embodiments of the present invention are applicable. The various embodiments of the present invention are mainly applicable to non-relational databases, and non-relational databases can be implemented through consisten...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a file merging method and device. The file merging method comprises the steps of carrying out partitioning on sections in the charge of nodes in a cluster system according to key value information of user data, determining that each node meets a first triggering condition, reading at least two first files from disks of the nodes to caches corresponding to the nodes, respectively determining the target partition which each key value belongs to according to the key values of the data, corresponding to users, stored in the first files, merging the data with the same key value, and storing the merged data corresponding to the key values into the corresponding target partitions. Partitioning is carried out on the sections in the nodes, the data are merged once, and the data with the same key value are stored in the same partition. In the inquiry process, the partitions where the data are located are determined according to the key values, and then the files in the partitions are scanned. As the number of the files in each partition is small, data screening only needs to be carried out on fewer files, and therefore the read performance is improved.

Description

technical field [0001] The embodiment of the present invention relates to data communication technology, in particular to a method and device for merging files. Background technique [0002] With the continuous development of the Internet, the scale of Internet applications continues to grow, and the database storage that these applications rely on is facing increasing challenges. Traditional relational data has been difficult to meet the storage requirements of massive data, non-relational database NoSql applications, such as: BigTable developed by Google, Facebook's Cassandra are all non-relational databases. Usually, a non-relational database is a distributed system, and the data it stores is distributed on each node. At present, most non-relational databases are implemented through consistent hashing (hash). The so-called consistent hash refers to the hash function All hash values ​​form an end-to-end ring (the maximum value is connected to the minimum value), and each ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/17G06F16/183
Inventor 包海龙
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products