Data processing method, client, node server and distributed file system

A node server and data processing technology, applied in the field of data processing, that addresses the following problems of existing approaches: breaking the interface semantics of the HDFS system, degrading other operations such as sequential writing and large-file processing, and departing from the original design intent of the HDFS system.

Status: Inactive; Publication Date: 2018-03-27
CHINA MOBILE COMM GRP CO LTD +1
Cites: 4; Cited by: 1

AI Technical Summary

Problems solved by technology

[0007] However, through in-depth research the inventor has found that the following problems remain in prior art 1: prior art 1 lets the DataNode cache metadata for some small files. Although this can at least partially solve the small-file writing problem, its confusion of mechanism and strategy severely damages the semantics of the HDFS system. This inevitably has a large impact on other operations such as sequential writing and large-file processing, and seriously departs from the original design intent of the HDFS system.
[0011] Prior art 3 has the following disadvantages. First, the file threshold it uses is 16M, so the file granularity is too coarse; this inevitably causes a large number of file merges during processing and a relatively high I/O write latency for some write operations, which makes it unsuitable for data access that requires high I/O performance. Second, the method still uses sequential write operations and cannot perform random writes. Finally, the solution adds machines outside the HDFS system, which breaks the interface semantics of the HDFS system: the interface relationship between the application system and the HDFS system is changed, so the solution is not general.

Examples

Embodiment Construction

[0024] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention.

[0025] The HDFS system is implemented based on the principles of Google GFS and uses a share-nothing architecture to process large-scale data in parallel. The specific system structure and principles are described in the existing art and are not repeated here. The main disadvantages of this system are as follows:

[0026] 1) It cannot efficiently store a large number of small files. The existing HDFS system mainly targets very large files and cannot efficiently store large amounts of small-file data.

[0027] 2) It does not support multiple concurrent writers or arbitrary modification of files. In the existing HDFS system, a file has only one writer, and write operations can only be performed at the end of the file, that is...
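
The single-writer, append-only restriction described in item 2) can be illustrated with a small toy model. This is only a sketch of the semantics as stated in the text, not HDFS code; the class `AppendOnlyFile` and its method names are hypothetical and exist only for this illustration.

```python
class AppendOnlyFile:
    """Toy model (hypothetical) of a file with one writer and tail-only writes."""

    def __init__(self):
        self.data = bytearray()
        self.writer = None  # id of the client currently allowed to write

    def open_for_write(self, client_id):
        # Only one writer may hold the file at a time.
        if self.writer is not None and self.writer != client_id:
            raise PermissionError("file already has a writer")
        self.writer = client_id

    def write(self, client_id, offset, payload):
        if self.writer != client_id:
            raise PermissionError("client is not the current writer")
        # Writes are accepted only at the current end of the file.
        if offset != len(self.data):
            raise ValueError("writes are only allowed at the end of the file")
        self.data.extend(payload)
        return len(self.data)

    def close(self, client_id):
        # Releasing the file lets another client become the writer.
        if self.writer == client_id:
            self.writer = None
```

In this model a second client that calls `open_for_write` while the first still holds the file is rejected, and so is any write whose offset is not the current file length, which is the behavior that rules out random writes.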

Abstract

Embodiments of the invention disclose a data processing method. The method comprises the following steps: receiving a predetermined data writing request sent by a client, wherein the predetermined data writing request is a data writing request that satisfies a first predetermined condition among the one or more data writing requests received by the client; and executing a corresponding predetermined data writing operation according to the predetermined data writing request and the corresponding data to be written. The embodiments of the invention further disclose the client, a node server and a distributed file system.
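
The flow in the abstract can be sketched roughly as follows: the client selects the requests that satisfy the first predetermined condition and sends them to a node server, which performs the write. The abstract does not say what the condition is, so the size threshold `SMALL_WRITE_LIMIT`, the offset-based write in `NodeServer.handle`, and every class and function name below are assumptions made for illustration only, not the patented method.

```python
from dataclasses import dataclass, field

SMALL_WRITE_LIMIT = 1024  # hypothetical threshold in bytes (assumption)


@dataclass
class WriteRequest:
    path: str
    offset: int
    payload: bytes


def satisfies_first_condition(req):
    # Assumption: the "first predetermined condition" is a small payload.
    return len(req.payload) <= SMALL_WRITE_LIMIT


@dataclass
class NodeServer:
    files: dict = field(default_factory=dict)

    def handle(self, req):
        # Execute the write at the requested offset, zero-padding if needed.
        buf = self.files.setdefault(req.path, bytearray())
        end = req.offset + len(req.payload)
        if len(buf) < end:
            buf.extend(b"\x00" * (end - len(buf)))
        buf[req.offset:end] = req.payload
        return len(req.payload)


class Client:
    def __init__(self, node):
        self.node = node
        self.other = []  # requests left for some other write path

    def submit(self, requests):
        sent = 0
        for req in requests:
            if satisfies_first_condition(req):
                self.node.handle(req)  # forward the predetermined request
                sent += 1
            else:
                self.other.append(req)
        return sent
```

For example, submitting one 5-byte request and one 2048-byte request forwards only the first to the node server; the second stays in `client.other`.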

Description

Technical Field

[0001] The invention relates to data processing technology, in particular to a data processing method, a client, a node server and a distributed file system.

Background Technique

[0002] With the rapid development of information technology, the volume of information to be stored has become massive and investment in storage keeps increasing. There is an urgent need for new storage solutions to change the status quo, save storage costs, and reduce storage investment. Cloud storage has emerged in response. Hadoop is an open source project of Apache whose purpose is to establish a stable and scalable distributed computing architecture that runs on inexpensive hardware. The Hadoop Distributed File System (HDFS) is one of the Hadoop sub-projects. As an open source implementation of Google's distributed file system, the Google File System (GFS), HDFS is a cloud storage solution for major institutions and companies. The scheme pro...

Application Information

IPC(8): G06F3/06
CPC: G06F3/0604, G06F3/0611, G06F3/0629, G06F3/0643, G06F3/0655, G06F3/067
Inventor: 霍绍博, 吴希选, 马锦素, 付长冬, 吴庆华, 张龙
Owner: CHINA MOBILE COMM GRP CO LTD