
File random writing method and system suitable for distributed file system

A technology for random file writing in distributed file systems, applied in special data processing applications, instruments, electrical digital data processing, etc., which addresses the lack of a random file write method for distributed file systems.

Active Publication Date: 2020-02-14
EAST CHINA INST OF COMPUTING TECH

AI Technical Summary

Problems solved by technology

Existing technology lacks a random file write method suitable for distributed file systems.




Embodiment Construction

[0021] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several changes and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0022] The file random write method for a distributed file system provided by the present invention comprises: Step 1: updating the HDFS write interface to support a random write mode, so that a random write operation affects only the directly modified block or a limited number of neighboring blocks; Step 2: extending the HDFS data transmission protocol so that the client can access any data block; Step 3: updating server-side data packet processing, updating the check value calculation, ob...
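To make Step 1 concrete, the following is a minimal sketch of how a write's byte range maps onto fixed-size blocks, so that only those blocks need to be modified. It is not the patent's implementation: the helper name `affected_blocks` and the constant `BLOCK_SIZE` are hypothetical, and the 64 MB value is simply the typical block size cited in the background section.

```python
BLOCK_SIZE = 64 * 1024 * 1024  # assumed 64 MB fixed block size


def affected_blocks(offset: int, length: int, block_size: int = BLOCK_SIZE) -> list[int]:
    """Return the indices of the fixed-size blocks touched by a write of
    `length` bytes starting at byte `offset` (hypothetical helper)."""
    if offset < 0 or length <= 0:
        raise ValueError("offset must be >= 0 and length must be > 0")
    first = offset // block_size                 # block holding the first byte
    last = (offset + length - 1) // block_size   # block holding the last byte
    return list(range(first, last + 1))


# A 20-byte write that straddles the boundary between block 0 and block 1.
print(affected_blocks(BLOCK_SIZE - 10, 20))  # prints [0, 1]
```

A small write therefore touches one block, or two when it crosses a block boundary, regardless of how large the whole file is.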



Abstract

The invention provides a file random writing method and system suitable for a distributed file system. The method comprises the following steps: Step 1: updating the HDFS write interface to support a random write mode, so that a random write operation affects only the directly modified block or a limited number of adjacent blocks; Step 2: extending the HDFS data transmission protocol so that a client can access any data block; Step 3: updating server-side data packet processing and check value calculation, obtaining the data replica updates, and updating the check value. With the present invention, when the content of a file stored in the distributed file system is updated, only the current data block and its adjacent data blocks are affected. This avoids the large amount of unnecessary data transmission caused by writing back the whole file, as native HDFS (Hadoop Distributed File System) does, which effectively reduces data transmission and movement, relieves network bandwidth pressure, and lowers processing resource consumption on both the client and the server.
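The check value update in Step 3 can be illustrated with a small sketch. It assumes a block carries one check value per fixed-size chunk, so that an in-place write only requires recomputing the check values of the chunks it touches. The names `chunk_checksums`, `random_write`, and `CHUNK_SIZE` are hypothetical, the 512-byte chunk size is chosen for illustration, and `zlib.crc32` stands in for whatever check value algorithm the patent actually uses, which this excerpt does not specify.

```python
import zlib

CHUNK_SIZE = 512  # bytes covered by one check value (assumed for illustration)


def chunk_checksums(block: bytes, chunk_size: int = CHUNK_SIZE) -> list[int]:
    """Compute one CRC32 check value per fixed-size chunk of a block."""
    return [zlib.crc32(block[i:i + chunk_size])
            for i in range(0, len(block), chunk_size)]


def random_write(block: bytearray, checksums: list[int], offset: int,
                 data: bytes, chunk_size: int = CHUNK_SIZE) -> list[int]:
    """Overwrite `data` at `offset` inside `block` in place and refresh only
    the check values of the touched chunks. Returns the touched chunk indices."""
    if not data:
        return []
    if offset < 0 or offset + len(data) > len(block):
        raise ValueError("write must fall inside the existing block")
    block[offset:offset + len(data)] = data
    first = offset // chunk_size
    last = (offset + len(data) - 1) // chunk_size
    for idx in range(first, last + 1):
        start = idx * chunk_size
        checksums[idx] = zlib.crc32(block[start:start + chunk_size])
    return list(range(first, last + 1))


# A 4-byte write straddling the boundary between chunk 0 and chunk 1.
block = bytearray(4 * CHUNK_SIZE)
sums = chunk_checksums(bytes(block))
touched = random_write(block, sums, CHUNK_SIZE - 2, b"abcd")
print(touched)  # prints [0, 1]
```

In this sketch the check values for chunks 2 and 3 are left untouched, which mirrors the stated goal of limiting the work to the modified region rather than reprocessing the whole file.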

Description

Technical field

[0001] The invention relates to the field of distributed storage, in particular to a file random writing method and system applicable to a distributed file system.

Background

[0002] Distributed file storage systems are influenced by traditional disk file systems, and most of them use fixed-size data blocks to organize and manage files. The currently popular distributed file systems HDFS (Hadoop Distributed File System) and GFS (Google File System) both divide large files into fixed-size blocks for storage, usually 64 MB. After these files are created, most subsequent writes are appends to the end of the file, and almost no random write operations are involved. This fixed-length block design is not suitable for random writing, because the write overhead is high and the performance is poor. However, roughly 25% of users' file operations are typically random writes. To address this situation, an HDFS-based file random writing method is proposed, which can wri...


Application Information

IPC(8): G06F16/172, G06F16/182
CPC: G06F16/172, G06F16/182
Inventor: 沈晨, 杜真真, 王敬平, 黄子君, 徐文远, 周洁, 褚少鹤
Owner EAST CHINA INST OF COMPUTING TECH