Real-time repetition removal and transmission method for data in network file system

A network file system and file technology, applied in transmission systems, electrical digital data processing, special data processing applications, etc., can solve the problem of low dicing efficiency, data dicing and deduplication are not performed in real time, and no data deduplication technology is used. and other problems to achieve the effect of saving storage space and improving reliability

Inactive Publication Date: 2010-12-15
TSINGHUA UNIV
View PDF3 Cites 44 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in its method, only after the file is closed, the whole file is divided into blocks and synchronized to the server side. At least the following significant disadvantages exist in its method: 1. Data block and deduplication are not carried out in real time. The new data at the end cannot be transmitted to the server in time; second, the cutting efficiency is low, even if only a small part of the content in the file is modified, the entire file needs to be cut again; third, when opening a new file, you need to first All the data blocks of the file are spliced ​​into a temporary file before the file can be operated on, so it takes a long time to wait
[0005] Other existing network systems, such as Network File System (NFS) and Server Message Block (SMB), do not use data deduplication technology, nor can they provide functions to reduce storage space occupation and network data transmission

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time repetition removal and transmission method for data in network file system
  • Real-time repetition removal and transmission method for data in network file system
  • Real-time repetition removal and transmission method for data in network file system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] Below in conjunction with accompanying drawing, introduce in detail the real-time deduplication and transmission method of the data in the network file system that the present invention proposes:

[0063] (1) File metadata table, data block index table and file composition table are respectively set at the client end and the server end of network file system; Described file metadata table records the metadata of each file in network file system, and this metadata The data includes file identification, file name, identification of the folder where the file is located, file size, file type, access authority and creation, modification and access time of the file; the data block index table records the data block of the file in the network file system The identification and the number of references to the data block corresponding to the identification, wherein the identification of the data block is the hash value of the content of the data block; the file composition table ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a real-time repetition removal and transmission method for data in a network file system, and belongs to the technical field of computer data storage. The method comprises the following steps of: setting a file metadata table, a data block index table and a file composition table in a client and a server respectively, and setting a to-be-transmitted message queue for storing data and updating message in the client; receiving and responding an operating command initiated to the network file system by a client application program through a file system drive by the client, wherein the operating command comprises the operation of creating a new file, writing data into an existing file, reading the data from the existing file and deleting the existing file; and settinga network service interface for uploading and downloading data block contents and receiving and answering client message in the server. The method can delete the repeated data so as to save the storage space, avoid transmitting the existing data of the opposite side between the client and the server and reduce the overhead of network bandwidth; meanwhile, the method supports a file blocking method of fixed length and unfixed length so as to improve the utilization rate of the storage space.

Description

technical field [0001] The invention relates to a real-time data deduplication and transmission method in a network file system, belonging to the technical field of computer data storage. Background technique [0002] For a long time, how to store as much data as possible with as little space as possible and how to transmit as much information as possible with as little network bandwidth overhead as possible has been the core issue in the field of network storage. Reducing the storage and transmission of duplicate data in files is the key technology to solve the above problems. [0003] Data deduplication technology, also known as data deduplication technology, emerged at the beginning of this century and has been popularized in recent years. The idea of ​​data deduplication is not complicated: first discover the same data units that appear in different files, different versions of the same file, or even in different locations of the same version, and then index the data un...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/06H04L29/08G06F17/30
Inventor 唐力汪东升
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products