Distributed storage system data storage method, apparatus, system, and storage medium

A technology of distributed storage and system data, applied in the computer field, can solve the problems of many redundant data, many files, and the effect of over-occupancy of system memory is small, and the effect of reducing memory occupancy and improving detection probability is achieved.

Inactive Publication Date: 2019-01-18
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
View PDF6 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, most of the duplicate files are large files such as disaster recovery backups, and the content of the files is large. The current files rarely have complete duplication between files, mainly because there are many redundant data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed storage system data storage method, apparatus, system, and storage medium
  • Distributed storage system data storage method, apparatus, system, and storage medium
  • Distributed storage system data storage method, apparatus, system, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The core of the present invention is to provide a data storage method for a distributed storage system. The method divides the file to be stored into blocks, and performs data comparison and determination of redundant data by dividing the file block. The detection probability of data realizes accurate data deduplication; another core of the present invention is to provide a distributed storage system data storage device, system and readable storage medium.

[0047] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by pe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data storage method of a distributed storage system, which comprises the following steps: dividing a file to be stored into blocks to obtain a plurality of file blocks to bestored; comparing a file block to be stored with a file block stored in advance to judge whether there is a file block matched with the content of the file block to be stored in the system. If yes, obtaining a data storage location of a file block whose content matches in the system; indexing the matching file block to be stored according to the data storage location. By dividing the file to be stored into blocks and comparing the data to determine the redundant data, the method can improve the detection probability of the partial duplicate data in the file and realize the accurate data deletion. The invention also provides a distributed storage system data storage device, a system and a readable storage medium, which have the beneficial effects.

Description

technical field [0001] The invention relates to the field of computers, in particular to a data storage method, device, system and readable storage medium of a distributed storage system. Background technique [0002] With the rapid development of data information, various data interactions such as cloud computing, big data, and the Internet of Things have led to a rapid increase in stored data, and data management for storage devices has become a more critical issue. [0003] Data stored in storage devices generally has high redundancy, that is, there are many data stored repeatedly among different files, especially various backup storage systems and various operating systems. Reducing data redundancy can effectively improve the utilization of storage space and is an important research project in storage systems. [0004] At present, to reduce data duplication redundancy, it is usually a method of directly comparing files and deleting duplicate files. At present, most of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/182G06F16/13G06F16/16
Inventor 徐晓阳赵万里
Owner ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products