Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data processing method and system in duplicate data deletion process

A data deduplication and data processing technology, applied in the field of data processing, can solve problems such as low efficiency and poor duplicate data effect, and achieve the effects of ensuring integrity, deduplication rate, validity and integrity

Active Publication Date: 2019-03-01
广州鼎甲计算机科技有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Based on this, it is necessary to provide a data processing method and system, computer equipment, and computer storage medium in the process of deduplication in view of the technical problem that the traditional file data block method has poor effect or low efficiency in deduplication.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and system in duplicate data deletion process
  • Data processing method and system in duplicate data deletion process
  • Data processing method and system in duplicate data deletion process

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, and do not limit the protection scope of the present invention.

[0042] It should be noted that the term "first\second\third" involved in this embodiment of the present invention is only to distinguish similar objects, and does not represent a specific ordering of objects. Understandably, "first\second\third Three" are interchangeable in a specific order or sequence where permissible. It should be understood that the terms "first\second\third" are interchangeable under appropriate circumstances such that the embodiments of the invention described herein can be practiced in sequences other than those illustrated or described herein.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a data processing method and a system, a computer device and a computer storage medium in a repetitive data deletion process. The method comprises the following steps: extracting file data of a first data amount from the backup data, detecting whether the file data conforms to a preset partitioning condition; if the file data does not conform to the partitioning condition,extracting the file data of the second data amount from the backup data, superimposing the extracted file data on the file data extracted before the extracting, and obtaining the superimposed data; if the superimposed data does not conform to the partitioning condition and the data amount of the superimposed data is smaller than the third data amount, returning the file data extracting the seconddata amount from the backup data, superimposing the extracted file data on the file data extracted before the second data amount, and obtaining the superimposed data; if the superimposed data meets apreset block partitioning condition or the data amount of the superimposed data is greater than or equal to the third data amount, the current superimposed data is determined as a re-deleted data block.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a data processing method and system, computer equipment, and computer storage medium in the process of data deduplication. Background technique [0002] De-duplication is a data reduction technology designed to reduce the storage capacity used in a storage system. It eliminates redundant data by deleting duplicate data in the storage system and retaining only one copy. The space-saving efficiency of deduplication technology can be characterized by the deduplication rate, which can be determined according to the ratio between the size of the saved space and the size of the original data. [0003] Data deduplication technology can be divided into file level and data block level according to granularity. Data block-level deduplication divides a file into data blocks in different ways, and detects data blocks as units; data block methods in data block-level deduplic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/174G06F3/06G06F11/14
CPCG06F3/0608G06F3/0641G06F3/067G06F11/1453G06F11/1464
Inventor 王贤达马立珂王子骏
Owner 广州鼎甲计算机科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products