Repeated data deleting method and device

A technology of data deduplication and data block, which is applied in the field of data processing, can solve the problems of limited erasure times of flash storage media, affect the reliability of mobile intelligent terminal systems, and reduce the service life of flash, so as to reduce the fingerprint search overhead and reduce memory consumption. Requirements and impact on application performance, life extension effects

Active Publication Date: 2017-05-03
HONOR DEVICE CO LTD
View PDF5 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] First, there is data duplication in the storage system. For example, the redundancy of duplicate data generated by application installation and update is about 45%.
[0005] Second, smart terminals use flash memory (flash) as a permanent storage medium, which has poor write performance and limited number of erase operations
In addition, the erasing times of the flash storage medium are limited. If there are many duplicate data, it will cause a large number of write operations, which will reduce the service life of the flash and affect the system reliability of the mobile smart terminal.
Since the duplicate data fingerprint query operation provided by the existing technology has high requirements on computing resources and storage resources, the application of the existing data deduplication technology to the smart terminal will seriously affect the system reliability of the mobile smart terminal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Repeated data deleting method and device
  • Repeated data deleting method and device
  • Repeated data deleting method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the purpose, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only some of the embodiments of the present invention, rather than all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] Embodiments of the present invention provide a method and device for deduplication of data, which are applied to mobile smart terminals and improve the reliability of the smart mobile terminal system. Wherein, the method and the device are based on the same inventive concept, and since the principles of the method and the device to solve problems are similar, the implementation of the device and the method can be referred to each ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a repeated data deleting method and device, and improves system reliability of a mobile intelligent terminal. The method comprises the steps of: carrying out matching on a data fingerprint of a current data block and data fingerprints in a hotspot hash table, and if a matching result is that the data fingerprint of the current data block is consistent with one data fingerprint in the hotspot hash table, determining the current data block as repeated data; and if a matching result is that the data fingerprint of the current data block is inconsistent with the data fingerprints in the hotspot hash table, carrying out matching on the data fingerprint of the current data block and data fingerprints in a hash fingerprint table, and when a matching result is that the data fingerprint of the current data block is consistent with one data fingerprint in the hash fingerprint table, determining the current data block as the repeated data, wherein the data fingerprint of each harsh table item in the hotspot hash table is a data fingerprint of which repeated times in at least one file reach a set threshold value, and the data fingerprints of the hash fingerprint table are stored data fingerprints of all the data blocks.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method and device for deduplication of data. Background technique [0002] With the development of computers, smart mobile terminals have profoundly changed people's lives. In recent years, the computing and storage capabilities of mobile smart terminals have developed rapidly. [0003] At present, the storage system of mobile smart terminals has the following specific characteristics: [0004] First, there is data duplication in the storage system. For example, the redundancy of duplicate data generated by application installation and update is about 45%. [0005] Second, the smart terminal adopts a permanent storage medium of flash memory (flash), which has poor write operation performance and a limited number of erase operations. [0006] Due to the existence of duplicate data in the storage system, the reliability of the system is greatly affected, so it is urgen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06
CPCG06F3/0641
Inventor 毛波吴素贞王雅坤
Owner HONOR DEVICE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products