Image based repeated data deletion method and apparatus

A technology of data deduplication and image, which is applied in the field of data processing, can solve problems such as inability to reduce data and obtain deduplication rate, and achieve the effect of reducing storage capacity, saving storage space, and increasing reduction ratio

Inactive Publication Date: 2016-05-04
HUAWEI TECH CO LTD
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, data such as images are compressed and encoded, and it is difficult to obtain a

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image based repeated data deletion method and apparatus
  • Image based repeated data deletion method and apparatus
  • Image based repeated data deletion method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

[0028] figure 1 A schematic flowchart of an image-based data deduplication method provided by an embodiment of the present invention, as shown in figure 1 As shown, the method includes step S101-step S104:

[0029] Step S101, obtaining the pixel matrix of the image to be stored;

[0030] Step S102, according to the pixel matrix, segment the image to be stored to obtain a plurality of image blocks, obtain the weak block fingerprints of the image blocks, and obtain the strong block fingerprints of the image blocks;

[0031] It should be noted that, in the method provided in this embodiment, obtaining multiple image blocks requires two methods of horizontal sliding segmentation and vertical sliding segmentation to realize the image block of the image to be stored. For the specific implementation of horizontal segmentati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an image based repeated data deletion method and apparatus. The method provided by an embodiment of the invention comprises the steps of obtaining a pixel matrix of a to-be-stored image; according to the pixel matrix, segmenting the to-be-stored image to obtain image blocks, and obtaining weak block fingerprints of the image blocks; judging whether weak reference fingerprints as same as the weak block fingerprints exist in a fingerprint library or not, and when the weak reference fingerprints as same as the weak block fingerprints exist in the fingerprint library, obtaining the weak reference fingerprints; according to the weak reference fingerprints, obtaining first reference image blocks from an image library; according to the first reference image blocks, compressing the image blocks to obtain compressed image blocks, and storing the compressed image blocks; and when the weak reference fingerprints as same as the weak block fingerprints do not exist in the fingerprint library, storing the weak block fingerprints as new weak reference fingerprints in the fingerprint library, and storing the image blocks in the image library. According to the image based repeated data deletion method and apparatus provided by the invention, the deletion rate of repeated data of the image is increased, the reduction ratio of the image is increased, and the storage, transmission and processing speeds of the image are increased.

Description

technical field [0001] The invention relates to the field of data processing, in particular to an image-based deduplication method and device. Background technique [0002] Data deduplication technology is a data reduction technology applied to storage systems, aiming to reduce the storage capacity used in storage systems. By looking for duplicate variable-sized data blocks at different locations in different files. Only one of the duplicate data blocks is kept, and the others are replaced with indicators, thereby eliminating redundant data and reducing stored data. Highly redundant data sets (such as backup data) benefit greatly from data deduplication technology, and users can achieve a reduction ratio of 10:1 to 50:1. [0003] However, data such as images are compressed and encoded, and it is difficult to obtain a deduplication rate simply by using existing data deduplication technology, and cannot be reduced. Contents of the invention [0004] In a first aspect, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06T7/00
CPCG06F16/162G06F16/51
Inventor 钟延辉曾凯
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products