Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Junk data recovery processing method and device, and electronic equipment

A technology for garbage data, recycling and processing

Active Publication Date: 2020-02-28
ALIBABA GRP HLDG LTD
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this writing method will bring an additional burden, that is, the garbage collection of old data
Garbage identification and recycling is more complicated when there is shared duplicate data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Junk data recovery processing method and device, and electronic equipment
  • Junk data recovery processing method and device, and electronic equipment
  • Junk data recovery processing method and device, and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0095] Such as Figure 10 As shown, it is one of the schematic flowcharts of the garbage data recovery and processing method of the embodiment of the present invention. This figure shows that the processing of garbage data in the shared state in the device includes:

[0096] S101: Acquire at least one first data file in a shared state in a device segment. In the LSBD architecture, the sharing of data files generally exists in the form of hard links. As mentioned above, common application scenarios are device snapshots or device cloning.

[0097] S102: Acquire a first index file corresponding to the first data file and a second index file corresponding to at least one second data file having a sharing relationship with the first data file. The first index file mentioned here is an index file in the same device segment as the first data file. The second index file may be an index file in the snapshot device or the clone device, and the second index file does not directly point...

Embodiment 2

[0136] Such as Figure 14 As shown, it is the fifth schematic flow diagram of the garbage data recovery and processing method of the embodiment of the present invention. In this embodiment, the garbage recovery process performed after the garbage data block is determined is emphasized. As an implementable manner, in this embodiment, the object of garbage collection may not distinguish between data files in the shared state and data files in the non-shared state, but after the garbage data blocks are determined, the garbage collection objects in the data files Data blocks are extracted, reorganized to form a new data file, and then replaced. Specifically, as Figure 15 As shown, the processing flow includes:

[0137] S301: Obtain at least one valid data block in at least one existing data file in the device segment, and use the valid data block to generate at least one new data file, where the valid data block is a data block other than the garbage data block in the data file...

Embodiment 3

[0142] Such as Figure 15 As shown, it is one of the structural diagrams of the garbage data recycling and processing device in the embodiment of the present invention, and the processing device includes:

[0143] A first obtaining module 11, configured to obtain at least one first data file in a shared state in the device segment;

[0144] The second obtaining module 12 is configured to obtain a first index file corresponding to the first data file and a second index file corresponding to at least one second data file having a sharing relationship with the first data file. Among them, this part of processing may further include:

[0145] Obtain the first file name of the first data file, and obtain the corresponding file ID according to the first file name;

[0146] Obtain the second file names of all the second data files sharing the file ID;

[0147] One or more device segments where all the second data files are located are determined according to the second file names,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the invention provide a junk data recovery processing method and a device, and electronic equipment. The method comprises the steps of obtaining at least one first data file in a sharedstate in a device segment; obtaining a first index file corresponding to the first data file and a second index file corresponding to at least one second data file having a sharing relationship withthe first data file; and determining a junk data block in the first data file according to the first index file and the second index file, and executing first junk recovery processing. According to the junk data recovery processing method, junk recovery in a data sharing state can be realized, and in the process of determining the junk data block, the direct and indirect data reference relationship of the shared data block is fully considered, so that the junk data block is accurately determined, and the junk recovery processing is further executed.

Description

technical field [0001] The present application relates to a garbage data recycling method, device and electronic equipment, belonging to the field of computer technology. Background technique [0002] In current storage products, basically no overwrite (overwrite) method is used when writing data, but new data is stored in a new location. The benefit of this is that the performance of writing will be better, the availability of writing will be higher, and data errors will not easily occur. However, this writing method will bring an additional burden, that is, garbage collection of old data. The identification and recycling of junk data is more complicated when there is shared duplicate data. Contents of the invention [0003] Embodiments of the present invention provide a method, device, and electronic device for recovering and processing garbage data, so as to solve garbage recovery when data files are shared. [0004] In order to achieve the above purpose, an embodime...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/13G06F16/16G06F16/176
Inventor 佘海斌
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products