Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A New Metadata Management System and a Metadata Attribute Hybrid Indexing Method

A management system and metadata technology, applied in the storage field, can solve the problems of high time and space overhead, and achieve the effect of reducing time and space overhead

Active Publication Date: 2011-12-28
天津艺点意创科技有限公司
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the problems of large time and space overheads in the existing metadata management methods in mass storage systems, provide a hybrid index method for metadata attributes, and build a new metadata management system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A New Metadata Management System and a Metadata Attribute Hybrid Indexing Method
  • A New Metadata Management System and a Metadata Attribute Hybrid Indexing Method
  • A New Metadata Management System and a Metadata Attribute Hybrid Indexing Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] Such as figure 1 As shown, a new metadata management system includes an attribute frequency division device, a high-frequency metadata index device and a low-frequency metadata index device.

[0045] The attribute frequency division device includes a metadata attribute usage frequency statistics module and a metadata attribute frequency division module. The metadata attribute usage frequency statistics module is used for the number of times the metadata attribute is used and the update time of the metadata attribute. The metadata attribute frequency division module is used to determine whether the attribute is a high-frequency or low-frequency attribute based on the information collected by the metadata attribute usage frequency statistics module, and store the value of the high-frequency metadata attribute in all metadata with the corresponding metadata identifier. Enter the high-frequency metadata set, and add the corresponding metadata identifier to the value of th...

Embodiment 2

[0052] A hybrid indexing method for metadata attributes, comprising the following steps:

[0053] 1) Divide metadata attributes into high-frequency metadata attributes and low-frequency metadata attributes, and store them in high-frequency metadata sets and low-frequency metadata sets respectively after adding metadata tags; 2) Use KD-tree and B-tree to establish high-frequency metadata sets Index; 3) Use artificial immune algorithm to index low-frequency metadata sets.

[0054] Step 1 may specifically include the following procedures:

[0055] 1.1) Define activity thresholds for metadata attributes , as the basis for classifying metadata attributes.

[0056] 1.2) Define the activity of metadata attributes , as the basis for measuring the activity of metadata attributes, use the formula calculated, where is the time when the metadata attribute was most recently accessed, is the time the metadata property was created, is the current time of the system, is the ...

Embodiment 3

[0075] Assume that a piece of metadata includes attributes A, B, and C, where the creation time of A is 200, the last access time is 500, the creation time of B is 100, the last access time is 100, and the creation time of C is It is 100, the latest access time is 550, the current system time is 600, in the cycle T Inner attribute A is accessed 300 times, attribute B is accessed 100 times, and attribute C is accessed 200 times.

[0076] Set activity threshold according to step 1.1) is 1.2, according to the calculation method given in step 1.2), the activity of attributes A, B and C are calculated as follows:

[0077] The activity of attribute A is 1-0.003+0.217-0.003=1.211;

[0078] The activity of attribute B is 1-0.01+0.161-0.002=1.149;

[0079] The activity of attribute C is 1-0.005+0.256-0.002=1.249;

[0080] According to step 1.3) attribute A and attribute C are high-frequency attributes, and attribute B is low-frequency attributes.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a mixed indexing method for metadata attributes. The method comprises the following steps of: dividing the metadata attributes into a high-frequency metadata attribute and a low-frequency metadata attribute according to the used frequency, creation time and recently-accessed time of each metadata attribute, and aiming at the properties of the high-frequency metadata attribute and the low-frequency metadata attribute, establishing indexes by using KD-tree and B-tree trees and an artificial immune algorithm. The invention also provides a structure of a novel metadata management system, and introduces functions and processes of main modules. In the mixed indexing method, aiming at the problems of large time and space expenditure, large exceptional space and the like during the management and searching of metadata, the efficiency of searching the high-frequency metadata attribute is improved, and the space expenditure for managing the low-frequency metadata attribute is reduced.

Description

technical field [0001] The invention belongs to the technical field of storage, relates to a metadata management system therein, and specifically relates to a method for establishing a metadata index. Background technique [0002] Mass storage systems need to respond to metadata access requests from a large number of users. According to statistics, about 70% of access requests are for metadata access requests. The performance of metadata management directly affects the overall performance of mass storage systems. Metadata in a mass storage system contains multiple attributes, but users generally focus on certain attributes when accessing them, so that some attributes in the same piece of metadata are frequently used, while some attributes are used less frequently, and a single method is used. Establishing an index to manage metadata cannot address the different usage frequencies of metadata attributes, and there are problems such as large time and space overheads. [0003] ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 蔡涛牛德姣宋丽丽
Owner 天津艺点意创科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products