File data management method based on relational database and K-D tree indexes

A technology for file data and management methods, applied in relational databases, database models, electronic digital data processing, etc., can solve problems such as file data loss, strong coupling between file systems and application systems, and difficulty in accessing, so as to reduce the occupied space , Improve retrieval efficiency, high retrieval efficiency effect

Active Publication Date: 2014-09-24
ZHEJIANG UNIV
View PDF3 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Each business system that needs to be integrated has left a large amount of documents, and its management will encounter the following problems: (1) The amount of document data is large. Taking a district-level unit as an example, all the documents involved The total size has exceeded 5T, and the data volume is increasing by 2T per year
(2) There is no backup mechanism for files, and any security incident will result in the loss of files
(3) Documents are stored in the form of "file name + file path", which lacks an effective query mechanism, making it difficult to consult
(4) The storage efficiency of files and data is low. Some business systems store image files directly in the database, and the reading and writing of files must go through the SQL engine, so the storage efficiency is low
The business attributes of the file are entrusted to the upper application database management, resulting in a strong coupling between the file system and the application system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File data management method based on relational database and K-D tree indexes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0031] (1) Build a distributed storage environment. The experimental environment is a single file metadata management server with 2T hard disk and a file cluster composed of 4 file servers. The operating system is ubantu13.10, and the network transmission rate is 10m / s. Generate non-repeating file names and randomly select their administrative division attributes, and upload files 1000 times at the same time.

[0032] (2) Taking the business background as an example, research the field documents that need to be retrieved for documents and materials, build a relational database on the metadata management server side, and design the table structure of the file metadata database. The principle of designing fields is that each field is related to the business Requirements are related, where the file storage path, whether to delete, and upload time are required fields.

[0033] primary key ID Owned business operationID upload time uploadDate business completion tim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a file data management method based on a relational database and K-D tree indexes. The file data management method comprises the following steps that distribution storage is conducted on files with a consistent Hash algorithm, MD5 values of the files are regarded as Hash values of the files, a mapping relation between the Hash values and servers in a cluster is established, and thus the files with the different Hash values are distributed to the different servers; the relational database is established at a meta data management server side, and the structure of a file meta database table is designed; a multi-dimensional retrieval tree is established according to the number of fields of the file meta database table; corresponding inquiring is conducted according to types of inquiring requests received by the server side, and inquiring results are fed back. According to the file data management method based on the relational database and the K-D tree indexes, the relational database and the file indexes in an internal storage device are used, the usability of fuzzy retrieval is ensured, the high efficiency of range retrieval is also ensured, and the file data management method has important practical application value in the field of massive file data management.

Description

technical field [0001] The invention relates to a management method for massive file data, in particular to a file data management method based on a relational database and a K-D tree index. Background technique [0002] There are massive business data in large-scale enterprise applications that need to be managed. Among these document data, document data (including scanned documents, policies and regulations, etc.) account for the vast majority. How to design a reasonable and efficient document storage and management mechanism according to business characteristics is a very meaningful problem. [0003] Each business system that needs to be integrated has left a large amount of documents, and its management will encounter the following problems: (1) The amount of document data is large. Taking a district-level unit as an example, all the documents involved The total size has exceeded 5T, and the data volume is increasing by 2T per year. (2) There is no backup mechanism for...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/134G06F16/137G06F16/2246G06F16/2264G06F16/284
Inventor 杜震洪张丰刘仁义郑少楠郭绿奕
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products