Unlock instant, AI-driven research and patent intelligence for your innovation.

A data lake file system based on object storage

A file system and object storage technology, applied in file system, file system management, file metadata retrieval, etc., can solve problems such as low data utilization efficiency, lack of data change management, high cost of data change, etc., to improve efficiency and experience , Reduce construction cost, reduce construction period and cost effect

Active Publication Date: 2022-06-07
天津安锐捷技术有限公司
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The main manifestations are: 1. There are various data sources, inconsistent processing methods, and low processing efficiency; 2. The data in the data warehouse is basically fixed, and the cost of data changes is huge; 3. Data analysis and scientific calculations brought about by data changes Subsequent data applications also need to be changed, and the cost is huge; 4. The management of data changes is missing, resulting in the lack of data governance, data blood relationship and other information, and low data utilization efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data lake file system based on object storage
  • A data lake file system based on object storage
  • A data lake file system based on object storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] It should be noted that the embodiments of the present invention and the features of the embodiments may be combined with each other under the condition of no conflict.

[0033] The present invention will be described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.

[0034] like Figure 1-Figure 5 As shown, the present invention is a data lake file system based on object storage, including: a local file storage component, a file management component and a local metadata storage component, wherein the file management component includes an operation transaction management component and a file version management component component; the local file storage component utilizes FUSE API to construct an abstraction layer of local file storage, unified file operation and hierarchical storage, and is used to improve the response speed; the file management component combines the file version management component and the operatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a data lake file system based on object storage, including a local file storage component, a file management component and a local metadata storage component, wherein the file management component includes an operation transaction management component and a file version management component; the local file storage The component is controlled by the file management component. The local file storage component is responsible for saving the business data storage object file locally, and calls the local metadata storage component to save the metadata corresponding to the business data target object; the operation transaction management component is used to control the entire local file storage The life cycle of component transactions, linking file version management components during transaction commit and rollback operations. The invention enables the application side of the component to achieve the effect of caching without being aware of the underlying file system principle, so that the user does not need to care about the details of data management, and only pays attention to the upper user interface to improve the effect and accuracy of data management, reduces the difficulty of data application, and improves the efficiency of data management. Data application flexibility.

Description

technical field [0001] The invention belongs to the field of data lake file systems, in particular to a data lake file system based on object storage. Background technique [0002] The storage of the existing data lake is generally HDFS or OSS object storage or other file storage methods (such as network storage, etc.). The existing method is mainly to solve the storage of structured, semi-structured and unstructured files. It was originally invented. This file system is for distributed computing (HDFS), easy access and use (object storage OSS, S3, CEPH, etc.), a large number of file storage (network storage GlusterFS, etc.), and lack of application scenarios that consider data lakes. [0003] In recent years, due to the storage and management of raw data, data lakes are easier and more flexible to use in subsequent big data processing and AI data applications than data warehouses and data marts, so they have developed rapidly in the market. However, there is no file system...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/11G06F16/14G06F16/172
CPCY02D10/00
Inventor 高旭麟孙社宾孙涛刘珊
Owner 天津安锐捷技术有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More