Two-stage storage method combined with RDBMS (relational database management system) and Hadoop cloud storage

A cloud storage and cloud technology, applied in instruments, computing, electrical and digital data processing, etc., can solve the problems of system efficiency decline, inability to provide secure and efficient storage management, and high real-time requirements for data reading and querying. The effect of efficient storage

Inactive Publication Date: 2012-08-22
WUHAN UNIV
View PDF4 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (4) Data is often "stored once and applied multiple times", and the real-time requirements for data reading and query are relatively high
Relational database (RDBMS) is one of the most effective means of managing structured data, but when the amount of data increases, the efficiency of the system will seriously drop, and it cannot provide safe and efficient storage management

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Two-stage storage method combined with RDBMS (relational database management system) and Hadoop cloud storage
  • Two-stage storage method combined with RDBMS (relational database management system) and Hadoop cloud storage
  • Two-stage storage method combined with RDBMS (relational database management system) and Hadoop cloud storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The basic technical idea of ​​the present invention is: to comprehensively utilize the advantages of RDBMS and Hadoop to solve the problem——to store by combining the front-end database and the back-end cloud storage. On the one hand, the data in small files with structured features is stored in the database (front-end); on the other hand, when the capacity of the database reaches a certain amount, the entire database file is written into the Hadoop cloud (back-end) storage and cleared at the same time. The database continues to accept new data writes. Therefore, Hadoop stores large files collected and merged by RDBMS, while RDBMS manages a lightweight database. The advantages of both are utilized while their respective disadvantages can be avoided.

[0034] The technical solution of the present invention will be described in detail below in conjunction with the drawings and embodiments.

[0035]The embodiment adopts the RDBMS system at the front end, and the Hadoop clo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a two-stage storage method combined with an RDBMS (relational database management system) and Hadoop cloud storage. The two-stage storage method is characterized in that the front end adopts the RDBMS, and the back end adopts a Hadoop cloud end; the front end is used for collecting small files of structured data and storing in a database of the RDBMS; and when the capacity of the database achieves a certain quantity, the whole database can be used as a large file to be written into the Hadoop cloud end for storage, the database is simultaneously emptied, and the database can continuously accept the writing of the new data. According to the two-stage storage method provided by the invention, the lightweight relational database is utilized for meeting high-efficient writing and large-scale concurrent access requirements of the structured data, and the RDBMS can further automatically merge the mass small files into the large file so as to avoid the inconvenience caused by developing a tool for merging the small files; and the Hadoop cloud storage is utilized for realizing high-efficient storage of the mass files of the large database, so that the defects of Hadoop can be avoided, and the advantages of the cloud storage can be further fully utilized for realizing high-efficient storage and management. As long as the data meets the structuring requirements, the technical scheme provided by the invention can be used for high-efficient storage, and the two-stage storage method has certain universality.

Description

technical field [0001] The invention relates to the field of efficient storage and management of massive small files, in particular to an efficient storage method for massive structured data combined with RDBMS and Hadoop. Background technique [0002] Since the beginning of the 21st century, due to the explosive growth of information, there have been more and more application services for massive structured data. Typical applications include Weibo, instant messaging tool information, mobile phone text messages, and complex system logs. Such services often have the following characteristics: [0003] (1) The data has structured features, and its structure is relatively simple; [0004] (2) The data generation speed is very fast, generating tens of thousands or even hundreds of thousands of records per second; if stored in the form of traditional files, a large number of small files will be formed. [0005] (3) The data stock is very large, with more than tens of millions, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 徐正全刘小俊潘少明
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products