Columnar storage and query method and device based on hard disk and memory

A hard disk and memory technology, applied to columnar storage and query methods and devices based on hard disk and memory. It addresses the problems that hard disk performance cannot match that of memory and that data cannot be stored entirely in memory, with the effects of facilitating data compression, reducing memory consumption, and improving efficiency.

Active Publication Date: 2015-06-17
TRANSWARP INFORMATION TECH SHANGHAI
5 Cites, 27 Cited by

AI Technical Summary

Problems solved by technology

In practice, the data volume of a production system often reaches the TB or PB level, and the data cannot be stored entirely in memory.
As hardware technology has developed, the read and write performance of hard disks such as SSDs (Solid State Drives) has steadily improved, and using hard disks instead of memory as a data cache has become a trend. At this stage, however, the read and write performance of hard disks still cannot match that of memory. Hard-disk-based storage, and in particular the design of efficient columnar storage, is therefore a meaningful and challenging problem.


Embodiment Construction

[0073] In a typical configuration of the present application, the terminal, the device serving the network, and the trusted party each include one or more processors (CPUs), input/output interfaces, network interfaces, and memory.

[0074] Memory may include volatile memory in computer-readable media, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer-readable media.

[0075] Computer-readable media include permanent and non-permanent, removable and non-removable media, and may store information using any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), re...

Abstract

The invention provides a columnar storage and query method and device based on hard disk and memory. Meta-information is created for the data table corresponding to a data source, so that the table structure for that data source is established in memory; columnar data blocks are then generated from the current data according to the meta-information and stored on the hard disk. Memory is thereby used more effectively, subsequent queries against data on the hard disk achieve performance similar to queries against data in memory, and powerful data analysis built on high-speed queries can be supported. In addition, when a column is an index column, subsequent query efficiency can be improved by building an inverted index for each index column and storing the index, using a RadixTree structure, in a file at the corresponding location on a solid state disk.
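The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `ColumnarTable`, its method names, the file naming scheme, and the fixed-width binary encoding are all hypothetical, and the inverted index is stored as a plain JSON dictionary rather than the RadixTree structure the abstract names.

```python
import json
import os
import struct
import tempfile
from collections import defaultdict


class ColumnarTable:
    """Hypothetical sketch: metadata stays in memory, column values go to disk."""

    def __init__(self, directory, schema, index_columns=()):
        # schema maps column name -> struct format code ("q" = int64, "d" = float64)
        self.directory = directory
        self.schema = dict(schema)
        self.index_columns = set(index_columns)
        self.blocks = []  # in-memory meta-information, one entry per block

    def write_block(self, rows):
        """Split a batch of rows into one file per column and record metadata."""
        block_id = len(self.blocks)
        meta = {"rows": len(rows), "files": {}, "indexes": {}}
        for pos, (name, fmt) in enumerate(self.schema.items()):
            values = [row[pos] for row in rows]
            path = os.path.join(self.directory, f"block{block_id}_{name}.col")
            with open(path, "wb") as f:
                f.write(struct.pack(f"<{len(values)}{fmt}", *values))
            meta["files"][name] = path
            if name in self.index_columns:
                # Inverted index: value -> row offsets inside this block.
                # Assumption: a JSON dict stands in for the RadixTree.
                inverted = defaultdict(list)
                for offset, value in enumerate(values):
                    inverted[str(value)].append(offset)
                index_path = path + ".idx"
                with open(index_path, "w") as f:
                    json.dump(inverted, f)
                meta["indexes"][name] = index_path
        self.blocks.append(meta)

    def read_column(self, block_id, name):
        """Read a single column of one block without touching other columns."""
        meta = self.blocks[block_id]
        with open(meta["files"][name], "rb") as f:
            data = f.read()
        return list(struct.unpack(f"<{meta['rows']}{self.schema[name]}", data))

    def lookup(self, block_id, name, value):
        """Return row offsets in the block where the index column equals value."""
        with open(self.blocks[block_id]["indexes"][name]) as f:
            return json.load(f).get(str(value), [])


# Usage: two columns, with an inverted index on user_id.
with tempfile.TemporaryDirectory() as directory:
    table = ColumnarTable(directory, {"user_id": "q", "amount": "d"},
                          index_columns=["user_id"])
    table.write_block([(1, 9.5), (2, 3.0), (1, 4.25)])
    amounts = table.read_column(0, "amount")
    matches = table.lookup(0, "user_id", 1)
```

Only the small `blocks` list is held in memory here; every column value and index entry is read from disk on demand, which is the division of labor the abstract describes.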

Description

Technical field

[0001] The present application relates to the fields of communications and computers, and in particular to a columnar storage and query method and device based on hard disk and memory.

Background

[0002] With the rapid development of traditional enterprise business, processing big data has become an unavoidable problem in every industry. Traditional databases use row-based storage, which stores complete data rows one after another in the file system. Row storage suits scenarios where queries use most of the data columns, such as OLTP (On-Line Transaction Processing) queries. For OLAP (On-Line Analytical Processing), however, users typically query only a few data columns, and row storage loads many unneeded columns, which degrades performance. Columnar databases emerged to solve this problem. Columnar storage stores the same ...
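The difference between the two layouts can be made concrete with a short Python sketch. The sample table, the column names, and the "values touched" counter are illustrative assumptions, not part of the application; the counter simply stands in for the amount of data a query has to load.

```python
# Hypothetical sample table: (id, name, age, spend)
rows = [
    (1, "alice", 30, 120.0),
    (2, "bob", 25, 80.5),
    (3, "carol", 41, 310.0),
]
column_names = ("id", "name", "age", "spend")


def to_columns(rows, names):
    """Pivot row tuples into one list per column (a columnar layout)."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def sum_row_store(rows, names, target):
    """Sum one column from a row layout; every field of every row is loaded."""
    i = names.index(target)
    total, touched = 0, 0
    for row in rows:
        touched += len(row)  # the whole row is read to reach one field
        total += row[i]
    return total, touched


def sum_column_store(columns, target):
    """Sum one column from a columnar layout; only that column is loaded."""
    values = columns[target]
    return sum(values), len(values)


columns = to_columns(rows, column_names)
row_result = sum_row_store(rows, column_names, "spend")  # (510.5, 12)
col_result = sum_column_store(columns, "spend")          # (510.5, 3)
```

Both layouts return the same sum, but the row layout touches all twelve stored values while the columnar layout touches only the three in the `spend` column, which is the OLAP advantage the background describes.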

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/1737, G06F16/221, G06F16/2453
Inventor: 张常淳
Owner: TRANSWARP INFORMATION TECH SHANGHAI