A hbase column aggregation method

An aggregation method and aggregation column technology, applied in the field of computer information storage, can solve problems such as slow aggregation query performance, increase data redundancy, affect query performance, etc., to reduce IO requests and data comparison or calculation, and reduce data redundancy. , Improve the effect of storage performance

Active Publication Date: 2021-12-21
西安烽火软件科技有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0016] 1) MapReduce aggregation scheme: actually adopt the Read-Write method, that is, first read out the existing values ​​in the library, rewrite them after calculation, and increase the IO request; MapReduce needs to re-scan all the data in the table every time and repeat the calculation Relatively large, waste of computing resources; MapReduce batch calculation is relatively poor in real-time; cannot support convective data storage, only fixed files can be used as input
[0017] 2) Scan&Endpoint aggregation scheme: each aggregation query needs to query all relevant data for aggregation operation, which requires more resources, and is prone to timeout exceptions, which affects query performance; the historical data that needs to be aggregated in the data table requires a long-term Save, increase data redundancy, consume storage, aggregate query performance is getting slower and slower, and resource waste is increasing; Scan&Endpoint server can only achieve aggregation on the same region, and the client needs a second aggregation operation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A hbase column aggregation method
  • A hbase column aggregation method
  • A hbase column aggregation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0066] Below in conjunction with accompanying drawing, technical scheme of the present invention is described in further detail:

[0067] The present invention expands the existing HBase server query mechanism, and the HRegionServer architecture is as follows image 3 As shown, the aggregation scanner is added, image 3 The aggregation scanner (store) in the table expands the description information of the table and adds column aggregation properties image 3 The description information of the table in; when querying, the multi-version...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a column aggregation method of HBASE. Based on the HBase server query mechanism, the system architecture of the method is provided with an aggregation scanner module, and the description information of the aggregation table is expanded, and column aggregation attributes are added therein; in the query When merging, the multi-version data of the same cell is aggregated and calculated according to the column aggregation attribute to realize the aggregation on the cell; when the data is merged, the HBase server will have multiple versions according to the column aggregation attribute information in the aggregation table description. The data is aggregated and the result is retained, and the non-result data is cleared after the merge operation is completed. Compared with the prior art, the method disclosed in the present invention has better improvements in terms of real-time performance, performance and resource usage of aggregation table query.

Description

technical field [0001] The invention discloses an HBase column aggregation method and relates to the technical field of computer information storage. Background technique [0002] HBase is a high-reliability, high-performance, column-oriented, and scalable open source non-relational database implemented with reference to Google's BigTable, using HDFS as the underlying storage. With the development and application of big data technology, HBase has gradually become a NoSQL distributed storage system widely used in the industry. It is highly reliable, column-oriented, and open source, and has been successfully used in production systems by companies such as Facebook and Alibaba. [0003] The HBase data model is shown in the following table: [0004] [0005] The most basic unit of HBase is the column (Column, Qualifier); one column or multiple columns form a column cluster (Family, Store), and one column cluster or multiple column clusters form a row (Row), and the unique r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2455G06F16/22G06F16/27
CPCG06F16/25
Inventor 崔博曹俊亮周帅锋王勇强
Owner 西安烽火软件科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products