
Data structure based on cloud computing database system

A data structure and database technology, applied to cloud computing database systems, that addresses unreasonable data structure design and low storage and retrieval efficiency, thereby improving retrieval efficiency and saving storage space.

Inactive Publication Date: 2013-11-20
成都博云科技有限公司

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide a data structure based on a cloud computing database system that addresses unreasonable data structure design and low storage and retrieval efficiency.



Examples


Embodiment 1

[0030] Embodiment 1: Primary key data structure design for transactional data

[0031] The primary key is a transaction data primary key. Transaction data has a time attribute, so the transaction data primary key has a time attribute as well. Storing time information in the primary key helps improve query retrieval speed.

[0032] As shown in Figure 4, the primary key of the transaction data is designed in the following steps. Step 11: a transaction data table 401 is created for each day. Step 12: the primary key of the transaction data table 401 comprises 6 bytes; the first 4 bytes store the time of day (hours, minutes, and milliseconds), and the last 2 bytes hold a counter that increments cyclically by minute. Step 13: the first 4 bytes are set as a hash field, and the last 2 bytes are set as a sequence field. This design has several benefits. Once the data is split into daily tables, the time information can drop the date part and keep only hours, minutes, and milliseconds, so that it fits in 4 bytes...
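The 6-byte layout in Steps 12 and 13 can be illustrated with a minimal Python sketch. Several details here are assumptions rather than part of the source: the time of day is encoded as milliseconds since midnight in a big-endian unsigned 32-bit integer, the sequence field simply wraps modulo 65536, and the function names `millis_of_day`, `make_trans_key`, and `split_trans_key` are hypothetical. The hash distribution of the first field is not modelled.

```python
import struct
from datetime import time

MS_PER_DAY = 24 * 60 * 60 * 1000  # 86,400,000, which fits in 4 bytes


def millis_of_day(t: time) -> int:
    """Milliseconds since midnight (assumed encoding of the time field)."""
    return ((t.hour * 60 + t.minute) * 60 + t.second) * 1000 + t.microsecond // 1000


def make_trans_key(t: time, seq: int) -> bytes:
    """Pack a 6-byte key: 4-byte time-of-day field + 2-byte sequence field."""
    return struct.pack(">IH", millis_of_day(t), seq % 0x10000)


def split_trans_key(key: bytes) -> tuple:
    """Unpack a 6-byte key back into (millis_of_day, sequence)."""
    return struct.unpack(">IH", key)


key = make_trans_key(time(13, 5, 7, 250000), 65537)
print(len(key), split_trans_key(key))
```

Because the date is carried by the daily table rather than by the key, the largest time value is 86,399,999, well below the 32-bit limit, which is consistent with the 4-byte time field described above.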

Embodiment 2

[0033] Embodiment 2: Primary key data structure design for statistical data

[0034] The primary key is a statistical data primary key. Statistical data also has a time attribute, so the statistical data primary key has one too. The smallest time unit for statistical data is the minute (pre-aggregated statistics at second granularity are meaningless).

[0035] As shown in Figure 5, the primary key of the statistical data is designed in the following steps. Step 21: a statistical data table 501 is created for each day. Step 22: the primary key of the statistical data table 501 comprises 6 bytes; the first byte stores the minute, the second byte stores the hour, and the last four bytes hold a counter that increments by minute. Step 23: the first and second bytes are the hash field, and the last four bytes are the sequence field. Statistical data is likewise split into daily tables. Once the data is split by day, the time information only needs to keep hour...
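A comparable sketch for the statistical key in Steps 22 and 23 follows. It assumes big-endian packing and an unsigned 32-bit sequence value; the names `make_stat_key` and `split_stat_key` are hypothetical, and the hash distribution of the first two bytes is again left out.

```python
import struct


def make_stat_key(hour: int, minute: int, seq: int) -> bytes:
    """Pack a 6-byte key: minute byte, hour byte, then a 4-byte sequence field."""
    if not (0 <= hour < 24 and 0 <= minute < 60):
        raise ValueError("hour or minute out of range")
    return struct.pack(">BBI", minute, hour, seq % 0x1_0000_0000)


def split_stat_key(key: bytes) -> tuple:
    """Unpack a 6-byte key back into (minute, hour, sequence)."""
    return struct.unpack(">BBI", key)


stat_key = make_stat_key(hour=14, minute=30, seq=7)
print(stat_key.hex(), split_stat_key(stat_key))
```

Placing the minute before the hour follows the byte order given in Step 22; one plausible effect is that keys generated in consecutive minutes differ in their first byte, although the source does not spell out this motivation.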

Embodiment 3

[0036] Embodiment 3: For the data structure model designed above, the present invention provides a factory class for generating primary keys. The factory must be associated with a table so that it can help that table generate unique key values.

[0037] RowkeyFactory (base class)

[0038] RowkeyFactory is divided into three types according to the type of data table. Factory information is stored by instance. There are different storage rules for different types of data tables, and each factory instance.


[0040] RowkeyFactory4Trans class design


[0043] RowkeyFactory4Stat class design
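The original class listings are not reproduced on this page, so the following Python sketch is only one possible reading of the factory design named above. The class names come from the text; everything else is an assumption: the `next_key` method, the per-instance counter state, the choice to restart the counter at each new minute, and the big-endian byte packing. Hash distribution is not modelled.

```python
import struct
from datetime import datetime


class RowkeyFactory:
    """Base class: one instance per table, holding that table's counter state."""

    SEQ_BITS = 16  # width of the sequence field; subclasses override

    def __init__(self, table_name: str):
        self.table_name = table_name
        self._minute = None  # minute of day the counter currently belongs to
        self._seq = 0

    def _next_seq(self, now: datetime) -> int:
        minute = now.hour * 60 + now.minute
        if minute != self._minute:
            # Assumption: the counter restarts whenever the minute changes.
            self._minute, self._seq = minute, 0
        else:
            # Same minute: increment, wrapping at the field width.
            self._seq = (self._seq + 1) % (1 << self.SEQ_BITS)
        return self._seq

    def next_key(self, now: datetime) -> bytes:
        raise NotImplementedError


class RowkeyFactory4Trans(RowkeyFactory):
    """Transaction keys: 4-byte time of day in ms + 2-byte sequence."""

    SEQ_BITS = 16

    def next_key(self, now: datetime) -> bytes:
        ms = ((now.hour * 60 + now.minute) * 60 + now.second) * 1000
        ms += now.microsecond // 1000
        return struct.pack(">IH", ms, self._next_seq(now))


class RowkeyFactory4Stat(RowkeyFactory):
    """Statistical keys: minute byte, hour byte, 4-byte sequence."""

    SEQ_BITS = 32

    def next_key(self, now: datetime) -> bytes:
        return struct.pack(">BBI", now.minute, now.hour, self._next_seq(now))


trans = RowkeyFactory4Trans("trans_20131120")  # hypothetical table name
stamp = datetime(2013, 11, 20, 13, 5, 7, 250000)
first, second = trans.next_key(stamp), trans.next_key(stamp)
print(first.hex(), second.hex())
```

Keeping the counter inside the factory instance matches the statement that factory information is stored per instance: two tables never share a counter, so each table's keys stay unique independently of the others.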



Abstract

The invention is applicable to the technical field of data structures in cloud computing, and provides a data structure based on a cloud computing database system. The data structure comprises a primary key and a data table. The primary key stores uniquely identifying data in binary form and serves as the retrieval index. The data structure further comprises a primary key control module, which converts the time-attribute data in the primary key's data information into binary codes and stores them in the primary key in a hash-distributed manner. In this design method and system, the primary key structure for transaction data and statistical data is the core of the data structure, and it can greatly improve the storage and retrieval efficiency of a big data system in cloud computing.

Description

Technical field

[0001] The invention belongs to the technical field of data structures in cloud computing, and in particular relates to a data structure based on a cloud computing database system.

Background art

[0002] HBase is a distributed, column-oriented open source database and a subproject of Apache's Hadoop project. Unlike general relational databases, HBase is suited to storing unstructured data, and its schema is column-based rather than row-based. As shown in Figure 1, Hadoop is a big data processing platform composed of a distributed file system (HDFS) and a distributed computing framework (MapReduce). HDFS solves the problem of storing large files, and MapReduce solves the problem of parallel computation over massive data.

[0003] HBase searches based on the Rowkey (row key, which uniquely identifies a row of data). Three methods are supp...


Application Information

IPC(8): G06F17/30
Inventor 叶春生车新奕雷果程涛木刘学
Owner 成都博云科技有限公司