Real-time data warehouse platform

A real-time data and warehouse technology, applied in the network field, can solve problems such as poor performance, inability to be used by business personnel, and poor query performance of small data volumes, so as to improve query performance, realize real-time synchronization, and not easily lose logs.

Inactive Publication Date: 2018-03-09
百味云科技股份有限公司
View PDF4 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the existing technical solutions for data warehouses, offline and non-updatable distributed hive data warehouses are used. It is difficult to achieve the level of real-time data warehouses, and it is impossible to synchronize business databases in real time.
If the timeliness cannot be guaranteed, more improvements cannot be provided to the existing business data analysis.
In addition, the existing data warehouse cannot be easily used by business personnel
[0003] Overall, the existing log system has the following defects: 1) Most of the existing systems are offline distributed data warehouses of hive, which cannot satisfy the user's update and record-level insertion functions
2) poor performance
The existing hive distributed data warehouse has extremely poor query performance for small data volumes, and even cannot reach the performance of traditional relational data warehouses
3) Fusion of log real-time data and historical data
Existing data warehouses are all offline data, which cannot be integrated with real-time log data, which indirectly hinders the analysis and mining of all business data
[0004] Aiming at the problem of up-stop in related technologies, no effective solution has been proposed yet

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time data warehouse platform
  • Real-time data warehouse platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention belong to the protection scope of the present invention.

[0020] Such as figure 1 As shown, a real-time data warehouse platform 100 according to an embodiment of the present invention includes: a business data collection system, a log data collection system, and an analysis system; the business data collection system includes a candu module 118, and the change log of the business data by the candu module 118 Carry out synchronous analysis, and store the analyzed data in the kudu storage module 130 of the analysis system; the log data acquisition sys...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a real-time data warehouse platform, which comprises a business data acquisition system, a log data acquisition system and an analysis system. The business data acquisition system comprises a candu module; the candu module is used for synchronously analyzing an update log of business data and storing the analyzed data into a kudu storage module of an analysis system; the log data acquisition system is used for acquiring log data, calculating the log data, and storing a calculation result to the kudu storage module; the kudu storage module is used for carrying out data analysis in real time according to the stored analyzed data and the stored calculation result. According to the real-time data warehouse platform provided by the invention, the update log of business data distributed on each business system is collected in real time through the candu module, so that real-time synchronization of the business data is realized.

Description

technical field [0001] The invention relates to the field of network technology, in particular to a real-time data warehouse platform. Background technique [0002] In the existing technical solutions for data warehouses, offline and non-updatable distributed hive data warehouses are used. It is difficult to achieve the level of real-time data warehouses, and it is impossible to synchronize business databases in real time. If the timeliness cannot be guaranteed, more improvements cannot be provided to the existing business data analysis. In addition, the existing data warehouse cannot be easily used by business personnel. [0003] Generally speaking, the existing log system has the following defects: 1) Most of the existing systems are offline distributed data warehouses of hive, which cannot meet the user's update and record-level insert functions. 2) Poor performance. The existing hive distributed data warehouse has extremely poor query performance for small data volume...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2358G06F16/2379G06F16/283
Inventor 阙子扬赵卫刘健周娜
Owner 百味云科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products