Storage method and system for kafka's hbase database

A database and data writing technology, applied in the direction of database design/maintenance, structured data retrieval, special data processing applications, etc., can solve the problems of high logic maintenance cost, unfavorable post-maintenance, low efficiency, etc., to improve efficiency and avoid Lost, boosted output bandwidth effect

Active Publication Date: 2019-04-30
北京中交兴路车联网科技有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the process of data collection, it is very common for data to be transferred through Kafka. Kafka designs many topics (topics), and writes data into the Hbase database for different topics. Usually, different methods are required to make the data written into the Hbase database, because The data is different in structure and content, and the topics are different. This makes it necessary to write separate logic codes for different topics to write data, which is time-consuming, laborious, inefficient, and not conducive to later maintenance. Too much logic is expensive to maintain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Storage method and system for kafka's hbase database
  • Storage method and system for kafka's hbase database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0036] Such as figure 1 As shown, the present invention provides a kind of storage method for the Hbase database of kafka (distributed message queue), it is characterized in that, comprises the steps:

[0037] A storage method for the Hbase database of kafka is characterized in that, comprising the steps:

[0038] S1: Collect the data of all topics (topics) in the kafka cluster and save them in the queue;

[0039] S2: Configure the corresponding relationship between the topic and the delimiter, filter rule, and storage rule in the data in the configuration file;

[0040] S3: Serialize the data according to the corresponding relationship, and filter the data;

[0041] S4: read the configuration information in the configuration file, create a connection with the Hbase database, and construct the data into a put object;

[0042] S5: Store the put object...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an Hbase database entering method for kafka. The method includes the steps that S1, all topic data in a kafka cluster is collected and stored in a queue; S2, the corresponding relation between topic and separators in the data, a filtering rule and a database entering rule are configured in a configuration file; S3, according to the corresponding relation, the data is serialized and filtered; S4, configuration information in the configuration file is read, connection with a Hbase database is set up, and the data is constructed into a put object; S5, the put object enters the Hbase. The problem that different topic types in kafka need to be processed individually is solved, the method general and adaptive to all topic methods is provided, the high-performance Hbase database entering method is constructed, the data writing efficiency is greatly improved, output bandwidth is increased, robot network and disk performance are used to the maximum, data safety is ensured to the maximum by means of double-queue design, and data loss is avoided.

Description

technical field [0001] The invention relates to the technical field of computer data processing, in particular to a storage method and system for an Hbase database of Kafka. Background technique [0002] Kafka (Distributed Message Queue) is a high-throughput distributed publish-subscribe messaging system that can handle all action data streams in consumer-scale web sites. Such actions (web browsing, searching and other user actions) are a key factor in many social functions on the modern web. These data are usually addressed by processing logs and log aggregation due to throughput requirements. For log data and offline analysis systems like Hadoop (distributed system framework), because of the limitation of real-time processing, using Kafka for processing is a feasible solution. The purpose of Kafka is to unify online and offline message processing through Hadoop's parallel loading mechanism, and to provide real-time consumption through clustering. [0003] The Hbase data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/21
CPCG06F16/21
Inventor 曹宇余效伟肖赞李旭阳
Owner 北京中交兴路车联网科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products