Unlock instant, AI-driven research and patent intelligence for your innovation.

Partitioned connection method oriented to mixed type big data processing systems

A technology of big data processing and connection method, applied in the field of big data, can solve the problems of single, unable to realize cross-system data processing, etc., to achieve the effect of improving performance

Inactive Publication Date: 2015-02-11
LANGCHAO ELECTRONIC INFORMATION IND CO LTD
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the interactive analysis engine in the existing hybrid big data architecture is only for a single big data system, and cannot realize cross-system data processing
For example, the current data in Hive and HBase cannot be directly associated. The usual practice is to perform a data migration in a single system of Hive or HBase. The data redundancy and transmission delay caused by a large amount of data are intolerable of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Partitioned connection method oriented to mixed type big data processing systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] The present invention will be described in detail below with reference to the accompanying drawings.

[0016] The implementation of the present invention will be described in detail below in conjunction with the accompanying drawings and examples, so as to fully understand and implement the process of how to apply technical means to solve technical problems and achieve technical effects in the present invention. It should be noted that, if there is no conflict, the embodiments of the present invention and the mutual features of the embodiments are within the protection scope of the present invention.

[0017] The present invention takes a specific execution process as an example to illustrate the operating mechanism and processing process of the system.

[0018] There is a table hive_table in the Hive system, including the primary key id, the partition field part, and the content field value, and the table hbase_table in the HBase system includes the primary key id, and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a partitioned connection method oriented to mixed type big data processing systems. The partitioned connection method oriented to mixed type big data processing systems is capable of satisfying the transactional analysis business application demands of the industrial big data in allusion to different processing systems and greatly improving the property of the analysis through partition, coprocessr and mapjoin, and can be further applied to the transactional analysis of join-based grouping, counting and sorting. According to the partitioned connection method oriented to mixed type big data processing systems, the size of the data joining in the transmission, cache and join processes is decreased through determining the Hive query partition; by sufficiently utilizing the advantages of the distributed structure, the cache processes of all the nodes are executed in parallel; through caching data at each node, the join execution efficiency can be accelerated; and the data size and the node amount of the HBase table can be extended as required.

Description

technical field [0001] The present invention relates to the technical field of big data, and specifically relates to a partition connection method for hybrid big data processing systems. Background technique [0002] In response to the industry's big data business application requirements, computing frameworks and systems for data-intensive applications continue to emerge, and these systems only provide solutions for their respective problem domains. In order to cope with the increasingly complex business needs of the industry, it is necessary to use multiple processing architectures in large-scale clusters or data centers to store and process massive amounts of data. Therefore, a hybrid big data processing system has emerged, which integrates multiple processing modes such as batch processing, memory computing, stream processing, and NoSQL database, such as the YARN architecture, to meet the real-time processing, interactive processing, efficient retrieval, and in-depth pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/278
Inventor 亓开元卢军佐杨勇辛国茂
Owner LANGCHAO ELECTRONIC INFORMATION IND CO LTD