Distributed data analysis processing method based on memory computation

A distributed data and memory computing technology, applied in the field of big data analysis, can solve the problems of non-real-time analysis and processing, slow processing, and data analysis methods that cannot meet the requirements.

Inactive Publication Date: 2016-03-23
陕西艾特智慧信息技术有限公司
View PDF7 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] With the rapid development of the Internet and the rapid expansion of informatization data, there are more and more difficulties in the analysis and processing of massive data. In addition, the continuous development of cloud computing platforms has also brought new challenges and opportunities for data processing and analysis. Traditional data Analytical methods are increasingly unable to meet the requirements; big data is an emerging industry, and the key to gaining benefits lies in the "value-added" of data through "processing" of data, and the rate of "processing" will inevitably determine the "value-added" of benefits. " space, so how to quickly query and analyze massive data becomes particularly important
[0004] The analysis and processing of data by the traditional data analysis platform is not real-time. This is because the traditional data computing framework has the disadvantage of having to read and store. When the amount of data is too large, the processing is often slow, resulting in a large amount of delay; while the new computing The framework is based on the memory computing method, which not only inherits the distributed processing framework of the traditional computing framework, can automatically optimize the computing process, etc., but also makes up for the shortcomings of the traditional computing framework that do not have real-time performance, which greatly improves the data processing speed; therefore, we need a A platform for data analysis and processing based on a new in-memory computing framework, making big data processing faster and more convenient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed data analysis processing method based on memory computation
  • Distributed data analysis processing method based on memory computation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be described in detail below with reference to the accompanying drawings; the following detailed description of the present invention does not limit the present invention; on the contrary, the scope of the present invention is determined by the appended claims.

[0023] The present invention is a distributed data analysis and processing method based on memory computing, and its core architecture diagram is as follows figure 1 Shown; the main execution process is as follows.

[0024] S0: Use a SQL-like parser to parse query statements; data operations such as loading, traversing, and writing are finally converted into logical plans.

[0025] S1: Use the task converter to convert the logical plan generated by the parser into a query analysis expression that can be recognized by the computing framework.

[0026] S2: The expression generated by the task converter is finally optimized and executed by the query optimizer; the work here is completel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a distributed data analysis processing method based on memory computation, comprising the following steps of (1) providing an SQL-like (Structured Query Language) parser which parses an introduced query analysis text into corresponding logical plans and optimizes the logical plans preliminarily; (2) providing a task converter which converts the logical plans generated by the SQL-like parser into a computation expression, including a plurality of defined conversion categories, which can be recognized by a big data memory computational model; (3) providing a query optimizer which converts the introduced expression that can be recognized by the memory computational model into the logical plans, optimizes the logical plans and then converts into physical execution plans. Due to lack of data query and analysis processing in the conventional big data processing, the invention provides the distributed data analysis processing method based on memory computation and inherits the advantages of the memory computational model in the aspect of data processing so that the programming language of data query and analysis is simple.

Description

technical field [0001] This article relates to a field of big data analysis technology, specifically a distributed data analysis and processing method based on the memory computing framework. Background technique [0002] With the advent of the era of cloud computing, the emerging term "big data" has attracted more and more attention; the so-called big data refers to data that cannot be captured, managed and processed with conventional software tools within an affordable time frame. Data collection; the meaning of big data does not simply refer to mastering huge data information, but how to extract the key information contained in a large amount of data, that is, how to process and analyze big data; of course, for distributed data mining of massive data, it is necessary Relying on distributed processing of cloud computing, distributed database and cloud storage, virtualization technology, etc. [0003] With the rapid development of the Internet and the rapid expansion of in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2453G06F16/245G06F16/24532G06F16/2471
Inventor 朱志祥肖跃雷张龙兴陈晓
Owner 陕西艾特智慧信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products