High-efficiency processing method and system for big data

A processing method and big data technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as low efficiency, heavy tasks, difficult data processing, etc., and achieve the effect of improving processing efficiency and high availability

Inactive Publication Date: 2015-02-04
ANHUI SUN CREATE ELECTRONICS
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the distributed storage (HDFS) does not support the direct processing of structured query statements (SQL), the data in the distributed storage (HDFS) is difficult to be processed directly, and the computing tasks need to be transformed into the parallel computing MapReduce framework in the end. Execution, its management node (Jobtracker) has heavy tasks, low efficiency, and easily leads to single point of failure

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-efficiency processing method and system for big data
  • High-efficiency processing method and system for big data
  • High-efficiency processing method and system for big data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] An efficient processing method for big data, including: firstly, the data node receives the data to be stored; secondly, the data node stores the data, and at the same time, creates an index according to the business scenario and stores it in the memory, and gradually saves it in the In the disk; again, the user inputs a task request, and the SQL engine realizes fast data retrieval according to the created index, and outputs the data to the computing node; then, the task processing module of the management node executes task scheduling, and applies for resources from the resource management module to determine the idle time. computing node, and the data is processed by the computing node; finally, the final processed data is presented to the user, and the data types received by the data node include structured, semi-structured and unstructured data, such as figure 1 shown.

[0029] Such as figure 1 As shown, the data node realizes the storage of the data to be stored, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a high-efficiency processing method for big data, which comprises the following steps that a data node receives data to be stored; the data node stores the data, an index is simultaneously created according to a business scenario and is stored in a memory, and the data is gradually stored in a disk by index curing; a user inputs a task request, and an SQL (Structured Query Language) engine implements rapid retrieval of the data according to the created index and outputs the data to a computational node; a task processing module of a management node executes task scheduling, applies for resources to a resource management module and determines a spare computational node, and the spare computational node processes the data; the finally processed data is shown for the user. The invention also discloses the high-efficiency processing system for the big data. According to the invention, all processing is executed concurrently; hardware equipment of a computer is utilized to the greatest extent; processing efficiency is greatly improved; the user can more rapidly obtain a processing result when a task is executed.

Description

technical field [0001] The invention relates to the technical field of computer big data application processing, in particular to an efficient processing method and system for big data. Background technique [0002] With the extensive development of large-scale projects such as safe cities and smart cities, data aggregation and data fusion have further developed, and the amount of data to be processed has reached TB and PB levels. The processing of large amounts of data has produced a series of practical problems. When relational databases face such a large amount of data, their technical architecture, processing capabilities, and processing methods are increasingly unable to meet user needs. [0003] The development of cloud computing and big data technology provides a good solution to the processing of massive data. The Hadoop framework system uses parallel computing (MapReduce) and distributed storage (HDFS) to realize the storage and calculation of large amounts of data....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/248G06F16/2272
Inventor 王佐成任子晖马韵洁张凯
Owner ANHUI SUN CREATE ELECTRONICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products