Cross-regional task scheduling method and system based on big data

A cross-regional task scheduling technology, applied to electronic digital data processing, structured data retrieval, database distribution/replication, and related fields. It addresses the inability of platforms to share computing resources, heavy bandwidth consumption, and wasted disk space.

Active Publication Date: 2020-12-22
北京东方国信科技股份有限公司
AI Technical Summary

Problems solved by technology

[0011] First, a large volume of data is transmitted to the headquarters every day, which consumes substantial bandwidth, and VPN costs are relatively high.
[0012] Second, multiple copies of the same data are stored, wasting disk space.
[0013] Third, both the headquarters and the provinces can use only the computing resources of their own big data platform for data analysis. Even when another platform's resources are idle, they cannot be borrowed, so resources are wasted.
[0014] Fourth, analysts in a province can see only that province's data, not the data of other provinces, so they cannot perform horizontal, cross-provincial comparative analysis.
[0015] Fifth, a large support staff is required, so labor costs are high.
[0016] Sixth, building a very large Hadoop cluster at the headquarters to hold nationwide data is very expensive.
[0017] Seventh, the headquarters' access to provincial data is heavily delayed: it can access only the previous day's data from each province, not real-time provincial data.

Examples

Embodiment Construction

[0069] The implementation of the present invention is illustrated below by specific examples, and those skilled in the art can readily understand other advantages and effects of the invention from the contents disclosed in this description. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0070] Referring to Figure 1, a big data-based cross-regional task scheduling method is provided, including the following steps:

[0071] S1: The user connects to the big data platform and issues a structured query language (SQL) statement; the big data platform parses the statement and generates a syntax tree. The big data platform includes a total data platform and a sub-data platform;
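
To make step S1 concrete, the following is a minimal sketch of turning an SQL statement into a small syntax tree. It is not the parser described in the patent, which is not disclosed in this excerpt. The `SelectNode` class, the `parse_select` function, and the `usage_log` table are hypothetical names chosen for illustration, and the sketch handles only a single-table SELECT with an optional GROUP BY.

```python
import re
from dataclasses import dataclass, field
from typing import List


@dataclass
class SelectNode:
    """Root of a toy syntax tree for one SELECT statement (hypothetical)."""
    columns: List[str]
    table: str
    group_by: List[str] = field(default_factory=list)


# Recognizes only: SELECT <cols> FROM <table> [GROUP BY <cols>] [;]
_SELECT_RE = re.compile(
    r"^\s*SELECT\s+(?P<cols>.+?)\s+FROM\s+(?P<table>\w+)"
    r"(?:\s+GROUP\s+BY\s+(?P<group>.+?))?\s*;?\s*$",
    re.IGNORECASE | re.DOTALL,
)


def parse_select(sql: str) -> SelectNode:
    """Parse a very restricted SELECT statement into a SelectNode."""
    match = _SELECT_RE.match(sql)
    if match is None:
        raise ValueError(f"unsupported statement: {sql!r}")
    # Naive comma split: expressions containing commas are not supported.
    columns = [c.strip() for c in match.group("cols").split(",")]
    group = match.group("group")
    group_by = [g.strip() for g in group.split(",")] if group else []
    return SelectNode(columns=columns, table=match.group("table"), group_by=group_by)


tree = parse_select("SELECT province, SUM(traffic) FROM usage_log GROUP BY province")
print(tree)
```

A production system would use a full SQL grammar. The point here is only that later steps operate on a structured tree (columns, table, grouping keys) rather than on the raw query string.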

[...

Abstract

The invention discloses a cross-regional task scheduling method and system based on big data. In the method, a user issues a structured query language (SQL) statement; the statement is parsed to generate a syntax tree; a cross-domain scheduling engine splits the syntax tree according to metadata information, then generates and distributes a plurality of logical execution plans to be executed on a total data platform or a sub-data platform. In each cluster that receives a logical execution plan, the cross-domain scheduling engine acquires the metadata stored in the cluster and obtains the location information of all data blocks from that metadata to generate the logical execution plan finally executed in the cluster. The cross-domain scheduling engine distributes the logical execution plan to the data nodes; the data analysis engine on each data node that receives the plan reads the data and performs the calculation; and the cluster performs a summarization calculation to generate a preliminary summarization result, which it sends according to the sending position information. The cluster that receives the preliminary summarization results performs a secondary summarization calculation on the data to generate a final query result and returns it to the user. Mass data transmission is avoided, bandwidth is saved, and cost is reduced.
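
The preliminary and secondary summarization described above can be sketched as a two-phase aggregation. This is an illustrative sketch under stated assumptions, not the patented implementation: the function names, the in-memory "clusters", and the sample values are all hypothetical, and a grouped average is used as the example query. Each cluster reduces its own rows to a compact (sum, count) pair per group, and only those pairs cross the network.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Row = Tuple[str, float]                  # (group key, value)
Partial = Dict[str, Tuple[float, int]]   # group key -> (sum, count)


def preliminary_summary(rows: Iterable[Row]) -> Partial:
    """Phase 1: runs inside one cluster, next to the data it owns."""
    acc = defaultdict(lambda: [0.0, 0])
    for key, value in rows:
        acc[key][0] += value
        acc[key][1] += 1
    return {key: (total, count) for key, (total, count) in acc.items()}


def secondary_summary(partials: Iterable[Partial]) -> Dict[str, float]:
    """Phase 2: merges the per-cluster partial results into final averages."""
    merged = defaultdict(lambda: [0.0, 0])
    for partial in partials:
        for key, (total, count) in partial.items():
            merged[key][0] += total
            merged[key][1] += count
    return {key: total / count for key, (total, count) in merged.items()}


def cross_region_average(clusters: Dict[str, List[Row]]) -> Dict[str, float]:
    """Hypothetical driver: only partial summaries leave each cluster."""
    partials = [preliminary_summary(rows) for rows in clusters.values()]
    return secondary_summary(partials)


# Made-up sample data for two hypothetical regional clusters.
clusters = {
    "region_a": [("4g", 10.0), ("5g", 30.0)],
    "region_b": [("4g", 20.0), ("5g", 50.0), ("5g", 40.0)],
}
print(cross_region_average(clusters))
```

Averages cannot be merged directly, which is why the sketch ships sums and counts instead. The transferred payload grows with the number of groups rather than the number of rows, which is consistent with the bandwidth saving the abstract claims.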

Description

Technical field

[0001] The embodiments of the present invention relate to the technical field of data processing, and in particular to a big data-based cross-regional task scheduling method and system.

Background technique

[0002] At present, China Unicom and China Telecom establish big data platforms in each province, upload the stored data files over the network to the headquarters' big data platform every day, and then perform data auditing and data analysis at the headquarters. This usually relies on Hive (a Hadoop-based data warehouse tool that maps structured data files to database tables and provides a simple SQL query capability, converting SQL statements into MapReduce tasks for execution), Spark (a fast, general-purpose computing engine designed for large-scale data processing, which can be used for SQL queries, text processing, machine learning, and other operations), and other MPP tools. Generally, it is neces...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/242, G06F16/2452, G06F16/2458, G06F16/248, G06F16/27
Inventor: 刘垚田俊何献青谢冬云
Owner: 北京东方国信科技股份有限公司