Method and device for executing distributed computation tasks

A distributed computing and distributed data technology, applied in the field of computers and distributed computing. It addresses the problems that existing interfaces are inconvenient to use and that SQL-Like interfaces cannot describe some computing requirements, and it achieves code reusability.

Active Publication Date: 2017-02-22
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, the interfaces of existing distributed computing frameworks are generally inconvenient to use. In particular, they lack the concept of nested data sets, which makes logic (especially grouping logic) complex to express and code difficult to reuse.
Take the SQL-Like interface as an example: because it lacks the concept of nested data sets, it cannot describe some computing requirements.
Consider a common requirement: group the data by a certain field and, within each group, take the top 10 records ranked by the value of a field in descending order. The expression "select field_1, field_2 from table_1 group by field1 order by field_2 desc limit 10" actually takes the first 10 records globally, not the first 10 records of each group. The expression "select field_1, field_2 from table_1 group by field1 limit 10 order by field_2 desc" does not conform to SQL syntax. As a result, the SQL-Like interface cannot describe some common computing requirements.
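
To make the requirement concrete, the following is a minimal single-machine sketch in Python, not the interface described in the application. The function name `top_n_per_group` and the sample records are hypothetical. It contrasts a per-group top-N, which needs a grouped (nested) intermediate result, with the global top-N that the first SQL expression effectively computes.

```python
from collections import defaultdict
import heapq

def top_n_per_group(records, n=10):
    """Group (field_1, field_2) records by field_1, then keep the n
    largest field_2 values inside each group."""
    groups = defaultdict(list)          # nested result: key -> data set
    for field_1, field_2 in records:
        groups[field_1].append(field_2)
    return {key: heapq.nlargest(n, values) for key, values in groups.items()}

# Hypothetical sample data: (field_1, field_2) pairs.
records = [("a", 3), ("b", 7), ("a", 9), ("a", 1), ("b", 2)]

# Per-group top 2: each group keeps its own largest values.
per_group = top_n_per_group(records, n=2)

# Global top 2: what "order by field_2 desc limit 2" yields without nesting.
global_top = heapq.nlargest(2, records, key=lambda r: r[1])

print(per_group)
print(global_top)
```

The per-group result keeps a separate list for every key, which is exactly the nested structure that a flat SQL-Like result set has no way to express.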



Embodiment Construction

[0014] The application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the related invention, not to limit it. It should also be noted that, for convenience of description, only the parts related to the invention are shown in the drawings.

[0015] It should be noted that, provided there is no conflict, the embodiments in the present application and the features in those embodiments can be combined with each other. The present application is described in detail below with reference to the accompanying drawings and embodiments.

[0016] Figure 1 shows an exemplary system architecture 100 to which embodiments of the method or device for executing distributed computing tasks of the present application may be applied.

[0017] As shown in Figure 1, system architecture 100 may include distr...


Abstract

The invention discloses a method and device for executing distributed computation tasks. A specific implementation of the method comprises the following steps: parsing an expression of a distributed computation task to obtain an operator keyword; and executing the distributed computation task using the operator corresponding to the operator keyword, based on the input parameters of the operator. The input parameters comprise at least one of the following items: a distributed data set, which is stored in a distributed manner and consists of at least one data element; and a distributed key-value pair set, which is stored in a distributed manner and consists of at least one key-value pair, where the values of the key-value pairs in the distributed key-value pair set are themselves distributed data sets or distributed key-value pair sets. This implementation makes it possible to operate on nested data sets, so that it supports distributed computation tasks that operate on distributed data sets and on the grouped data obtained after grouping those data sets, and it achieves code reusability.
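
The steps in the abstract can be illustrated with a small in-memory Python sketch. It is only an illustration under stated assumptions and not the implementation described in the application: plain lists stand in for distributed data sets, dicts stand in for distributed key-value pair sets, and the operator names `group_by` and `apply_values`, the `OPERATORS` table, and the `run` function are all hypothetical.

```python
from typing import Callable, Dict, List, Union

# Assumption: a list models a data set; a dict models a key-value pair set
# whose values are data sets or further key-value pair sets (nesting).
Nested = Union[List, Dict[str, "Nested"]]

def group_by(data: List, key_fn: Callable) -> Dict:
    """Turn a data set into a key-value pair set of per-key data sets."""
    out: Dict = {}
    for item in data:
        out.setdefault(key_fn(item), []).append(item)
    return out

def apply_values(nested: Nested, fn: Callable[[List], List]) -> Nested:
    """Apply fn to every leaf data set, descending through nested sets."""
    if isinstance(nested, dict):
        return {k: apply_values(v, fn) for k, v in nested.items()}
    return fn(nested)

# Hypothetical table mapping operator keywords to operators.
OPERATORS = {"group_by": group_by, "apply_values": apply_values}

def run(expression: str, *args):
    """Parse the operator keyword from the expression, then execute the
    matching operator on the given input parameters."""
    keyword = expression.split("(", 1)[0].strip()
    return OPERATORS[keyword](*args)

records = [("a", 3), ("b", 7), ("a", 9)]
grouped = run("group_by(...)", records, lambda r: r[0])
top1 = run(
    "apply_values(...)",
    grouped,
    lambda ds: sorted(ds, key=lambda r: r[1], reverse=True)[:1],
)
print(top1)
```

Because `apply_values` accepts any function over a data set, the same function can be reused on an ungrouped data set and on each group of a grouped one, which is one way to read the code reusability the abstract refers to.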

Description

Technical field

[0001] The present application relates to the field of computers, specifically to the field of distributed computing, and more particularly to a method and device for executing distributed computing tasks.

Background

[0002] When developing distributed computing programs, users (such as engineers writing distributed computing programs) can write them using the interfaces provided by a distributed computing framework.

[0003] The interfaces provided by existing distributed computing frameworks mainly include native interfaces (for example, MapReduce, a programming model for parallel computing over large-scale data sets), SQL-Like (Structured Query Language Like) interfaces (for example, Hive, the data warehouse infrastructure built on Hadoop, the distributed system infrastructure developed by the Apache Foundation) and the Resilient Distributed Dataset (RDD, Resilient Distri...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/46
CPC: G06F9/465, G06F16/27, G06F9/46, G06F16/00, G06F9/5061, G06F15/16
Inventor: 张云聪, 温翔, 王聪, 柴华, 徐瑶
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD