Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Cloud workflow task execution time prediction method based on limit gradient improvement

A technology for task execution and time prediction, applied in the field of cloud computing, can solve the problems of missing value variability, dependence on observations, poor robustness, etc., and achieve the effect of long training time

Inactive Publication Date: 2019-07-05
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF4 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The present invention aims to solve the problem of accurate estimation and prediction of cloud workflow task execution time, and proposes a cloud workflow task execution time prediction method based on extreme gradient boosting
Secondly, in the case of missing values ​​in the sample data sets, machine learning methods such as random forest models are used to complete the data sets with missing values, which solves the inherent missing values ​​of traditional methods such as average filling and K-nearest neighbor filling. Loss of variability, over-dependence on observations, and poor robustness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cloud workflow task execution time prediction method based on limit gradient improvement
  • Cloud workflow task execution time prediction method based on limit gradient improvement
  • Cloud workflow task execution time prediction method based on limit gradient improvement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0091] The experiment collects 5112 groups of cloud workflow task execution time and all corresponding influencing factors data. Firstly, 4090 sets of data are extracted from it to form a training set, and the remaining 1022 sets of data are used as a test set, and the random forest model is used to process the data missing value of the training set and the test set respectively; secondly, based on the training set with complete data values, the extreme gradient is used. The algorithm trains the cloud workflow task execution time prediction model; finally, input the actual influencing factor data in the test set into the trained model, predict the corresponding cloud workflow task execution time, and compare it with the actual task execution time to calculate the prediction error.

[0092] The performance evaluation of the cloud workflow task execution time prediction model uses the root mean square error RMSE (root mean square error) as an indicator to evaluate the accuracy o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a cloud workflow task execution time prediction method based on limit gradient improvement, and belongs to the technical field of cloud computing. According to the method, influence factors of task execution time are classified from three levels of workflow task composition, resources on which task operation depends and a physical execution environment of the resources, and comprehensive modeling of the influence factors of the task execution time is achieved. Secondly, aiming at the condition that the sample data set has a data missing value, the data set with the missing value is complemented by adopting a machine learning method; and finally, by means of the multi-type data processing capability of the extreme gradient lifting algorithm, the parameter design isrelatively simple, the calculated amount is small, the advantages of a serial learner and a parallel learner are combined, and a cloud workflow task execution time prediction model is trained by adopting the extreme gradient lifting algorithm. Compared with an existing prediction model, limitation on the sample data type is reduced, prediction errors are reduced, and the generalization ability ofthe model is further improved.

Description

technical field [0001] The invention relates to a method for predicting the execution time of a cloud workflow task, and belongs to the technical field of cloud computing. Background technique [0002] With the maturity and wide application of cloud computing technology, especially the pay-per-use model of cloud computing and the elastic and on-demand expansion of cloud resources, users can obtain anytime, anywhere and at low cost without investing in hardware equipment and software resources. Any required computing resource service, therefore, more and more scientific workers use the cloud computing environment to support and execute their own complex scientific computing processes, that is, scientific workflow or cloud workflow. [0003] The first step in cloud workflow execution is the scheduling of tasks to virtual machine resources, that is, from the virtual resource pool, match the most suitable virtual machine resources for each cloud workflow task, aiming to meet the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): H04L29/08H04L12/24
CPCH04L41/147H04L67/10
Inventor 李慧芳韦琬雯樊锐胡光政邹伟东柴森春夏元清
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products