A Heuristics-Based Cost Model for Scientific Workflow Scheduling in Cloud

Journal Article (2021)
Author(s)

Ehab Al-Khannaq (TU Delft - Transport and Planning)

Sai Peck Lee (University of Malaya)

Saif Ur Rehman Khan (COMSATS Institute of Information Technology)

Navid Behboodian (HELP University)

Osamah Ibrahim Khala (Al-Nahrain University)

A Verbraeck (TU Delft - Policy Analysis)

J. W.C. Van Lint (TU Delft - Transport and Planning)

Transport and Planning
Copyright
© 2021 Ehab Nabiel Al-Khanak, Sai Peck Lee, Saif Ur Rehman Khan, Navid Behboodian, Osamah Ibrahim Khala, A. Verbraeck, J.W.C. van Lint
DOI related publication
https://doi.org/10.32604/cmc.2021.015409
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Ehab Nabiel Al-Khanak, Sai Peck Lee, Saif Ur Rehman Khan, Navid Behboodian, Osamah Ibrahim Khala, A. Verbraeck, J.W.C. van Lint
Transport and Planning
Issue number
3
Volume number
67
Pages (from-to)
3265-3282
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Scientific Workflow Applications (SWFAs) can deliver collaborative tools useful to researchers in executing large and complex scientific processes. Particularly, Scientific Workflow Scheduling (SWFS) accelerates the computational procedures between the available computational resources and the dependent workflow jobs based on the researchers’ requirements. However, cost optimization is one of the SWFS challenges in handling massive and complicated tasks and requires determining an approximate (near-optimal) solution within polynomial computational time. Motivated by this, current work proposes a novel SWFS cost optimization model effective in solving this challenge. The proposed model contains three main stages: (i) scientific workflow application, (ii) targeted computational environment, and (iii) cost optimization criteria. The model has been used to optimize completion time (makespan) and overall computational cost of SWFS in cloud computing for all considered scenarios in this research context. This will ultimately reduce the cost for service consumers. At the same time, reducing the cost has a positive impact on the profitability of service providers towards utilizing all computational resources to achieve a competitive advantage over other cloud service providers. To evaluate the effectiveness of this proposed model, an empirical comparison was conducted by employing three core types of heuristic approaches, including Single-based (i.e., Genetic Algorithm (GA), Particle Swarm Optimization (PSO), and Invasive Weed Optimization (IWO)), Hybrid-based (i.e., Hybrid-based Heuristics Algorithms (HIWO)), and Hyper-based (i.e., Dynamic Hyper-Heuristic Algorithm (DHHA)). Additionally, a simulation-based implementation was used for SIPHT SWFA by considering three different sizes of datasets. The proposed model provides an efficient platform to optimally schedule workflow tasks by handling data-intensiveness and computational-intensiveness of SWFAs. The results reveal that the proposed cost optimization model attained an optimal Job completion time (makespan) and total computational cost for small and large sizes of the considered dataset. In contrast, hybrid and hyper-based approaches consistently achieved better results for the medium-sized dataset.