A Reference Architecture for Datacenter Scheduling

Design, Validation, and Experiments

Conference Paper (2019)
Author(s)

G. Andreadis (Student TU Delft)

L.F.D. Versluis (Vrije Universiteit Amsterdam)

F.S. Mastenbroek (Student TU Delft)

Alexandru Iosup (Vrije Universiteit Amsterdam, TU Delft - Data-Intensive Systems)

Research Group
Data-Intensive Systems
DOI related publication
https://doi.org/10.1109/SC.2018.00040
More Info
expand_more
Publication Year
2019
Language
English
Research Group
Data-Intensive Systems
Pages (from-to)
478-492
ISBN (print)
978-1-5386-8385-9
ISBN (electronic)
978-1-5386-8384-2

Abstract

Datacenters act as cloud-infrastructure to stakeholders across industry, government, and academia. To meet growing demand yet operate efficiently, datacenter operators employ increasingly more sophisticated scheduling systems, mechanisms, and policies. Although many scheduling techniques already exist, relatively little research has gone into the abstraction of the scheduling process itself, hampering design, tuning, and comparison of existing techniques. In this work, we propose a reference architecture for datacenter schedulers. The architecture follows five design principles: components with clearly distinct responsibilities, grouping of related components where possible, separation of mechanism from policy, scheduling as complex workflow, and hierarchical multi-scheduler structure. To demonstrate the validity of the reference architecture, we map to it state-of-the-art datacenter schedulers. We find scheduler-stages are commonly underspecified in peer-reviewed publications. Through trace-based simulation and real-world experiments, we show underspecification of scheduler-stages can lead to significant variations in performance.

No files available

Metadata only record. There are no files for this record.