Evaluating Stream Processing Autoscalers

Conference Paper (2024)
Author(s)

G. Siachamis (TU Delft - Web Information Systems)

G.C. Christodoulou (TU Delft - Data-Intensive Systems)

K. Psarakis (TU Delft - Web Information Systems)

M. Fragkoulis (TU Delft - Web Information Systems)

Arie Van Deursen (TU Delft - Software Engineering)

Asterios Katsifodimos (TU Delft - Data-Intensive Systems)

Research Group
Web Information Systems
DOI related publication
https://doi.org/10.1145/3629104.3666036
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Web Information Systems
Pages (from-to)
110-122
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

While the concept of large-scale stream processing is very popular nowadays, efficient dynamic allocation of resources is still an open issue in the area. The database research community has yet to evaluate different autoscaling techniques for stream processing engines under a robust benchmarking setting and evaluation framework. As a result, no conclusions can be made about the current solutions and problems that remain unsolved. Therefore, we address this issue with a principled evaluation approach.

This paper evaluates the state-of-the-art control-based solutions in the autoscaling area with diverse, dynamic workloads, applying specific metrics. We investigate different aspects of the autoscaling problem as performance and convergence. Our experiments reveal that current control-based autoscaling techniques fail to account for generated lag cost by rescaling or underprovisioning and cannot efficiently handle practical scenarios of intensely dynamic workloads. Unexpectedly, we discovered that an autoscaling method not tailored for streaming can outperform others in certain scenarios.