Evaluating Stream Processing Autoscalers

None, None; None, None; None, None; None, None; None, None; None, None

Evaluating Stream Processing Autoscalers

Conference Paper (2024)

Author(s)

G. Siachamis (TU Delft - Web Information Systems)

G.C. Christodoulou (TU Delft - Data-Intensive Systems)

K. Psarakis (TU Delft - Web Information Systems)

M. Fragkoulis (TU Delft - Web Information Systems)

A. van Deursen (TU Delft - Software Engineering)

Asterios Katsifodimos (TU Delft - Data-Intensive Systems)

Research Group

Web Information Systems

DOI related publication

https://doi.org/10.1145/3629104.3666036

To reference this document use:

https://resolver.tudelft.nl/uuid:34079a41-58e7-44ab-9f55-5349b7e16472

More Info

expand_more

Publication Year

2024

Language

English

Research Group

Web Information Systems

Pages (from-to)

110-122

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

While the concept of large-scale stream processing is very popular nowadays, efficient dynamic allocation of resources is still an open issue in the area. The database research community has yet to evaluate different autoscaling techniques for stream processing engines under a robust benchmarking setting and evaluation framework. As a result, no conclusions can be made about the current solutions and problems that remain unsolved. Therefore, we address this issue with a principled evaluation approach.

This paper evaluates the state-of-the-art control-based solutions in the autoscaling area with diverse, dynamic workloads, applying specific metrics. We investigate different aspects of the autoscaling problem as performance and convergence. Our experiments reveal that current control-based autoscaling techniques fail to account for generated lag cost by rescaling or underprovisioning and cannot efficiently handle practical scenarios of intensely dynamic workloads. Unexpectedly, we discovered that an autoscaling method not tailored for streaming can outperform others in certain scenarios.

Files

3629104.3666036.pdf

(pdf | 1.36 Mb)