Print Email Facebook Twitter An experimental evaluation of auto-scaling techniques for distributed stream processing systems Title An experimental evaluation of auto-scaling techniques for distributed stream processing systems Author Kanis, Job (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Siachamis, G. (mentor) Katsifodimos, A (graduation committee) Decouchant, Jérémie (graduation committee) Degree granting institution Delft University of Technology Date 2023-03-28 Abstract The introduction of cloud hosting has made it possible to elastically provision distributed stream processing systems (SPEs). By dynamically scaling the different operators of the system, resource consumption can be minimised while meeting the system service-level objectives. In the literature, many different auto-scaling techniques are proposed that make scaling decisions based on the current state of the system. However, these techniques are poorly evaluated and are rarely compared with each other. This makes it difficult to determine the state-of-the-art for auto-scaling techniques targeting SPEs, which slows down its development. In this paper, we design and implement a modular framework to evaluate the performance of state-of-the-art auto-scalers targeting SPEs. We implement state-of-the-art auto-scalers Dhalion, DS2, and Varga et al., using Kubernetes horizontal pod auto-scaler as baseline. We perform an end-to-end experimental evaluation of the auto-scalers and investigate their performance when run on different queries and workload patterns. Furthermore, we investigate the convergence time of the auto-scalers and evaluate their scaling accuracy. The results emphasise the difficulty of capturing the complex relationships of different operators and the struggle to balance resource efficiency and the performance of the system. Moreover, it shows the inherent weakness of reactive auto-scalers to react slowly to changing workloads and reveals the importance of considering the current health of the system when issuing scaling actions. Subject Auto-scalingDistributed Stream ProcessingCloud Computing To reference this document use: http://resolver.tudelft.nl/uuid:227603f5-6430-4ac4-94c1-a705f3533b70 Part of collection Student theses Document type master thesis Rights © 2023 Job Kanis Files PDF Thesis_Report_Job_Kanis_F ... ersion.pdf 1.93 MB Close viewer /islandora/object/uuid:227603f5-6430-4ac4-94c1-a705f3533b70/datastream/OBJ/view