Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows

Conference Paper (2024)
Author(s)

K. Psarakis (TU Delft - Web Information Systems)

W.D. Zorgdrager (Delivery Hero SE)

Marios Fragkoulis (Delivery Hero SE)

Guido Salvaneschi (University of St. Gallen)

A Katsifodimos (TU Delft - Web Information Systems)

Research Group
Web Information Systems
DOI related publication
https://doi.org/10.48786/edbt.2024.02
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Web Information Systems
Pages (from-to)
15-21
ISBN (electronic)
['9783893180912', '9783893180943', '9783893180950']
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Although the cloud has reached a state of robustness, the burden of using its resources falls on the shoulders of programmers who struggle to keep up with ever-growing cloud infrastructure services and abstractions. As a result, state management, scaling, operation, and failure management of scalable cloud applications require disproportionately more effort than developing the applications' actual business logic. Our vision aims to raise the abstraction level for programming scalable cloud applications by compiling stateful entities – a programming model enabling imperative transactional programs authored in Python – into stateful streaming dataflows. We propose a compiler pipeline that analyzes the abstract syntax tree of stateful entities and transforms them into an intermediate representation based on stateful dataflow graphs. It then compiles that intermediate representation into different dataflow engines, leveraging their exactly-once message processing guarantees to prevent state or failure management primitives from "leaking" into the level of the programming model. Preliminary experiments with a proof of concept implementation show that despite program transformation and translation to dataflows, stateful entities can perform at sub-100ms latency even for transactional workloads.