Autotuning Geo-distributed Systems

A Contextual Bandit for Dynamic Data Movement in Detock

Bachelor Thesis (2026)
Author(s)

R. Popa (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

O. Mráz – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

A. Katsifodimos – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

B. Özkan – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2026
Language
English
Graduation Date
26-06-2026
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Geo-distributed databases must serve transactions close to where their data is accessed to keep latency low, which is why systems like Detock assign each data item a "home" region and route its transactions there. But Detock leaves the placement policy open: where should each key live, and when should it move? Reactive heuristics fill this gap poorly. Deciding one key at a time, they split co-accessed records across regions and turn otherwise single-home transactions into expensive multi-home ones. Migrations that fire too often or at the wrong time also force concurrent transactions reading a stale home to restart.

This paper presents an adaptive placement agent for Detock. It groups co-accessed keys into communities using the Leiden algorithm, then uses a contextual bandit to decide when and where to migrate each community. The bandit learns from the locality and restart signals it observes at runtime, with no prior knowledge of the workload. We evaluate the agent on a two-region, follow-the-sun deployment of the Product–Parts–Supplier benchmark. Under matched migration budgets, the agent tracks the shifting hotspot as well as a DynaMast-style baseline while issuing about 29% fewer home-movements and causing roughly 4.3× fewer transaction restarts. These results show that a lightweight, group-aware and cost-aware learned policy is a practical way to drive home-movement in a deterministic geo-distributed database.
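
The two-stage idea (group co-accessed keys, then let a bandit pick a home per group) can be illustrated with a minimal Python sketch. It is not the thesis implementation and rests on several assumptions: connected components over a thresholded co-access graph stand in for the Leiden algorithm, an epsilon-greedy tabular learner stands in for the contextual bandit (the abstract does not say which variant is used), and the names `co_access_groups`, `EpsilonGreedyBandit`, `placement_reward`, and the cost weights are hypothetical.

```python
import random
from collections import defaultdict
from itertools import combinations


def co_access_groups(transactions, min_weight=2):
    """Group keys that co-occur in at least `min_weight` transactions.

    Assumption: connected components (union-find) replace Leiden here.
    """
    weight = defaultdict(int)
    for keys in transactions:
        for a, b in combinations(sorted(set(keys)), 2):
            weight[(a, b)] += 1

    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for keys in transactions:
        for k in keys:
            find(k)  # register every key, even isolated ones
    for (a, b), w in weight.items():
        if w >= min_weight:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for k in list(parent):
        groups[find(k)].add(k)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


class EpsilonGreedyBandit:
    """Tabular contextual bandit: running mean reward per (context, arm)."""

    def __init__(self, arms, epsilon=0.1, seed=0):
        self.arms = list(arms)
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.count = defaultdict(int)
        self.value = defaultdict(float)

    def choose(self, context):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        return max(self.arms, key=lambda arm: self.value[(context, arm)])

    def update(self, context, arm, reward):
        key = (context, arm)
        self.count[key] += 1
        self.value[key] += (reward - self.value[key]) / self.count[key]


def placement_reward(local_fraction, restarts, moved,
                     restart_cost=0.05, move_cost=0.1):
    """Hypothetical reward: locality gained minus restart and migration costs."""
    return local_fraction - restart_cost * restarts - (move_cost if moved else 0.0)
```

In this sketch the arms are candidate home regions and the context is a coarse label such as the region currently issuing most requests. Penalising restarts and moves in the reward is one simple way to make the learner cost-aware, so it does not migrate on every small shift in locality.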
