Benchmarking geo-distributed databases

Evaluation using the SmallBank benchmark

Bachelor Thesis (2025)
Author(s)

F. Cirtog (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

A Katsifodimos – Mentor (TU Delft - Data-Intensive Systems)

O. Mráz – Mentor (TU Delft - Data-Intensive Systems)

KG Langendoen – Graduation committee member (TU Delft - Embedded Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
20-06-2025
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In recent years, applications have started using geo-distributed databases, even though their behavior under different workloads remains complex. Therefore, this project analyses how several databases handle transactional workloads using the SmallBank benchmark. We implement and adapt an already existent benchmark, previously used for non-distribuited databases. Furthermore, we use it to evaluate multiple geo-distributed databases highlighting their strengths and weaknesses throughout multiple scenarios designed to stress different parts of the system. We observe that in most scenarios Detock performs slightly better than SLOG, and both outperform Calvin. This aligns with the fact that Detock builds upon SLOG, which itself improves on Calvin. On the other hand, Janus's performance is significantly behind the others due to its communication overhead. However, while the SmallBank benchmark provides an insightful comparison, providing specific advantages compared to previous benchmarks, its adaptation to geo-distributed databases limits its ability to compare scenarios with higher percentages of multi-home and multi-partition transactions.

Files

License info not available