Poster: Generating Reproducible Out-of-Order Data Streams
Philipp M. Grulich (Technical University of Berlin)
Jonas Traub (DFKI GmbH, Technical University of Berlin)
Sebastian Bress (Technical University of Berlin, DFKI GmbH)
Asterios Katsifodimos (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Volker Markl (Technical University of Berlin, DFKI GmbH)
Tilmann Rabl (Hasso Plattner Institute)
Abstract
Evaluating modern stream processing systems in a reproducible manner requires data streams with different data distributions, data rates, and real-world characteristics such as delayed and out-of-order tuples. In this paper, we present an open-source stream generator that produces reproducible, deterministic out-of-order streams from real data files and simulates arbitrary fractions of out-of-order tuples and their respective delays.
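The idea of a seeded, deterministic out-of-order stream can be illustrated with a minimal sketch. This is not the generator presented in the paper: the function names (`make_out_of_order`, `count_out_of_order`), the parameters (`fraction`, `max_delay`, `seed`), and the uniform delay distribution are all assumptions chosen for illustration. The sketch delays a chosen fraction of tuples by a random amount and then orders the tuples by their simulated arrival time, so the same seed always yields the same stream.

```python
import random
from typing import List, Tuple

# Hypothetical tuple layout for this sketch: (event_time, payload).
Event = Tuple[int, str]


def make_out_of_order(events: List[Event], fraction: float,
                      max_delay: int, seed: int) -> List[Event]:
    """Return the events in simulated arrival order.

    Each event is delayed with probability `fraction` by a uniform
    integer delay in [1, max_delay] (an assumed distribution).
    A fixed `seed` makes the result reproducible.
    """
    rng = random.Random(seed)  # private RNG, independent of global state
    keyed = []
    for idx, (event_time, payload) in enumerate(events):
        delay = 0
        if rng.random() < fraction:
            delay = rng.randint(1, max_delay)
        # Arrival time = event time + delay; idx breaks ties deterministically.
        keyed.append((event_time + delay, idx, event_time, payload))
    keyed.sort(key=lambda k: (k[0], k[1]))
    return [(event_time, payload) for _, _, event_time, payload in keyed]


def count_out_of_order(stream: List[Event]) -> int:
    """Count tuples that arrive after a tuple with a larger event time."""
    count = 0
    highest_seen = float("-inf")
    for event_time, _ in stream:
        if event_time < highest_seen:
            count += 1
        else:
            highest_seen = event_time
    return count


# In a real setting the events would be read from a data file;
# here a small in-order input stands in for it.
events = [(t, f"e{t}") for t in range(10)]
stream = make_out_of_order(events, fraction=0.3, max_delay=5, seed=42)
```

Calling `make_out_of_order` twice with the same arguments returns identical streams, which is the property a reproducible benchmark needs. Setting `fraction=0.0` leaves the input order unchanged, and `count_out_of_order(stream)` reports how many tuples actually arrived late.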