Alexander Alexandrov | TU Delft Repository

Emma in action

Declarative Dataflows for scalable data analysis

Conference paper (2016) - Alexander Alexandrov (author) , Andreas Salzmann (author) , Georgi Krastev (author) , Asterios Katsifodimos (author) , Volker Markl (author)

Parallel dataow APIs based on second-order functions were originally seen as a exible alternative to SQL. Over time, however, their complexity increased due to the number of physical aspects that had to be exposed by the underlying engines in order to facilitate efficient executi ...

Bridging the Gap

Towards optimization across linear and relational Algebra

Conference paper (2016) - Andreas Kunft (author) , Alexander Alexandrov (author) , A. Katsifodimos (author) , Volker Markl (author)

Advanced data analysis typically requires some form of preprocessing in order to extract and transform data before processing it with machine learning and statistical analysis techniques. Pre-processing pipelines are naturally expressed in dataflow APIs (e.g., MapReduce, Flink, e ...

Implicit Parallelism through Deep Language Embedding

Journal article (2016) - Alexander Alexandrov (author) , A Katsifodimos (author) , Georgi Krastev (author) , Volker Markl (author)

Parallel collection processing based on second-order functions such as map and reduce has been widely adopted for scalable data analysis. Initially popularized by Google, over the past decade this programming paradigm has found its way in the core APIs of parallel dataflow engine ...

Implicit parallelism through deep language embedding

Conference paper (2015) - Alexander Alexandrov (author) , Andreas Kunft (author) , A. Katsifodimos (author) , Felix Schüler (author) , Lauritz Thamsen (author) , Odej Kao (author) , Tobias Herb (author) , Volker Markl (author)

The appeal of MapReduce has spawned a family of systems that implement or extend it. In order to enable parallel collection processing with User-Defined Functions (UDFs), these systems expose extensions of the MapReduce programming model as library-based dataow APIs that are tigh ...