D.H.J. Epema | TU Delft Repository

Web3 Sybil avoidance using network latency

Journal article (2023) - Quinten Stokkink (author), Can Umut Ileri (author), DHJ Epema (author), Johan Pouwelse (author)

Web3 is emerging as the new Internet-interaction model that facilitates direct collaboration between strangers without a need for prior trust between network participants and without central authorities. However, one of its shortcomings is the lack of a defense mechanism against ...

Resilient, Auditable and Secure IoT-Enabled Smart Inverter Firmware Amendments With Blockchain

Journal article (2023) - R. Akkaoui (author), Alexandru I. Ştefanov (author), P Palensky (author), Dick Epema (author)

The solar industry in residential areas has been witnessing an astonishing growth worldwide. At the heart of this transformation, affecting the edge of the electricity grid, reside smart inverters (SIs). These IoT-enabled devices aim to introduce a certain degree of intelligence ...

A Taxonomy and Lessons Learned From Blockchain Adoption Within the Internet of Energy Paradigm

Journal article (2022) - R. Akkaoui (author), Alexandru I. Ştefanov (author), P Palensky (author), Dick Epema (author)

The concept of the internet of energy (IoE) emerged as an innovative paradigm to encompass all the complex and intertwined notions relevant to the transition of current smart grids towards more decarbonization, digitalization and decentralization. With a focus on the last two aspects, the number of intelligent devices being connected in a scattered way to the existing power grid is ever-growing. Nevertheless, guaranteeing cyber-secure and resilient control of these IoE components, as well as seamless and reliable delivery of electricity services such as renewable energy exchange, electric vehicle charging, and demand response, might be the bottleneck of current power systems, which still largely follow a centralized approach. Thus, the future power grid would gradually incorporate a growing number of distributed control schemes to deal with this challenge, and many believe that blockchain could be a key enabler in this transition because its characteristics are consistent with multiple requirements of future power systems. In this paper, we provide an extensive state-of-the-art review of blockchain-based additions to the IoE. We first introduce various concepts related to blockchain and discuss the rationale behind its adoption in the context of IoE. Then, unlike the existing body of literature surveys, we not only provide a taxonomy and evaluate a wide range of recent research outputs that integrated blockchain within modern power systems, but also draw some valuable lessons learned for each studied category and discuss the intersection of blockchain with various emerging paradigms that have the potential to radically impact the smart grid. In addition, we present some real-world industrial initiatives and ongoing projects built on top of blockchain and dedicated to offering diverse electricity services, with a case study of a pilot project on energy trading in Amsterdam. Finally, we discuss the remaining challenges and worthwhile opportunities of deploying blockchain in this particular area, with a focus on the aspect of operational cyber-security.

A novel decentralized platform for peer-to-peer energy trading market with blockchain technology

Journal article (2021) - A.A.S. Esmat (author), M.A. de Vos (author), Yashar Ghiassi-Farrokhfal (author), P. Palensky (author), Dick Epema (author)

Peer-to-Peer (P2P) energy trading, which allows energy consumers/producers to directly trade with each other, is one of the new paradigms driven by the decarbonization, decentralization, and digitalization of the energy supply chain. Additionally, the rise of blockchain technolog ...

A Truly Self-Sovereign Identity System

Conference paper (2021) - Quinten Stokkink (author), G. Ishmaev (author), D.H.J. Epema (author), Johan Pouwelse (author)

Existing digital identity management systems fail to deliver the desirable properties of control by the users of their own identity data, credibility of disclosed identity data, and network-level anonymity. The recently proposed Self-Sovereign Identity (SSI) approach promises to ...

How Lightning's Routing Diminishes its Anonymity

Conference paper (2021) - Satwik Prabhu Kumble (author), DHJ Epema (author), Stefanie Roos (author)

Lightning, the prevailing solution to Bitcoin's scalability issue, uses onion routing to hide senders and recipients of payments. Yet, the path between the sender and the recipient along which payments are routed is selected such that it is short, cost efficient, and fast. The lo ...

Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers

Journal article (2021) - Long Cheng (author), Ji Yin Wang (author), Qingzhi Liu (author), D.H.J. Epema (author), C. Liu (author), Ying Mao (author), John Murphy (author)

Large data centers are currently the mainstream infrastructures for big data processing. As one of the most fundamental tasks in these environments, the efficient execution of distributed data operators (e.g., join and aggregation) is still challenging current data systems, and ...

Self-adaptive Executors for Big Data Processing

Conference paper (2019) - S. Omranian Khorasani (author), Jan S. Rellermeyer (author), D.H.J. Epema (author)

The demand for additional performance due to the rapid increase in the size and importance of data-intensive applications has considerably elevated the complexity of computer architecture. In response, systems offer pre-determined behaviors based on heuristics and then expose a l ...

Fair multiple-workflow scheduling with different quality-of-service goals

Journal article (2019) - Amin Rezaeian (author), Mahmoud Naghibzadeh (author), Dick H.J. Epema (author)

Cloud schedulers that allocate resources exclusively to single workflows are not work-conserving as they may be forced to leave gaps in their schedules because of the precedence constraints in the workflows. Thus, they may lead to a waste of financial resources. This problem can ...

An Experimental Performance Evaluation of Autoscalers for Complex Workflows

Journal article (2018) - A.S. Ilyushkin (author), Ahmed Ali-Eldin (author), Nikolas Herbst (author), André Bauer (author), Alessandro Papadopoulos (author), DHJ Epema (author), Alex Iosup (author)

Elasticity is one of the main features of cloud computing allowing customers to scale their resources based on the workload. Many autoscalers have been proposed in the past decade to decide on behalf of cloud customers when and how to provision resources to a cloud application based on the workload utilizing cloud elasticity features. However, in prior work, when a new policy is proposed, it is seldom compared to the state-of-the-art, and is often compared only to static provisioning using a predefined quality of service target. This reduces the ability of cloud customers and of cloud operators to choose and deploy an autoscaling policy, as there is seldom enough analysis on the performance of the autoscalers in different operating conditions and with different applications. In our work, we conduct an experimental performance evaluation of autoscaling policies, using as application model workflows, a popular formalism for automating resource management for applications with well-defined yet complex structures. We present a detailed comparative study of general state-of-the-art autoscaling policies, along with two new workflow-specific policies. To understand the performance differences between the seven policies, we conduct various experiments and compare their performance in both pairwise and group comparisons. We report both individual and aggregated metrics. As many workflows have deadline requirements on the tasks, we study the effect of autoscaling on workflow deadlines. Additionally, we look into the effect of autoscaling on the accounted and hourly based charged costs, and we evaluate performance variability caused by the autoscaler selection for each group of workflow sizes. Our results highlight the trade-offs between the suggested policies, how they can impact meeting the deadlines, and how they perform in different operating conditions, thus enabling a better understanding of the current state-of-the-art.

The Impact of Task Runtime Estimate Accuracy on Scheduling Workloads of Workflows

Conference paper (2018) - A.S. Ilyushkin (author), Dick H.J. Epema (author)

Workflow schedulers often rely on task runtime estimates when making scheduling decisions, and they usually target the scheduling of a single workflow or batches of workflows. In contrast, in this paper, we evaluate the impact of the absence or limited accuracy of task runtime es ...

Achieving Performance Balance among Spark Frameworks with Two-Level Schedulers

Conference paper (2018) - Aleksandra Kuzmanovska (author), Hans van den Bogert (author), Rudolf H. Mak (author), Dick H.J. Epema (author)

When multiple data-processing frameworks with time-varying workloads are simultaneously present in a single cluster or data-center, an apparent goal is to have them experience equal performance, expressed in whatever performance metrics are applicable. In modern data-center envir ...

An Experimental Performance Evaluation of Autoscaling Policies for Complex Workflows

Conference paper (2017) - A.S. Ilyushkin (author), Ahmed Ali-Eldin (author), Nikolas Herbst (author), Alessandro Papadopoulos (author), B.I. Ghit (author), Dick H.J. Epema (author), A Iosup (author)

Simplifying the task of resource management and scheduling for customers, while still delivering complex Quality-of-Service (QoS), is key to cloud computing. Many autoscaling policies have been proposed in the past decade to decide on behalf of cloud customers when and how to pro ...

A Coflow-based Co-optimization Framework for High-performance Data Analytics

Conference paper (2017) - Long Cheng (author), Ying Wang (author), Yulong Pei (author), D.H.J. Epema (author)

Efficient execution of distributed database operators such as joining and aggregating is critical for the performance of big data analytics. With the increase of the compute speedup of modern CPUs, reducing the network communication time of these operators in large systems is ...

Better Safe than Sorry: Grappling with Failures of In-Memory Data Analytics Frameworks

Conference paper (2017) - B.I. Ghit (author), D.H.J. Epema (author)

Providing fault-tolerance is of major importance for data analytics frameworks such as Hadoop and Spark, which are typically deployed in large clusters that are known to experience high failure rates. Unexpected events such as compute node failures are in particular an important ...

Modeling, Analysis, and Experimental Comparison of Streaming Graph-Partitioning Policies

Journal article (2017) - Y. Guo (author), Sungpack Hong (author), Hassan Chafi (author), A Iosup (author), Dick H.J. Epema (author)

In recent years, many distributed graph-processing systems have been designed and developed to analyze large-scale graphs. For all distributed graph-processing systems, partitioning graphs is a key part of processing and an important aspect to achieve good processing performance. ...

When Game Becomes Life: The Creators and Spectators of Online Game Replays and Live Streaming

Journal article (2016) - Adele Jia (author), S Shen (author), Dick Epema (author), A. Iosup (author)

Online gaming franchises such as World of Tanks, Defense of the Ancients, and StarCraft have attracted hundreds of millions of users who, apart from playing the game, also socialize with each other through gaming and viewing gamecasts. As a form of User Generated Content (UGC), gamecasts play an important role in user entertainment and gamer education. They deserve the attention of both industrial partners and the academic communities, corresponding to the large amount of revenue involved and the interesting research problems associated with UGC sites and social networks. Although previous work has put much effort into analyzing general UGC sites such as YouTube, relatively little is known about the gamecast sharing sites. In this work, we provide the first comprehensive study of gamecast sharing sites, including commercial streaming-based sites such as Amazon's Twitch.tv and community-maintained replay-based sites such as WoTreplays. We collect and share a novel dataset on WoTreplays that includes more than 380,000 game replays, shared by more than 60,000 creators with more than 1.9 million gamers. Together with an earlier published dataset on Twitch.tv, we investigate basic characteristics of gamecast sharing sites, and we analyze the activities of their creators and spectators. Among our results, we find that (i) WoTreplays and Twitch.tv are both fast-consumed repositories, with millions of gamecasts being uploaded, viewed, and soon forgotten; (ii) both the gamecasts and the creators exhibit highly skewed popularity, with a significant heavy tail phenomenon; and (iii) the upload and download preferences of creators and spectators are different: while the creators emphasize their individual skills, the spectators appreciate team-wise tactics. Our findings provide important knowledge for infrastructure and service improvement, for example, in the design of proper resource allocation mechanisms that consider future gamecasting and in the tuning of incentive policies that further help player retention.

Tyrex: Size-Based Resource Allocation in MapReduce Frameworks

Conference paper (2016) - B.I. Ghit (author), Dick Epema (author)

Many large-scale data analytics infrastructures are employed for a wide variety of jobs, ranging from short interactive queries to large data analysis jobs that may take hours or even days to complete. As a consequence, data-processing frameworks like MapReduce may have workloads ...

Design and Experimental Evaluation of Distributed Heterogeneous Graph-Processing Systems

Conference paper (2016) - Yong Guo (author), Ana Varbanescu (author), Dick H.J. Epema (author), Alex Iosup (author)

Graph processing is increasingly used in a variety of domains, from engineering to logistics and from scientific computing to online gaming. To process graphs efficiently, GPU-enabled graph-processing systems such as TOTEM and Medusa exploit the GPU or the combined CPU+GPU capabi ...

A Medium-Scale Distributed System for Computer Science Research: Infrastructure for the Long Term

Journal article (2016) - Henri Bal (author), DHJ Epema (author), Cees de Laat (author), Rob van Nieuwpoort (author), John Romein (author), Frank Seinstra (author), Cees Snoek (author), Harry Wijshoff (author)

The Dutch Advanced School for Computing and Imaging has built five generations of a 200-node distributed system over nearly two decades while remaining aligned with the shifting computer science research agenda. The system has supported years of award-winning research, underlinin ...