C. Evans

info

Please Note

<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>

Conference paper (2)

Journal article (1)

3 records found

TUD-SUMO

A research-oriented SUMO wrapper for traffic simulation in python

Journal article (2026) - C. Evans, M. Rinaldi, H. Taale, S. P. Hoogendoorn

TUD-SUMO is a Python wrapper for SUMO, a traffic simulation software, designed to support the development of traffic control systems, particularly adaptive systems where data is frequently transferred between a controller and the traffic environment. It provides automated data collection and a set of modular, extensible tools allowing for a wide range of scenarios and control strategies to be simulated and compared. These capabilities are accessed through a simplified interface that enables rapid prototyping of control strategies with complex interactions using minimal code, promoting ease of use and portability. TUD-SUMO has already been employed in multiple projects at Delft University of Technology, including two Horizon Europe projects and 2 transportation engineering courses. ...

Impact of Pre-training on Deep Reinforcement Learning Ramp Metering Systems

Conference paper (2025) - Callum Evans, Marco Rinaldi, Henk Taale, Serge Hoogendoorn

Pre-training is a process used to enhance the learning of deep reinforcement learning (RL) algorithms through initial guidance from an expert demonstrator. This involves training a neural network to replicate the outputs of the selected expert before allowing the RL agent to specialise and develop its own policy. This paper outlines a study that aims to analyse the impact of pre-training on deep RL algorithms used in ramp metering. Specifically, behaviour cloning is performed for increasing lengths of time (0-10,000 epochs), with ALINEA as the chosen expert algorithm guiding a proposed Proximal Policy Optimisation (PPO)-based system. The results confirm that, with the same length of training, some initial guidance through pre-training can significantly improve the system’s effectiveness in reducing congestion compared to no pre-training. Otherwise, excessive pre-training may lead to overfitting and reduced generalisability. Design issues resulting in weak model convergence, however, limit the algorithm’s overall performance in the chosen scenario. ...

A Reinforcement Learning Algorithm for Fair Electoral Redistricting in Parliamentary Systems

Conference paper (2023) - Callum Evans, Hugo Barbosa

The primary goal of a strong democracy should be to most accurately represent its electorate, and the way they are divided into electoral districts can drastically affect this. As a result, many methods have been proposed to algorithmically generate fairer boundaries, the majority of which focus on eliminating bias through qualitative measures, however, these often fail to produce truly fair results. This paper, therefore, aims to demonstrate how fairness can and should become a higher priority within our electoral systems through the development, implementation and application of a new reinforcement learning-based method for algorithmic redistricting that directly optimises for fairness. Specifically, the model has been applied to the parliamentary system of the UK, filling a significant gap within the literature, meaning the paper also outlines a new metric for measuring fairness in parliamentary systems that directly rewards proportionality, the seats-votes difference. The algorithm has then been evaluated on the current parliamentary constituency boundaries in the UK and was ultimately found to fulfil all initial goals as the algorithm was able to improve the map’s fairness in all experiments performed. The paper subsequently concludes with some of the limitations of the model and the seats-votes difference and ways the redistricting algorithm could be further expanded in the future. ...