Public safety and emergency response agencies increasingly consider deploying mobile robots as mounting climate-related disasters and security challenges place human personnel under higher risk and stress. Mobile robots, such as drones, are a promising means of responding to these challenges: they can navigate difficult, hazardous terrain, gather real-time situational data, and conduct search or reconnaissance tasks without putting humans at direct risk. However, teleoperating robots, as currently practiced, is challenging in such complex missions, since the simultaneous navigation, situation assessment, and search tasks can overload human cognitive abilities. Therefore, autonomous planning and decision-making algorithms are required to enable robots to explore and search unknown environments for targets such as missing persons or hazardous materials.
Moving towards this goal, this thesis addresses two core problems. First, local motion planning must carefully account for the information gained from sensor observations as well as collision avoidance and the robot's dynamics while moving through cluttered, unknown areas. Second, global exploration planning must strategically select where in the environment to explore to find the target quickly, especially when the environment is large or complex. Given that human operators often possess semantic knowledge about likely target locations, we hypothesize that incorporating such guidance, through observed semantic features (e.g., object or room types), into exploration planning is crucial for time-efficient autonomous search. We address these two core problems by making the following contributions.
The first contribution of the thesis is an informative local motion planning approach
that generates safe, collision-free trajectories around obstacles while minimizing uncertainty about the target locations. The critical challenge is to achieve computationally efficient planning of trajectories that maximize information gain under the robot’s kinodynamic constraints. In the proposed approach, a model predictive control (MPC) motion planner is guided by a learned viewpoint policy. The policy is trained via deep reinforcement learning (DRL) to maximize long-term information gain by providing a local subgoal to the MPC. The MPC follows the subgoal and ensures that the motion plan remains feasible and collision-free. Therefore, the robot can rapidly replan safe and informative local trajectories online. Simulation experiments demonstrate that the method achieves competitive performance in locating targets compared to a computationally expensive state-of-the-art planner using Monte Carlo Tree Search (MCTS), while allowing for significantly faster execution and replanning.
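The interplay between the learned viewpoint policy and the MPC can be illustrated with a minimal sketch. All quantities below are illustrative assumptions, not the thesis implementation: the learned DRL policy is stubbed as a fixed function returning a subgoal, the robot is a single integrator, and a short-horizon optimizer tracks the subgoal while penalizing proximity to one circular obstacle.

```python
import numpy as np
from scipy.optimize import minimize

H = 8          # planning horizon (steps); illustrative value
DT = 0.2       # time step in seconds; illustrative value
OBSTACLE = np.array([1.0, 0.5])  # one circular obstacle (hypothetical)
RADIUS = 0.4                     # obstacle radius

def viewpoint_policy(state):
    """Stand-in for the learned DRL policy: returns a local subgoal."""
    return np.array([2.0, 0.0])

def rollout(state, controls):
    """Single-integrator dynamics: position advances by control * DT."""
    traj = [state]
    for u in controls.reshape(H, 2):
        traj.append(traj[-1] + u * DT)
    return np.array(traj[1:])

def cost(controls, state, subgoal):
    traj = rollout(state, controls)
    goal_cost = np.sum((traj - subgoal) ** 2)        # track the subgoal
    dists = np.linalg.norm(traj - OBSTACLE, axis=1)
    # soft penalty when the trajectory enters a safety margin around the obstacle
    collision_cost = np.sum(np.maximum(0.0, RADIUS + 0.2 - dists) ** 2)
    return goal_cost + 100.0 * collision_cost

def mpc_step(state):
    subgoal = viewpoint_policy(state)
    u0 = np.zeros(H * 2)
    res = minimize(cost, u0, args=(state, subgoal), method="L-BFGS-B",
                   bounds=[(-1.0, 1.0)] * (H * 2))
    # receding horizon: apply only the first optimized control
    return res.x.reshape(H, 2)[0]

state = np.array([0.0, 0.0])
for _ in range(20):
    state = state + mpc_step(state) * DT
```

In the thesis, the subgoal is produced by the trained policy at every replanning step, so the MPC only ever solves a short, cheap tracking problem while the long-term information-gathering behavior lives in the policy.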
While local informative planning is crucial for exploring cluttered spaces, it often behaves myopically and inefficiently in large, complex environments. Therefore, the second contribution introduces a global target search planner that balances directed search towards semantically promising areas with complete coverage of the search space. This planner extends the idea of frontier exploration, which focuses observations on the boundaries between explored and unexplored regions, to target search, where each frontier is assigned a semantic priority. This priority represents the semantic relationship between the target and nearby objects. To minimize target search time, the planner schedules high-priority frontiers earlier by solving a custom combinatorial optimization problem that determines the visitation order. By integrating coverage gains into the frontier priorities, the planner ensures that the robot explores the environment efficiently while focusing on semantically relevant areas. We demonstrate this approach in the two studies outlined below.
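The scheduling idea can be sketched with a toy objective: minimize the sum of priority-weighted arrival times over all frontiers, so that high-priority frontiers are visited early. This brute-force enumeration and the positions, priorities, and weighting below are illustrative stand-ins for the custom combinatorial solver described above.

```python
import itertools
import numpy as np

def best_order(robot_pos, frontiers, priorities, speed=1.0):
    """Brute-force search for the visitation order minimizing the sum of
    priority-weighted arrival times (illustrative objective)."""
    n = len(frontiers)
    best, best_cost = None, float("inf")
    for perm in itertools.permutations(range(n)):
        t, pos, cost = 0.0, np.asarray(robot_pos, float), 0.0
        for i in perm:
            t += np.linalg.norm(frontiers[i] - pos) / speed  # arrival time
            cost += priorities[i] * t  # late arrival at high priority is costly
            pos = frontiers[i]
        if cost < best_cost:
            best, best_cost = perm, cost
    return best, best_cost

# Hypothetical frontiers and semantic priorities (higher = more relevant).
frontiers = np.array([[5.0, 0.0], [1.0, 0.0], [0.0, 4.0]])
priorities = np.array([0.2, 1.0, 0.5])
order, _ = best_order([0.0, 0.0], frontiers, priorities)
```

The weighted-latency structure is what makes the problem combinatorial rather than a plain shortest tour: swapping two frontiers changes not only travel distance but also when each priority is "collected".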
Large, high-quality datasets for learning target-specific semantic relationships are scarce in many real-world scenarios, especially in search and rescue. The third contribution addresses this limitation by proposing a method that learns semantic priority models from expert feedback. Rather than collecting massive amounts of labeled data, the approach exploits an expert operator's sparse guidance inputs in a few target search scenarios. The expert selects the frontier to explore next, and this choice is stored in a training dataset together with the frontier's semantic features. An expert model is then trained to approximate a priority function that predicts how relevant each frontier is to the expert. By incorporating this learned priority function into the global target search planner, the robot can autonomously prioritize semantically relevant areas according to the expert's semantic knowledge. Experiments show that a small number of expert demonstrations is sufficient for the robot to significantly improve its search efficiency and reduce the travel distance until the target is found.
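One simple way to realize such learning from sparse picks is a softmax (discrete-choice) loss: each demonstration is a set of candidate frontiers with semantic feature vectors plus the index the expert chose, and a linear scorer is trained so the chosen frontier receives the highest score. This is a minimal sketch under that assumption; the features, data, and linear model below are illustrative, not the thesis's expert model.

```python
import numpy as np

rng = np.random.default_rng(0)

def train_priority(demos, dim, lr=0.5, epochs=200):
    """Fit linear weights w so that feats @ w ranks the expert's pick first."""
    w = np.zeros(dim)
    for _ in range(epochs):
        for feats, chosen in demos:          # feats: (n_frontiers, dim)
            scores = feats @ w
            p = np.exp(scores - scores.max())
            p /= p.sum()                     # softmax over candidate frontiers
            grad = feats.T @ p - feats[chosen]  # gradient of -log p[chosen]
            w -= lr * grad
    return w

# Toy demos: feature 0 = "near a doorway", feature 1 = "near a window";
# this hypothetical expert consistently prefers doorway frontiers.
demos = []
for _ in range(5):
    feats = rng.random((4, 2))
    chosen = int(np.argmax(feats[:, 0]))     # expert picks highest doorway score
    demos.append((feats, chosen))

w = train_priority(demos, dim=2)
```

At deployment, the learned scorer can be evaluated on every current frontier, and the resulting priorities feed directly into the global visitation-order optimization.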
Lastly, the thesis extends semantic target search to three-dimensional environments by integrating it into a 3D planning pipeline for micro aerial vehicles (MAVs). The pipeline first detects objects in the environment using onboard vision and associates them with priority values computed from pre-trained large language model (LLM) embeddings. These priorities are then propagated to frontiers in a 3D voxel map, indicating the frontier regions most likely to contain the target. This enables evaluating frontier viewpoints with an information gain that accounts for both semantic priority and volumetric coverage. The viewpoint gains are then used in the combinatorial target search planner to prioritize viewpoints that most likely lead to the target while ensuring efficient coverage of the environment. By integrating the MAV's kinodynamic constraints into the planning costs, the system generates smooth, feasible trajectories in real time. Simulation studies reveal that semantically guided exploration leads to faster and more reliable target discovery than several purely coverage-based exploration baselines. Experiments with a real MAV in the lab confirm the approach's ability to autonomously navigate through complex 3D environments to a target, exploiting semantic cues, maximizing coverage, and avoiding collisions.
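The embedding-based priority computation can be sketched as follows, assuming cosine similarity between the target's embedding and the embeddings of objects detected near a frontier. The random stand-in vectors below replace real LLM embeddings, and all object names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in embedding table: in the pipeline these would come from a
# pre-trained LLM embedding model, not random vectors.
EMBED = {name: rng.standard_normal(16)
         for name in ["backpack", "bed", "oven", "desk"]}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def frontier_priority(target, nearby_objects):
    """Priority of a frontier: best semantic match between the target and
    the objects detected near that frontier (0.0 if none were seen)."""
    if not nearby_objects:
        return 0.0
    t = EMBED[target]
    return max(cosine(t, EMBED[o]) for o in nearby_objects)

# A frontier next to an instance of the target class should score highest.
p_near = frontier_priority("backpack", ["backpack", "desk"])
p_far = frontier_priority("backpack", ["oven"])
```

These per-frontier priorities are then combined with volumetric coverage gain before the viewpoints enter the combinatorial visitation-order planner.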
In summary, this thesis demonstrates how planning and learning techniques can be combined for autonomous target search and exploration. These techniques enable mobile robots to navigate unknown environments efficiently and safely while searching for targets and collecting the required information. Crucially, our proposed method for semantically guided frontier planning bridges the gap between recent learning-based navigation approaches and established planning-based approaches suitable for real-world robotic systems. By integrating semantic knowledge into robotic exploration, the proposed methods can reduce human operators' cognitive load and thereby facilitate robot deployment in scenarios such as search and rescue or reconnaissance missions.