L. Ferranti
Please Note
41 records found
1
Bayesian Optimization for Auto-tuning Cascaded PID Control of an (e)ROV
From manually tuned PID control to a Bayesian Optimized auto-tuned variant
This thesis presents a data-driven auto-tuning workflow for cascaded PI control on industrial eROVs. The workflow incorporates a standardised evaluation protocol, safety screening mechanisms, a composite cost function capturing multiple closed-loop performance criteria, and a hyperparameter configuration that generalises across axes and platforms without per-axis adjustment. It is validated on two physically distinct platforms differing by a factor of approximately 4.6 in mass: one in high-fidelity simulation (four independent tuning runs) and one on real hardware at an operational test site (two runs). Across all runs, the auto-tuned gains consistently match or outperform the manually tuned baseline on every axis where the evaluation protocol provides informative signals, with substantial cost reductions observed on both platforms. Residual cases where the workflow does not improve upon the baseline are traced to identifiable limitations in the evaluation protocol rather than in the optimiser itself. The auto-tuned gains for several axes have been transferred to the production vehicle, and independent pilot assessments confirm clearly improved closed-loop behaviour.
The workflow is ready for industrial deployment as a drop-in replacement for manual cascaded PI tuning on eROVs. ...
This thesis presents a data-driven auto-tuning workflow for cascaded PI control on industrial eROVs. The workflow incorporates a standardised evaluation protocol, safety screening mechanisms, a composite cost function capturing multiple closed-loop performance criteria, and a hyperparameter configuration that generalises across axes and platforms without per-axis adjustment. It is validated on two physically distinct platforms differing by a factor of approximately 4.6 in mass: one in high-fidelity simulation (four independent tuning runs) and one on real hardware at an operational test site (two runs). Across all runs, the auto-tuned gains consistently match or outperform the manually tuned baseline on every axis where the evaluation protocol provides informative signals, with substantial cost reductions observed on both platforms. Residual cases where the workflow does not improve upon the baseline are traced to identifiable limitations in the evaluation protocol rather than in the optimiser itself. The auto-tuned gains for several axes have been transferred to the production vehicle, and independent pilot assessments confirm clearly improved closed-loop behaviour.
The workflow is ready for industrial deployment as a drop-in replacement for manual cascaded PI tuning on eROVs.
To address this gap, we collected two tactile datasets using a 10-whisker MEMS-based sensor array swept across three object geometries under varying contact conditions. A compact fully spiking network with 901 parameters achieved 100% test accuracy, outperforming larger spiking architectures; classification was reliable from as few as 20 timesteps of contact, and raw pressure signals alone were sufficient. Generalization experiments showed that sweep speed had minimal effect on performance, while indentation depth and sweep direction introduced larger domain shifts. Compared to a structurally equivalent non-spiking baseline, the spiking model matched classification accuracy with an estimated 9.5% reduction in overall inference energy under analog input and a potential 95% reduction under spike-encoded input.
These results demonstrate the feasibility of a fully bio-inspired pipeline — from biomimetic whisker sensing to spiking neural inference — for tactile object classification. ...
To address this gap, we collected two tactile datasets using a 10-whisker MEMS-based sensor array swept across three object geometries under varying contact conditions. A compact fully spiking network with 901 parameters achieved 100% test accuracy, outperforming larger spiking architectures; classification was reliable from as few as 20 timesteps of contact, and raw pressure signals alone were sufficient. Generalization experiments showed that sweep speed had minimal effect on performance, while indentation depth and sweep direction introduced larger domain shifts. Compared to a structurally equivalent non-spiking baseline, the spiking model matched classification accuracy with an estimated 9.5% reduction in overall inference energy under analog input and a potential 95% reduction under spike-encoded input.
These results demonstrate the feasibility of a fully bio-inspired pipeline — from biomimetic whisker sensing to spiking neural inference — for tactile object classification.
Towards Federated Diffusion for Robot Navigation
Distributed Training of Generative Control Policies
In these safety-critical missions, robots are expected to support human first responders by exploring unknown and potentially hazardous environments, including buildings and natural terrains. To reduce the cognitive and operational burden on human operators, robots must navigate cluttered spaces both autonomously and safely. In the context of this thesis, autonomy refers to onboard decision-making and safety to avoiding collisions despite uncertainties like sensor noise or environmental effects.
This thesis explores autonomous and safe navigation using model predictive control (MPC), a planning and control strategy that predicts future behavior based on a system model and optimizes actions accordingly. MPC’s strength lies in its ability to handle constraints such as actuator limits and obstacle avoidance. However, applying MPC in real-world navigation presents several challenges, which this thesis addresses through the following contributions.
Navigating a mobile robot through cluttered environments requires it to follow dynamic trajectories while avoiding obstacles. Although MPC is well-suited for this task, recent methods for collision-free trajectory tracking often rely on complex mathematical formulations that can be difficult to interpret and apply. Chapter 2 aims to bridge this gap by offering a structured, step-by-step guide. It explains how to model the robot’s navigation problem, formulate the corresponding MPC problem, and establish performance and safety guarantees.
The chapter presents three formulations: a nominal MPC approach for ideal conditions, a robust MPC formulation that accounts for bounded disturbances, and a robust output-feedback MPC formulation that additionally handles measurement noise. Each formulation is supported by theoretical insights and practical considerations. While not exhaustive, the guide is intended to support researchers and practitioners in implementing MPC-based navigation under varying levels of uncertainty.
To enable autonomy, the MPC formulations introduced in Chapter 2 must operate in real time. This is challenging because MPC relies on solving an optimization problem at each time step, which can be computationally demanding. Planning long-term trajectories and computing control commands at high frequency on embedded hardware is especially difficult.
Chapter 3 addresses this by introducing a hierarchical MPC (HMPC) framework that separates planning and control into two layers. The planning MPC handles long-term trajectory generation at a lower frequency, while the tracking MPC focuses on short-term execution at a higher frequency. This separation allows the use of complex nonlinear models in both layers without compromising real-time performance. The tracking MPC layer, freed from long-term planning, can focus on precise tracking, improving stability and responsiveness.
The HMPC framework also includes a method for generating consistent collision avoidance constraints. Its effectiveness is demonstrated through simulations and lab experiments, showing safer and approximately four times faster goal-reaching compared to a single-layer MPC approach.
While the HMPC framework improves autonomy, it leverages a nominal MPC formulation that assumes perfect models and accurate state data. In practice, mobile robots operate in the presence of model uncertainties and noisy measurements, which can lead to constraint violations such as collisions.
Chapter 4 extends the HMPC framework to address these issues by incorporating robust output-feedback MPC into the tracking layer. This extension, called robust output-feedback hierarchical MPC (ROHMPC), provides formal safety guarantees even in the presence of disturbances and measurement noise. Synthesizing the ROHMPC scheme requires knowledge of uncertainty bounds, which are typically unknown.
To overcome this, the chapter introduces an efficient and modular pipeline that estimates these bounds from experimental data, performs necessary offline computations, calibrates constraint tightening to reduce conservatism, and implements the complete control scheme. The pipeline is released as an open-source software package to support reproducibility and future research. Using this pipeline, the chapter demonstrates the successful validation of the ROHMPC framework properties on a simulated quadcopter platform in Gazebo, with reproducible results.
The HMPC and ROHMPC frameworks address autonomy and safety respectively, but successful deployment in real-world scenarios also depends on the reliability of the system. A key aspect of reliability is reproducibility: the ability to consistently generate similar results.
Chapter 5 explores this concept in the context of robotics, defining reproducibility and analyzing how it applies to the hardware-software setups used in earlier chapters. The HMPC framework satisfies method reproducibility, meaning its implementation can be consistently reproduced. However, due to asynchronous processes and non-deterministic code components, it does not fully achieve results reproducibility.
In contrast, the ROHMPC framework satisfies both method and results reproducibility, reinforcing the credibility of the framework. By raising awareness of reproducibility challenges and offering practical insights, this chapter aims to support the development of more robust and trustworthy robotic systems.
All results presented in this thesis have been made publicly accessible through submissions to peer-reviewed venues, an open-access preprint server, and the release of open-source software packages. These results highlight the effectiveness of hierarchical MPC in both simulation and laboratory settings, and demonstrate how formal safety guarantees can be achieved under uncertainty.
To support future research, Chapter 6 summarizes the key contributions and outlines several promising directions for further exploration. These include extending the proposed algorithms to 3D environments, integrating onboard sensing for autonomous outdoor navigation, and incorporating data-driven methods to reduce conservatism. ...
In these safety-critical missions, robots are expected to support human first responders by exploring unknown and potentially hazardous environments, including buildings and natural terrains. To reduce the cognitive and operational burden on human operators, robots must navigate cluttered spaces both autonomously and safely. In the context of this thesis, autonomy refers to onboard decision-making and safety to avoiding collisions despite uncertainties like sensor noise or environmental effects.
This thesis explores autonomous and safe navigation using model predictive control (MPC), a planning and control strategy that predicts future behavior based on a system model and optimizes actions accordingly. MPC’s strength lies in its ability to handle constraints such as actuator limits and obstacle avoidance. However, applying MPC in real-world navigation presents several challenges, which this thesis addresses through the following contributions.
Navigating a mobile robot through cluttered environments requires it to follow dynamic trajectories while avoiding obstacles. Although MPC is well-suited for this task, recent methods for collision-free trajectory tracking often rely on complex mathematical formulations that can be difficult to interpret and apply. Chapter 2 aims to bridge this gap by offering a structured, step-by-step guide. It explains how to model the robot’s navigation problem, formulate the corresponding MPC problem, and establish performance and safety guarantees.
The chapter presents three formulations: a nominal MPC approach for ideal conditions, a robust MPC formulation that accounts for bounded disturbances, and a robust output-feedback MPC formulation that additionally handles measurement noise. Each formulation is supported by theoretical insights and practical considerations. While not exhaustive, the guide is intended to support researchers and practitioners in implementing MPC-based navigation under varying levels of uncertainty.
To enable autonomy, the MPC formulations introduced in Chapter 2 must operate in real time. This is challenging because MPC relies on solving an optimization problem at each time step, which can be computationally demanding. Planning long-term trajectories and computing control commands at high frequency on embedded hardware is especially difficult.
Chapter 3 addresses this by introducing a hierarchical MPC (HMPC) framework that separates planning and control into two layers. The planning MPC handles long-term trajectory generation at a lower frequency, while the tracking MPC focuses on short-term execution at a higher frequency. This separation allows the use of complex nonlinear models in both layers without compromising real-time performance. The tracking MPC layer, freed from long-term planning, can focus on precise tracking, improving stability and responsiveness.
The HMPC framework also includes a method for generating consistent collision avoidance constraints. Its effectiveness is demonstrated through simulations and lab experiments, showing safer and approximately four times faster goal-reaching compared to a single-layer MPC approach.
While the HMPC framework improves autonomy, it leverages a nominal MPC formulation that assumes perfect models and accurate state data. In practice, mobile robots operate in the presence of model uncertainties and noisy measurements, which can lead to constraint violations such as collisions.
Chapter 4 extends the HMPC framework to address these issues by incorporating robust output-feedback MPC into the tracking layer. This extension, called robust output-feedback hierarchical MPC (ROHMPC), provides formal safety guarantees even in the presence of disturbances and measurement noise. Synthesizing the ROHMPC scheme requires knowledge of uncertainty bounds, which are typically unknown.
To overcome this, the chapter introduces an efficient and modular pipeline that estimates these bounds from experimental data, performs necessary offline computations, calibrates constraint tightening to reduce conservatism, and implements the complete control scheme. The pipeline is released as an open-source software package to support reproducibility and future research. Using this pipeline, the chapter demonstrates the successful validation of the ROHMPC framework properties on a simulated quadcopter platform in Gazebo, with reproducible results.
The HMPC and ROHMPC frameworks address autonomy and safety respectively, but successful deployment in real-world scenarios also depends on the reliability of the system. A key aspect of reliability is reproducibility: the ability to consistently generate similar results.
Chapter 5 explores this concept in the context of robotics, defining reproducibility and analyzing how it applies to the hardware-software setups used in earlier chapters. The HMPC framework satisfies method reproducibility, meaning its implementation can be consistently reproduced. However, due to asynchronous processes and non-deterministic code components, it does not fully achieve results reproducibility.
In contrast, the ROHMPC framework satisfies both method and results reproducibility, reinforcing the credibility of the framework. By raising awareness of reproducibility challenges and offering practical insights, this chapter aims to support the development of more robust and trustworthy robotic systems.
All results presented in this thesis have been made publicly accessible through submissions to peer-reviewed venues, an open-access preprint server, and the release of open-source software packages. These results highlight the effectiveness of hierarchical MPC in both simulation and laboratory settings, and demonstrate how formal safety guarantees can be achieved under uncertainty.
To support future research, Chapter 6 summarizes the key contributions and outlines several promising directions for further exploration. These include extending the proposed algorithms to 3D environments, integrating onboard sensing for autonomous outdoor navigation, and incorporating data-driven methods to reduce conservatism.
Three limitations in the state of the art motivate the research objectives of this thesis. First, existing small-scale car-like platforms either remain relatively expensive for multi-agent experiments, depend on custom components that hinder reproducibility, or lack rigorous system-identification workflows needed for model-based control and sim-to-hardware transfer. Second, cooperative driving methods, for example platooning, often rely on timely attack detection or hard switching to degraded modes, leaving vulnerability windows and risking undesirable transients, moreover, adversarial robustness at the coordination and topology-management level, such as merge, split, rearrange, is less developed than low-level spacing control. Third, widely used MPCC formulations can become unreliable on highly curved paths because internal progress approximations drift under large deviations, which can degrade constraint handling, racing-oriented MPC variants frequently depend on offline priors, for example precomputed racelines, limiting transfer to emergency-like maneuvers where only local geometry is available. Accordingly, this thesis aims to develop: (I) a reproducible, low-cost experimental platform with a complete identification pipeline, (II) cooperative driving algorithms that preserve safety even under persistent or undetected communication attacks and remain feasible under actuator saturation, and (III) curvature-aware MPCC formulations that remain reliable on tight curvature and avoid dependence on offline priors in high-performance settings.
The thesis begins with the design and characterization of the Delft Autonomous-driving Robotic Testbed (DART). DART is a low-cost, reproducible, small-scale vehicle built on a commercial RC chassis and augmented with additional sensing and computational capabilities, including Lidar, an IMU, custom wheel encoders, and onboard computing. The platform preserves the essential features of full-scale vehicle dynamics, Ackermann steering, suspensions, electric motor, while remaining small enough to work with in laboratory settings. A central component is a comprehensive system identification procedure that yields reliable kinematic and vehicle dynamics models tailored to small-scale vehicles, along with sub-models for motor force, friction, steering actuation, and tyre lateral forces. This modelling pipeline enables realistic testing and accurate model-based control on hardware.
The thesis then shifts focus to cooperative multi-vehicle systems by introducing a distributed, attack-resilient platooning framework. This work addresses two intertwined challenges: maintaining safety and formation integrity under malicious interference on the communication channels, and enabling coordinated platoon-level decisions such as merging, splitting, and rearranging. At the control level, the method combines sensor-based Adaptive Cruise Control with communication-based Cooperative Adaptive Cruise Control, while a safety filter ensures collision avoidance even when communicated acceleration data is corrupted. At the coordination level, a distributed topology-management strategy detects inconsistent information and isolates compromised vehicles by reorganizing the platoon. The approach is validated in simulation and experimentally on multiple DART vehicles, showing that safety and string stability are preserved despite communication attacks.
With this platform at hand, we then focus on designing a motion-planning strategy for urban driving. We investigate model predictive control and develop a Curvature-Aware Model Predictive Contouring Control (CA-MPCC) framework. Traditional MPCC formulations assume low curvature and rely on a lag-error term that couples progression along the reference path with lateral tracking. This complicates tuning and reduces reliability in tight curves. The CA-MPCC formulation resolves these limitations by explicitly accounting for curvature in the path geometry, removing the lag-error term entirely and simplifying both the cost structure and the tuning process. The method is validated in simulation and on the DART platform, demonstrating improved robustness, reduced parameter sensitivity, and reliable real-time performance even on highly curved trajectories.
Next, the thesis extends the curvature-aware MPCC methodology to high-performance domains with the rCA-MPCC framework for autonomous racing. Racing introduces additional challenges, such as operating at the limits of handling, rapid curvature changes, and strong coupling between dynamical states. The rCA-MPCC formulation augments CA-MPCC with a curvature-informed terminal cost and a compact reference-path representation that avoids dependence on precomputed race line information. This chapter also expands the physical vehicle model through actuator-dynamics identification and residual dynamic modeling using Gaussian Processes. Together, these advances improve prediction accuracy and enable robust high-speed control. We furthermore extend the car-racing formulation to aerial drone racing, bridging the gap between the two communities. Experiments on small-scale cars and simulations on quadrotor drones demonstrate faster, more consistent lap times and robustness across different dynamic model choices.
Finally, the thesis demonstrates how the developed platform and control tools can be applied to cooperative multi-robot tasks such as persistent monitoring and target detection. The chapter combines DART with the CA-MPCC controller to implement Lissajous-curve-based coordinated trajectories generated by time-inverted Kuramoto dynamics. The result is a distributed coordination strategy that guarantees collision avoidance and complete area coverage. Experimental validation confirms that the high-level coordination method and low-level CA-MPCC controller integrate smoothly on real hardware, even under disturbances and temporary agent failures.
Overall, this thesis contributes a unified framework for physically grounded experimentation, accurate dynamics modeling, and robust control and coordination in autonomous driving. By developing a reproducible platform, constructing reliable models, and demonstrating advanced control strategies, from emergency maneuvers to cooperative platooning and multi-agent monitoring, it lowers the barrier to real-world validation and accelerates iteration cycles for autonomous driving research. The insights presented here support future work toward safer, more robust, and more efficient autonomous transportation systems. ...
Three limitations in the state of the art motivate the research objectives of this thesis. First, existing small-scale car-like platforms either remain relatively expensive for multi-agent experiments, depend on custom components that hinder reproducibility, or lack rigorous system-identification workflows needed for model-based control and sim-to-hardware transfer. Second, cooperative driving methods, for example platooning, often rely on timely attack detection or hard switching to degraded modes, leaving vulnerability windows and risking undesirable transients, moreover, adversarial robustness at the coordination and topology-management level, such as merge, split, rearrange, is less developed than low-level spacing control. Third, widely used MPCC formulations can become unreliable on highly curved paths because internal progress approximations drift under large deviations, which can degrade constraint handling, racing-oriented MPC variants frequently depend on offline priors, for example precomputed racelines, limiting transfer to emergency-like maneuvers where only local geometry is available. Accordingly, this thesis aims to develop: (I) a reproducible, low-cost experimental platform with a complete identification pipeline, (II) cooperative driving algorithms that preserve safety even under persistent or undetected communication attacks and remain feasible under actuator saturation, and (III) curvature-aware MPCC formulations that remain reliable on tight curvature and avoid dependence on offline priors in high-performance settings.
The thesis begins with the design and characterization of the Delft Autonomous-driving Robotic Testbed (DART). DART is a low-cost, reproducible, small-scale vehicle built on a commercial RC chassis and augmented with additional sensing and computational capabilities, including Lidar, an IMU, custom wheel encoders, and onboard computing. The platform preserves the essential features of full-scale vehicle dynamics, Ackermann steering, suspensions, electric motor, while remaining small enough to work with in laboratory settings. A central component is a comprehensive system identification procedure that yields reliable kinematic and vehicle dynamics models tailored to small-scale vehicles, along with sub-models for motor force, friction, steering actuation, and tyre lateral forces. This modelling pipeline enables realistic testing and accurate model-based control on hardware.
The thesis then shifts focus to cooperative multi-vehicle systems by introducing a distributed, attack-resilient platooning framework. This work addresses two intertwined challenges: maintaining safety and formation integrity under malicious interference on the communication channels, and enabling coordinated platoon-level decisions such as merging, splitting, and rearranging. At the control level, the method combines sensor-based Adaptive Cruise Control with communication-based Cooperative Adaptive Cruise Control, while a safety filter ensures collision avoidance even when communicated acceleration data is corrupted. At the coordination level, a distributed topology-management strategy detects inconsistent information and isolates compromised vehicles by reorganizing the platoon. The approach is validated in simulation and experimentally on multiple DART vehicles, showing that safety and string stability are preserved despite communication attacks.
With this platform at hand, we then focus on designing a motion-planning strategy for urban driving. We investigate model predictive control and develop a Curvature-Aware Model Predictive Contouring Control (CA-MPCC) framework. Traditional MPCC formulations assume low curvature and rely on a lag-error term that couples progression along the reference path with lateral tracking. This complicates tuning and reduces reliability in tight curves. The CA-MPCC formulation resolves these limitations by explicitly accounting for curvature in the path geometry, removing the lag-error term entirely and simplifying both the cost structure and the tuning process. The method is validated in simulation and on the DART platform, demonstrating improved robustness, reduced parameter sensitivity, and reliable real-time performance even on highly curved trajectories.
Next, the thesis extends the curvature-aware MPCC methodology to high-performance domains with the rCA-MPCC framework for autonomous racing. Racing introduces additional challenges, such as operating at the limits of handling, rapid curvature changes, and strong coupling between dynamical states. The rCA-MPCC formulation augments CA-MPCC with a curvature-informed terminal cost and a compact reference-path representation that avoids dependence on precomputed race line information. This chapter also expands the physical vehicle model through actuator-dynamics identification and residual dynamic modeling using Gaussian Processes. Together, these advances improve prediction accuracy and enable robust high-speed control. We furthermore extend the car-racing formulation to aerial drone racing, bridging the gap between the two communities. Experiments on small-scale cars and simulations on quadrotor drones demonstrate faster, more consistent lap times and robustness across different dynamic model choices.
Finally, the thesis demonstrates how the developed platform and control tools can be applied to cooperative multi-robot tasks such as persistent monitoring and target detection. The chapter combines DART with the CA-MPCC controller to implement Lissajous-curve-based coordinated trajectories generated by time-inverted Kuramoto dynamics. The result is a distributed coordination strategy that guarantees collision avoidance and complete area coverage. Experimental validation confirms that the high-level coordination method and low-level CA-MPCC controller integrate smoothly on real hardware, even under disturbances and temporary agent failures.
Overall, this thesis contributes a unified framework for physically grounded experimentation, accurate dynamics modeling, and robust control and coordination in autonomous driving. By developing a reproducible platform, constructing reliable models, and demonstrating advanced control strategies, from emergency maneuvers to cooperative platooning and multi-agent monitoring, it lowers the barrier to real-world validation and accelerates iteration cycles for autonomous driving research. The insights presented here support future work toward safer, more robust, and more efficient autonomous transportation systems.
This dissertation addresses these fundamental challenges by framing robot motion planning for interactive scenarios through the lens of non-cooperative game theory. This perspective provides a principled mathematical framework for modeling multiple self-interested decision-makers who act simultaneously with partially aligned objectives. Focusing on environments where a single controlled robot interacts with uncontrolled agents whose intents are not known a priori, this dissertation develops a comprehensive suite of game-theoretic tools. These tools enable robots to infer underlying intents from observations and generate motion plans that capture the complex interdependence of self-interested decision-making.
The core contributions of this dissertation span intent inference, real-time adaptation, uncertainty-aware planning, computational efficiency, and complex non-smooth dynamics.
First, we formalize the problem of learning unknown intents from observed past behavior as an inverse game. By casting this as maximum-likelihood estimation with equilibrium constraints, our transcription jointly estimates game parameters, hidden states, and future decisions, significantly improving inference accuracy. Second, we tightly integrate these inverse games with online planning. We propose a novel solution technique handling inequality constraints with a first-order update rule for amortized inference, yielding a game-theoretic planner that dynamically adapts to evolving intent estimates.
Third, we address situations demanding explicit reasoning over a distribution of possible intents. We introduce contingency games, an uncertainty-aware planning technique that jointly generates multi-hypothesis predictions of others alongside conditional plans for the robot. By explicitly anticipating future information gains, this approach seamlessly bridges the gap between conservatively ignoring uncertainty and assuming it will never resolve. Fourth, to alleviate the substantial computational burden of online game-theoretic planning, we introduce an amortized solver for mixed strategies. An offline model learns to propose dynamically feasible trajectory candidates, while a discrete game solved online rapidly computes competitive mixed Nash equilibria.
Finally, we tackle interaction domains with inherently non-smooth dynamics, such as multi-agent manipulation, where constraints are not continuously differentiable. We propose a data-driven approach leveraging probabilistic inference and generative diffusion models. This blends learning from single-agent demonstrations with reasoning about joint multi-agent costs, discovering collaborative strategies without requiring massive multi-agent datasets.
In summary, this dissertation advances interactive motion planning by equipping robots to accurately infer intents, act safely under uncertainty, and navigate complex interactions. These algorithmic contributions are extensively validated via simulation and ground robots across autonomous driving, mobile navigation, and multi-agent manipulation, accompanied by open-source libraries to accelerate future research. ...
This dissertation addresses these fundamental challenges by framing robot motion planning for interactive scenarios through the lens of non-cooperative game theory. This perspective provides a principled mathematical framework for modeling multiple self-interested decision-makers who act simultaneously with partially aligned objectives. Focusing on environments where a single controlled robot interacts with uncontrolled agents whose intents are not known a priori, this dissertation develops a comprehensive suite of game-theoretic tools. These tools enable robots to infer underlying intents from observations and generate motion plans that capture the complex interdependence of self-interested decision-making.
The core contributions of this dissertation span intent inference, real-time adaptation, uncertainty-aware planning, computational efficiency, and complex non-smooth dynamics.
First, we formalize the problem of learning unknown intents from observed past behavior as an inverse game. By casting this as maximum-likelihood estimation with equilibrium constraints, our transcription jointly estimates game parameters, hidden states, and future decisions, significantly improving inference accuracy. Second, we tightly integrate these inverse games with online planning. We propose a novel solution technique handling inequality constraints with a first-order update rule for amortized inference, yielding a game-theoretic planner that dynamically adapts to evolving intent estimates.
Third, we address situations demanding explicit reasoning over a distribution of possible intents. We introduce contingency games, an uncertainty-aware planning technique that jointly generates multi-hypothesis predictions of others alongside conditional plans for the robot. By explicitly anticipating future information gains, this approach seamlessly bridges the gap between conservatively ignoring uncertainty and assuming it will never resolve. Fourth, to alleviate the substantial computational burden of online game-theoretic planning, we introduce an amortized solver for mixed strategies. An offline model learns to propose dynamically feasible trajectory candidates, while a discrete game solved online rapidly computes competitive mixed Nash equilibria.
Finally, we tackle interaction domains with inherently non-smooth dynamics, such as multi-agent manipulation, where constraints are not continuously differentiable. We propose a data-driven approach leveraging probabilistic inference and generative diffusion models. This blends learning from single-agent demonstrations with reasoning about joint multi-agent costs, discovering collaborative strategies without requiring massive multi-agent datasets.
In summary, this dissertation advances interactive motion planning by equipping robots to accurately infer intents, act safely under uncertainty, and navigate complex interactions. These algorithmic contributions are extensively validated via simulation and ground robots across autonomous driving, mobile navigation, and multi-agent manipulation, accompanied by open-source libraries to accelerate future research.
We propose a novel distributed NMPC approach for navigation in tight environments. The goal is to enable coordinated motion planning for multiple autonomous vehicles in dense traffic scenarios. This can be easily modelled with centralised formulations, however, considering the scale of the road network they become computationally intractable as the number of agents grows. In distributed approaches instead each vehicle solves its local part of the problem which scales linearly and is therefore better suited for these kind of environments. So building on an existing distributed method we enhance its structure to improve scalability, safety, and performance in cooperative autonomous driving tasks. The research is guided by three objectives: (RO1) to reproduce a known distributed baseline algorithm using the DART (Delft’s Autonomous-driving Robotic Testbed) vehicle model \cite{lyons_dart_2024}, the Model Predictive Contouring Control (MPCC) \cite{lam_model_2010} tracking objective, and the ACADOS solver framework \cite{verschueren_acados_2020}; (RO2) to develop a novel distributed MPC-based algorithm that improves inter-vehicle spacing and tracking performance while generalizing beyond predefined reference trajectories; and (RO3) to prepare the algorithm for validation on a real-world robotic testbed.
The primary contributions of this work include: (C1) successful reproduction of the distributed baseline using open-source tools and realistic vehicle modelling; (C2) development of two enhanced distributed algorithms, Distributed Model Predictive Contouring Control with Relaxed Collision Avoidance (DMPC-RCA) and DMPC-RCA with consensus (DMPC-RCA-C), that demonstrate superior performance in tracking accuracy and safety margins; and (C3) integration of these algorithms with the DART hardware platform for future experimental validation. The repository of these approaches can be found in https://github.com/HubertVisser/multi-vehicle-coordination-algorithm.git
Performance is evaluated in two representative scenarios, a merging situation and a T-junction, with the centralised NMPC approach serving as the performance benchmark. To evaluate the performance of the proposed designs, three key metrics are used: accumulated tracking cost, computation time, and minimum inter-agent distance. The results show that both proposed methods achieve tracking costs comparable to the centralised controller, while significantly outperforming the distributed baseline method. Notably, the inclusion of a consensus term yields no substantial improvement in performance over the non-consensus version.
To conclude, the proposed approaches offer strong potential for scalable, safe, and efficient multi-agent motion planning, moving one step closer to the deployment of fully cooperative autonomous driving on public roads.
Github repository for the Master Thesis - https://github.com/HubertVisser/multi-vehicle-coordination-algorithm.git ...
We propose a novel distributed NMPC approach for navigation in tight environments. The goal is to enable coordinated motion planning for multiple autonomous vehicles in dense traffic scenarios. This can be easily modelled with centralised formulations, however, considering the scale of the road network they become computationally intractable as the number of agents grows. In distributed approaches instead each vehicle solves its local part of the problem which scales linearly and is therefore better suited for these kind of environments. So building on an existing distributed method we enhance its structure to improve scalability, safety, and performance in cooperative autonomous driving tasks. The research is guided by three objectives: (RO1) to reproduce a known distributed baseline algorithm using the DART (Delft’s Autonomous-driving Robotic Testbed) vehicle model \cite{lyons_dart_2024}, the Model Predictive Contouring Control (MPCC) \cite{lam_model_2010} tracking objective, and the ACADOS solver framework \cite{verschueren_acados_2020}; (RO2) to develop a novel distributed MPC-based algorithm that improves inter-vehicle spacing and tracking performance while generalizing beyond predefined reference trajectories; and (RO3) to prepare the algorithm for validation on a real-world robotic testbed.
The primary contributions of this work include: (C1) successful reproduction of the distributed baseline using open-source tools and realistic vehicle modelling; (C2) development of two enhanced distributed algorithms, Distributed Model Predictive Contouring Control with Relaxed Collision Avoidance (DMPC-RCA) and DMPC-RCA with consensus (DMPC-RCA-C), that demonstrate superior performance in tracking accuracy and safety margins; and (C3) integration of these algorithms with the DART hardware platform for future experimental validation. The repository of these approaches can be found in https://github.com/HubertVisser/multi-vehicle-coordination-algorithm.git
Performance is evaluated in two representative scenarios, a merging situation and a T-junction, with the centralised NMPC approach serving as the performance benchmark. To evaluate the performance of the proposed designs, three key metrics are used: accumulated tracking cost, computation time, and minimum inter-agent distance. The results show that both proposed methods achieve tracking costs comparable to the centralised controller, while significantly outperforming the distributed baseline method. Notably, the inclusion of a consensus term yields no substantial improvement in performance over the non-consensus version.
To conclude, the proposed approaches offer strong potential for scalable, safe, and efficient multi-agent motion planning, moving one step closer to the deployment of fully cooperative autonomous driving on public roads.
Github repository for the Master Thesis - https://github.com/HubertVisser/multi-vehicle-coordination-algorithm.git
Imitation learning for an ASV path planner in complex marine environments
A feasibility study
Beyond Proportional Navigation
Deep Reinforcement Learning for Robust Drone Interception
Github Repository of the code used for the thesis: https://github.com/maximecapelle/quadintercept_drl ...
Github Repository of the code used for the thesis: https://github.com/maximecapelle/quadintercept_drl
A Novel Approach to Mechanical Tracker Calibration for Surgical Navigation
On-Site Artefact-Based Calibration and Validation
Rule-compliant and Fault-Tolerant Motion Planning
With Application to Autonomous Surface Vehicles
This thesis focuses on improving simulation tools and methodologies to enhance the efficiency and effectiveness of learning-based approaches in robotics. The work addresses key trade-offs between flexibility, speed, and accuracy in robotic simulations, which are critical for successfully transferring learned policies from simulation to real-world environments. Additionally, it introduces a strategy to improve resilience, ensuring that learned behaviors are robust to irrelevant and unknown dynamics.
By tackling these challenges, this thesis provides insights into the design of effective robotic simulators and presents contributions that help bridge the gap between simulated and real-world robotic learning. ...
This thesis focuses on improving simulation tools and methodologies to enhance the efficiency and effectiveness of learning-based approaches in robotics. The work addresses key trade-offs between flexibility, speed, and accuracy in robotic simulations, which are critical for successfully transferring learned policies from simulation to real-world environments. Additionally, it introduces a strategy to improve resilience, ensuring that learned behaviors are robust to irrelevant and unknown dynamics.
By tackling these challenges, this thesis provides insights into the design of effective robotic simulators and presents contributions that help bridge the gap between simulated and real-world robotic learning.
Moving towards this goal, this thesis addresses two core problems. First, local motion planning must carefully account for information gained from sensor observations as well as collision avoidance and the robot’s dynamics while moving through cluttered, unknown areas. Second, global exploration planning must strategically select where in the environment to explore to find the target quickly - especially when the environment is large or complex. Given that human operators often possess semantic knowledge about likely target locations, we hypothesize that incorporating such guidance by observed semantic features (e.g., object or room types) into the exploration planning is crucial for time-efficient autonomous search. We address these two core problems by making the following contributions.
The first contribution of the thesis is an informative local motion planning approach
that generates safe, collision-free trajectories around obstacles while minimizing uncertainty about the target locations. The critical challenge is to achieve computationally efficient planning of trajectories that maximize information gain under the robot’s kinodynamic constraints. In the proposed approach, a model predictive control (MPC) motion planner is guided by a learned viewpoint policy. The policy is trained via deep reinforcement learning (DRL) to maximize long-term information gain by providing a local subgoal to the MPC. The MPC follows the subgoal and ensures that the motion plan remains feasible and collision-free. Therefore, the robot can rapidly replan safe and informative local trajectories online. Simulation experiments demonstrate that the method achieves competitive performance in locating targets compared to a computationally expensive state-of-the-art planner using Monte Carlo Tree Search (MCTS), while allowing for significantly faster execution and replanning.
While local informative planning is crucial for exploring cluttered spaces, it often be-
haves myopically and inefficiently with respect to large and complex environments. Therefore, the second contribution introduces a global target search planner that balances directed search towards semantically promising areas with complete search space coverage. This planner extends the idea of frontier exploration - focusing observations on the boundaries between explored and unexplored regions - to target search, where different frontiers are assigned a semantic priority. This priority represents the semantic relationships between the target and nearby objects. To minimize target search time, the target search planner schedules high-priority frontiers earlier by solving a custom combinatorial optimization problem to determine the visitation order. By integrating coverage gains into the frontier priorities, the planner ensures that the robot explores the environment efficiently while focusing on semantically relevant areas. We demonstrate this approach in two studies outlined below.
Large, high-quality datasets for learning target-specific semantic relationships are
scarce in many real-world scenarios, especially in search and rescue. The third contribution addresses this limitation by proposing a method to learn semantic priority models from expert feedback. Rather than collecting massive amounts of labeled data, the approach exploits an expert operator’s sparse guidance inputs in a few target search scenarios. This expert guidance selects a frontier to explore next, which is stored in a training dataset together with the frontier’s semantic features. An expert model is then trained to approximate a priority function that predicts how relevant each frontier is for the expert. By incorporating this learned priority function into the global target search planner, the robot can autonomously prioritize semantically relevant areas according to the expert’s semantic knowledge. Experiments show that using a small number of expert demonstrations is sufficient for the robot to significantly improve its search efficiency and reduce travel distance until the target is found.
Lastly, the thesis extends semantic target search to three-dimensional environments
by integrating it into a 3D planning pipeline for micro aerial vehicles (MAV). The pipeline first detects objects in the environment using onboard vision and associates them with priority values computed from pre-trained large language model (LLM) embeddings. These priorities are then propagated into frontiers in a 3D voxel map, indicating frontier regions that are most likely to contain the target. This enables the evaluation of frontier viewpoints for their information gain that accounts for both semantic priority and volumetric coverage. The viewpoint gains are then used in the combinatorial target search planner to prioritize the viewpoints that most likely lead to the target while ensuring efficient coverage of the environment. By integrating the MAV’s kinodynamic constraints into the planning costs, the system ensures smooth, feasible trajectories in real-time. Simulation studies reveal that semantically guided exploration leads to faster and more reliable target discovery than different purely coverage-based exploration baselines. Experiments with a real MAV in the lab confirm the approach’s ability to autonomously navigate an MAV through complex 3D environments to a target, exploiting semantic cues, maximizing
coverage, and avoiding collisions.
In summary, this thesis demonstrates how planning and learning techniques can be
combined for autonomous target search and exploration. These techniques enable mobile robots to navigate unknown environments efficiently and safely while searching for targets and collecting required information. Crucially, our proposed method for semantically guided frontier planning bridges the gap between recent learning-based navigation approaches and established planning-based approaches suitable for real-world robotic systems. By integrating semantic knowledge into robotic exploration, the proposed methods can reduce human operator cognitive load and, therefore, facilitate robot deployment in scenarios such as search and rescue or reconnaissance missions. ...
Moving towards this goal, this thesis addresses two core problems. First, local motion planning must carefully account for information gained from sensor observations as well as collision avoidance and the robot’s dynamics while moving through cluttered, unknown areas. Second, global exploration planning must strategically select where in the environment to explore to find the target quickly - especially when the environment is large or complex. Given that human operators often possess semantic knowledge about likely target locations, we hypothesize that incorporating such guidance by observed semantic features (e.g., object or room types) into the exploration planning is crucial for time-efficient autonomous search. We address these two core problems by making the following contributions.
The first contribution of the thesis is an informative local motion planning approach
that generates safe, collision-free trajectories around obstacles while minimizing uncertainty about the target locations. The critical challenge is to achieve computationally efficient planning of trajectories that maximize information gain under the robot’s kinodynamic constraints. In the proposed approach, a model predictive control (MPC) motion planner is guided by a learned viewpoint policy. The policy is trained via deep reinforcement learning (DRL) to maximize long-term information gain by providing a local subgoal to the MPC. The MPC follows the subgoal and ensures that the motion plan remains feasible and collision-free. Therefore, the robot can rapidly replan safe and informative local trajectories online. Simulation experiments demonstrate that the method achieves competitive performance in locating targets compared to a computationally expensive state-of-the-art planner using Monte Carlo Tree Search (MCTS), while allowing for significantly faster execution and replanning.
While local informative planning is crucial for exploring cluttered spaces, it often be-
haves myopically and inefficiently with respect to large and complex environments. Therefore, the second contribution introduces a global target search planner that balances directed search towards semantically promising areas with complete search space coverage. This planner extends the idea of frontier exploration - focusing observations on the boundaries between explored and unexplored regions - to target search, where different frontiers are assigned a semantic priority. This priority represents the semantic relationships between the target and nearby objects. To minimize target search time, the target search planner schedules high-priority frontiers earlier by solving a custom combinatorial optimization problem to determine the visitation order. By integrating coverage gains into the frontier priorities, the planner ensures that the robot explores the environment efficiently while focusing on semantically relevant areas. We demonstrate this approach in two studies outlined below.
Large, high-quality datasets for learning target-specific semantic relationships are
scarce in many real-world scenarios, especially in search and rescue. The third contribution addresses this limitation by proposing a method to learn semantic priority models from expert feedback. Rather than collecting massive amounts of labeled data, the approach exploits an expert operator’s sparse guidance inputs in a few target search scenarios. This expert guidance selects a frontier to explore next, which is stored in a training dataset together with the frontier’s semantic features. An expert model is then trained to approximate a priority function that predicts how relevant each frontier is for the expert. By incorporating this learned priority function into the global target search planner, the robot can autonomously prioritize semantically relevant areas according to the expert’s semantic knowledge. Experiments show that using a small number of expert demonstrations is sufficient for the robot to significantly improve its search efficiency and reduce travel distance until the target is found.
Lastly, the thesis extends semantic target search to three-dimensional environments
by integrating it into a 3D planning pipeline for micro aerial vehicles (MAV). The pipeline first detects objects in the environment using onboard vision and associates them with priority values computed from pre-trained large language model (LLM) embeddings. These priorities are then propagated into frontiers in a 3D voxel map, indicating frontier regions that are most likely to contain the target. This enables the evaluation of frontier viewpoints for their information gain that accounts for both semantic priority and volumetric coverage. The viewpoint gains are then used in the combinatorial target search planner to prioritize the viewpoints that most likely lead to the target while ensuring efficient coverage of the environment. By integrating the MAV’s kinodynamic constraints into the planning costs, the system ensures smooth, feasible trajectories in real-time. Simulation studies reveal that semantically guided exploration leads to faster and more reliable target discovery than different purely coverage-based exploration baselines. Experiments with a real MAV in the lab confirm the approach’s ability to autonomously navigate an MAV through complex 3D environments to a target, exploiting semantic cues, maximizing
coverage, and avoiding collisions.
In summary, this thesis demonstrates how planning and learning techniques can be
combined for autonomous target search and exploration. These techniques enable mobile robots to navigate unknown environments efficiently and safely while searching for targets and collecting required information. Crucially, our proposed method for semantically guided frontier planning bridges the gap between recent learning-based navigation approaches and established planning-based approaches suitable for real-world robotic systems. By integrating semantic knowledge into robotic exploration, the proposed methods can reduce human operator cognitive load and, therefore, facilitate robot deployment in scenarios such as search and rescue or reconnaissance missions.
Collision avoidance of autonomous surface vessels considering proactive COLREG compliance
How the concept of the ship domain and arena can be applied in a collision avoidance framework of ASVs
include the recommendation of a small fleet size of up to six UAVs, with a flight speed of 10 m/s at an altitude of 100 meters, balancing costs, performance and safety. The algorithm demonstrated performance equal to state-of-the-art reinforcement learning techniques
while offering advantages in explainability. Additionally, the algorithm has been successfully validated in a lab environment, demonstrating its potential as a practical and cost-effective solution for wildfire monitoring. This work brings a fire monitoring system closer to real-world implementation and will possibly help fight wildfires effectively. ...
include the recommendation of a small fleet size of up to six UAVs, with a flight speed of 10 m/s at an altitude of 100 meters, balancing costs, performance and safety. The algorithm demonstrated performance equal to state-of-the-art reinforcement learning techniques
while offering advantages in explainability. Additionally, the algorithm has been successfully validated in a lab environment, demonstrating its potential as a practical and cost-effective solution for wildfire monitoring. This work brings a fire monitoring system closer to real-world implementation and will possibly help fight wildfires effectively.
Probabilistic Motion Planning in Dynamic Environments
Parallelizable Scenario-Based Trajectory Optimization with Global Guidance
Traditional motion planners for dynamic environments have two key limitations that this thesis aims to address. First, they assume that their model of dynamic obstacles (e.g., humans) is exactly correct, capturing it with a single deterministic prediction. In practice, the robot cannot observe human intentions and must account for its uncertainty about the human's future behavior. Second, motion planners usually compute a single trajectory around an obstacle as a result of previously taken decisions without exploring alternative options. They react slowly or even fail to find a solution when unpredicted changes make this path undesirable. This results in poor planning performance in dynamic environments.
The goal of this thesis is to develop motion planners that account for the uncertainty of human motion predictions and that are consistent and robust in their decision-making in order to deal with unpredicted changes in dynamic environments. To accomplish this goal, this thesis proposes two motion planning frameworks: scenario-based and topology-driven trajectory optimization.
The first contribution of this thesis is Scenario-based Model Predictive Contouring Control (S-MPCC), a real-time capable probabilistic planning framework that incorporates any uncertainty associated with the motion predictions of dynamic obstacles. Contrary to existing methods that only account for small variations around a single predicted trajectory (unimodal uncertainty), the proposed planner accounts for multiple possible trajectories (multi-modal uncertainty). The planner therefore safely accounts for several outcomes, for instance, to express that a pedestrian may or may not cross in front of the robot.
S-MPCC bounds the probability of collision in each time step with all obstacles through Chance-Constrained Optimization (CCO). The CCO is reformulated as an optimization without uncertainty by sampling trajectories from the predicted distribution, known as scenarios. Each scenario represents a possible position of all obstacles in one time step, and the planner avoids collisions with all scenarios. This Scenario Program (SP), through a tailored linearization, can be solved efficiently online. S-MPCC therefore plans probabilistic safe trajectories independent of the underlying distribution of the uncertainty.
S-MPCC considers the probability of collision separately for each time instance in the planned trajectory. The second contribution of this thesis, Safe Horizon Model Predictive Control (SH-MPC), builds on S-MPCC to constrain the joint probability of collision with all obstacles over the duration of the planned trajectory. Existing methods that separately constrain the probability of collision in each time step (temporal marginal) and with each obstacle (obstacle marginal) lead to overly cautious motion planning when safety constraints are enforced. SH-MPC formulates a single chance constraint to bound the overall probability of collision. This CCO is reformulated as an SP where each scenario represents a possible trajectory for all obstacles. To certify the joint probability of collision with the SP, the number of scenarios that affect the motion plan needs to be identified. SH-MPC estimates this quantity at a negligible computational cost during optimization. Consequently, SH-MPC plans trajectories in real-time under generic uncertainties that are less cautious than existing methods without compromising on safety.
The probabilistic safety of S-MPCC and SH-MPC is linked to the underlying accuracy of the prediction model of the obstacles that provide the scenarios. As a third contribution, a joint prediction and planning framework, Partitioned Scenario Replay (PSR), is proposed that replays past observations of human motion as scenarios for scenario-based planning. PSR does not fit a distribution on observed data but directly uses the data as empirical evidence of the underlying uncertainty and thereby provides a real-world safety guarantee.
A key limitation of the developed scenario-based planners and other optimization-based planners is that they locally refine an initial trajectory. This initial trajectory largely determines the quality of the final trajectory, while it does not consider other options. The fourth contribution of this thesis is Topology-driven Model Predictive Control (T-MPC) that concurrently optimizes trajectories, each attempting a different way to pass the obstacles. T-MPC is composed of a guidance planner and several parallel local planners. The guidance planner identifies guidance trajectories for several distinct maneuvers, relying on results from topology to distinguish trajectories. Each local planner is composed of an existing optimization-based planner (e.g., a scenario-based planner) and an additional set of constraints that are derived from one of the guidance trajectories. The guidance trajectories are optimized by the local planners in parallel, and the results are compared to determine which trajectory gets executed. T-MPC is faster, more consistent, and safer than several state-of-the-art planners. Contrary to similar existing work, it does not rely on an explicit lane structure and therefore enables both urban driving and mobile robotic applications.
The motion planners developed in this thesis are extensively validated in simulation and in experiments with a small-scale mobile robot and a full-scale self-driving vehicle navigating among pedestrians. The robot-agnostic implementation of the proposed planners that were developed for this thesis is available open source. ...
Traditional motion planners for dynamic environments have two key limitations that this thesis aims to address. First, they assume that their model of dynamic obstacles (e.g., humans) is exactly correct, capturing it with a single deterministic prediction. In practice, the robot cannot observe human intentions and must account for its uncertainty about the human's future behavior. Second, motion planners usually compute a single trajectory around an obstacle as a result of previously taken decisions without exploring alternative options. They react slowly or even fail to find a solution when unpredicted changes make this path undesirable. This results in poor planning performance in dynamic environments.
The goal of this thesis is to develop motion planners that account for the uncertainty of human motion predictions and that are consistent and robust in their decision-making in order to deal with unpredicted changes in dynamic environments. To accomplish this goal, this thesis proposes two motion planning frameworks: scenario-based and topology-driven trajectory optimization.
The first contribution of this thesis is Scenario-based Model Predictive Contouring Control (S-MPCC), a real-time capable probabilistic planning framework that incorporates any uncertainty associated with the motion predictions of dynamic obstacles. Contrary to existing methods that only account for small variations around a single predicted trajectory (unimodal uncertainty), the proposed planner accounts for multiple possible trajectories (multi-modal uncertainty). The planner therefore safely accounts for several outcomes, for instance, to express that a pedestrian may or may not cross in front of the robot.
S-MPCC bounds the probability of collision in each time step with all obstacles through Chance-Constrained Optimization (CCO). The CCO is reformulated as an optimization without uncertainty by sampling trajectories from the predicted distribution, known as scenarios. Each scenario represents a possible position of all obstacles in one time step, and the planner avoids collisions with all scenarios. This Scenario Program (SP), through a tailored linearization, can be solved efficiently online. S-MPCC therefore plans probabilistic safe trajectories independent of the underlying distribution of the uncertainty.
S-MPCC considers the probability of collision separately for each time instance in the planned trajectory. The second contribution of this thesis, Safe Horizon Model Predictive Control (SH-MPC), builds on S-MPCC to constrain the joint probability of collision with all obstacles over the duration of the planned trajectory. Existing methods that separately constrain the probability of collision in each time step (temporal marginal) and with each obstacle (obstacle marginal) lead to overly cautious motion planning when safety constraints are enforced. SH-MPC formulates a single chance constraint to bound the overall probability of collision. This CCO is reformulated as an SP where each scenario represents a possible trajectory for all obstacles. To certify the joint probability of collision with the SP, the number of scenarios that affect the motion plan needs to be identified. SH-MPC estimates this quantity at a negligible computational cost during optimization. Consequently, SH-MPC plans trajectories in real-time under generic uncertainties that are less cautious than existing methods without compromising on safety.
The probabilistic safety of S-MPCC and SH-MPC is linked to the underlying accuracy of the prediction model of the obstacles that provide the scenarios. As a third contribution, a joint prediction and planning framework, Partitioned Scenario Replay (PSR), is proposed that replays past observations of human motion as scenarios for scenario-based planning. PSR does not fit a distribution on observed data but directly uses the data as empirical evidence of the underlying uncertainty and thereby provides a real-world safety guarantee.
A key limitation of the developed scenario-based planners and other optimization-based planners is that they locally refine an initial trajectory. This initial trajectory largely determines the quality of the final trajectory, while it does not consider other options. The fourth contribution of this thesis is Topology-driven Model Predictive Control (T-MPC) that concurrently optimizes trajectories, each attempting a different way to pass the obstacles. T-MPC is composed of a guidance planner and several parallel local planners. The guidance planner identifies guidance trajectories for several distinct maneuvers, relying on results from topology to distinguish trajectories. Each local planner is composed of an existing optimization-based planner (e.g., a scenario-based planner) and an additional set of constraints that are derived from one of the guidance trajectories. The guidance trajectories are optimized by the local planners in parallel, and the results are compared to determine which trajectory gets executed. T-MPC is faster, more consistent, and safer than several state-of-the-art planners. Contrary to similar existing work, it does not rely on an explicit lane structure and therefore enables both urban driving and mobile robotic applications.
The motion planners developed in this thesis are extensively validated in simulation and in experiments with a small-scale mobile robot and a full-scale self-driving vehicle navigating among pedestrians. The robot-agnostic implementation of the proposed planners that were developed for this thesis is available open source.
Trajectory planning and following in urban environments
To reduce traffic accidents involving vulnerable road users