Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning

Master Thesis (2025)
Author(s)

J. Zeng (TU Delft - Mechanical Engineering)

Contributor(s)

S. Sun – Mentor (TU Delft - Learning & Autonomous Control)

A. Matoses Gimenez – Mentor (TU Delft - Learning & Autonomous Control)

Eugene Vinitsky – Mentor (New York University)

J. Alonso-Mora – Mentor (TU Delft - Learning & Autonomous Control)

Christian Pek – Graduation committee member (TU Delft - Robot Dynamics)

E.J.J. Smeur – Graduation committee member (TU Delft - Control & Simulation)

Faculty
Mechanical Engineering
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
17-07-2025
Awarding Institution
Delft University of Technology
Programme
['Mechanical Engineering']
Sponsors
New York University
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper presents the first decentralized method to enable real-world 6-DoF manipulation of a cable-suspended load using a team of Micro-Aerial Vehicles (MAVs). Our method leverages multi-agent reinforcement learning (MARL) to train an outer-loop control policy for each MAV. Unlike state-of-the-art controllers that utilize a centralized scheme, our policy does not require global states, inter-MAV communications, nor neighboring MAV information. Instead, agents communicate implicitly through load pose observations alone, which enables high scalability and flexibility. It also significantly reduces computing costs during inference time, enabling onboard deployment of the policy. In addition, we introduce a new action space design for the MAVs using linear acceleration and body rates. This choice, combined with a robust low-level controller, enables reliable sim-to-real transfer despite significant uncertainties caused by cable tension during dynamic 3D motion. We validate our method in various real-world experiments, including full-pose control under load model uncertainties, showing setpoint tracking performance comparable to the state-of-the-art centralized method. We also demonstrate cooperation amongst agents with heterogeneous control policies, and robustness to the complete in-flight loss of one MAV. Videos of experiments: https://autonomousrobots.nl/paper_websites/aerial-manipulation-marl

Files

Jack_Zeng_msc_thesis.pdf
(pdf | 5.37 Mb)
- Embargo expired in 01-08-2025
License info not available