ARMCHAIR
Integrated Inverse Reinforcement Learning and Model Predictive Control for Human-Robot Collaboration
Angelo Caregnato-Neto (Instituto Tecnológico de Aeronáutica)
L. Cavalcante Siebert (TU Delft - Interactive Intelligence)
A. Zgonnikov (TU Delft - Human-Robot Interaction)
Marcos R.O.A. Maximo (Instituto Tecnológico de Aeronáutica)
Rubens J.M. Afonso (Instituto Tecnológico de Aeronáutica)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
One of the key issues in human-robot collaboration is the development of computational models that allow robots to predict and adapt to human behavior. Much progress has been achieved in developing such models, as well as control techniques that address the autonomy problems of motion planning and decision-making in robotics. However, the integration of computational models of human behavior with such control techniques still poses a major challenge, resulting in a bottleneck for efficient collaborative human-robot teams. In this context, we present a novel architecture for human-robot collaboration: Adaptive Robot Motion for Collaboration with Humans using Adversarial Inverse Reinforcement learning (ARMCHAIR). Our solution leverages adversarial inverse reinforcement learning and model predictive control to compute optimal trajectories and decisions for a mobile multi-robot system that collaborates with a human in an exploration task. During the mission, ARMCHAIR operates without human intervention, autonomously identifying the necessity to support and acting accordingly. Our approach also explicitly addresses the network connectivity requirement of the human-robot team. Extensive simulation-based evaluations demonstrate that ARMCHAIR allows a group of robots to safely support a simulated human in an exploration scenario, preventing collisions and network disconnections, and improving the overall performance of the task.
Files
File under embargo until 20-04-2026