Walk Along: An Experiment on Controlling the Mobile Robot ‘Spot’ with Voice and Gestures

Journal Article (2025)
Author(s)

Renchi Zhang (TU Delft - Human-Robot Interaction)

Jesse van der Linden (Student TU Delft)

Dimitra Dodou (TU Delft - Medical Instruments & Bio-Inspired Technology)

Harleigh C. Seyffert (TU Delft - Ship Hydromechanics)

Y. B. Eisma (TU Delft - Human-Robot Interaction)

Joost C.F. Winter (TU Delft - Human-Robot Interaction)

Research Group
Human-Robot Interaction
DOI related publication
https://doi.org/10.1145/3729540
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Human-Robot Interaction
Issue number
4
Volume number
14
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Robots are becoming more capable and can autonomously perform tasks such as navigating between locations. However, human oversight remains crucial. This study compared two touchless methods for directing mobile robots: voice control and gesture control, to investigate the efficiency of these methods and the preference of users. We tested these methods in two conditions: one in which participants remained stationary and one in which they walked freely alongside the robot. We hypothesized that walking alongside the robot would result in higher intuitiveness ratings and improved task performance, based on the idea that walking promotes spatial alignment and reduces the effort required for mental rotation. In a 2×2 within-subject design, 218 participants guided the quadruped robot Spot along a circuitous route with multiple 90° turns using rotate left, rotate right, and walk forward commands. After each trial, participants rated the intuitiveness of the command mapping, while post-experiment interviews were used to gather the participants’ preferences. Results showed that voice control combined with walking with Spot was the most favored and intuitive, whereas gesture control while standing caused confusion for left/right commands. Nevertheless, 29% of participants preferred gesture control, citing increased task engagement and visual congruence as reasons. An odometry-based analysis revealed that participants often followed behind Spot, particularly in the gesture control condition, when they were allowed to walk. In conclusion, voice control with walking produced the best outcomes. Improving physical ergonomics and adjusting gesture types could make gesture control more effective.