Visio-verbal teleimpedance interface: enabling semi-autonomous control of physical interaction via eye tracking and speech

None, None; None, None; None, None

Visio-verbal teleimpedance interface: enabling semi-autonomous control of physical interaction via eye tracking and speech

Journal Article (2026)

Author(s)

H.A. Jekel (Student TU Delft)

A. Díaz Rosales (CERN, TU Delft - Mechanical Engineering)

L. Peternel (TU Delft - Mechanical Engineering)

Research Group

Human-Robot Interaction

Gaze tracking Teleoperation Impedance control Verbal interaction Vision-language model

DOI related publication

https://doi.org/10.3389/frobt.2026.1749105 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:93746fbc-3766-4338-b746-04acacb0bedd

More Info

expand_more

Publication Year

2026

Language

English

Research Group

Human-Robot Interaction

Journal title

Frontiers In Robotics and AI

Volume number

13

Article number

1749105

Downloads counter

69

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The paper presents a visio-verbal teleimpedance interface for commanding 3D stiffness ellipsoids to the remote robot with a combination of the operator’s gaze and verbal interaction. The gaze is detected by an eye-tracker, allowing the system to understand the context in terms of what the operator is currently looking at in the scene. Along with verbal interaction, a Vision-Language Model (VLM) processes this information, enabling the operator to communicate their intended action or provide corrections. Based on these inputs, the interface can then generate appropriate stiffness matrices for different physical interaction actions. To validate the proposed visio-verbal teleimpedance interface, we conducted a series of experiments on a setup including a Force Dimension Sigma.7 haptic device to control the motion of the remote Kuka LBR iiwa robotic arm. The human operator’s gaze is tracked by Tobii Pro Glasses 2, while human verbal commands are processed by a VLM using GPT-4o. The first experiment explored the optimal prompt configuration for the interface. The second and third experiments demonstrated different functionalities of the interface on a slide-in-the-groove task

Files

Frobt-13-1749105.pdf

(pdf | 6.99 Mb)