EValueAction

a proposal for policy evaluation in simulation to support interactive imitation learning

Conference Paper (2023)
Author(s)

Fiorella Sibona (Politecnico di Torino)

Jelle Luijkx (TU Delft - Learning & Autonomous Control)

Bas Van Der Heijden (TU Delft - Learning & Autonomous Control)

Laura Ferranti (TU Delft - Learning & Autonomous Control)

Marina Indri (Politecnico di Torino)

DOI related publication
https://doi.org/10.1109/INDIN51400.2023.10218251 Final published version
Publication Year
2023
Language
English
ISBN (electronic)
978-1-6654-9313-0
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The up-and-coming concept of Industry 5.0 foresees human-centric, flexible production lines in which collaborative robots support the human workforce. To allow seamless collaboration between intelligent robots and human workers, designing solutions for non-expert users is crucial. Learning from demonstration has emerged as the enabling approach to address this problem. However, more focus should be put on finding safe solutions that optimize the cost associated with the demonstration collection process. This paper introduces a preliminary outline of a system, named EValueAction (EVA), designed to assist the human in collecting interactive demonstrations, taking advantage of simulation to safely avoid failures. A policy is pre-trained with human demonstrations and, where needed, new informative data are interactively gathered and aggregated to iteratively improve the initial policy. A trial case study further reinforces the relevance of the work by demonstrating the crucial role of informative demonstrations for generalization.
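
The loop described in the abstract (pre-train on demonstrations, evaluate in simulation, request new demonstrations only where the policy fails, aggregate, repeat) can be illustrated with a toy sketch. This is not the EVA implementation: the 1-D track, the tabular policy, and the names GOAL, MAX_STEPS, expert_action, simulate and improve_policy are all hypothetical choices made for illustration, and the "expert" function stands in for a human demonstrator.

```python
GOAL = 3        # hypothetical target cell on a 1-D track
MAX_STEPS = 10  # rollout horizon of the toy simulator


def expert_action(state):
    """Stand-in for the human demonstrator: step one cell toward GOAL."""
    if state == GOAL:
        return 0
    return 1 if state < GOAL else -1


def simulate(policy, start):
    """Roll out a tabular policy in simulation.

    Returns (success, visited_states). States missing from the policy
    table map to action 0, so the agent stalls where it lacks data.
    """
    state, visited = start, []
    for _ in range(MAX_STEPS):
        if state == GOAL:
            return True, visited
        visited.append(state)
        state += policy.get(state, 0)
    return state == GOAL, visited


def improve_policy(initial_demos, start_states, max_rounds=10):
    """Evaluate in simulation, then aggregate expert labels on failures."""
    policy = dict(initial_demos)  # "pre-training" on the initial demos
    failures_per_round = []
    for _ in range(max_rounds):
        failed_visits, n_fail = [], 0
        for start in start_states:
            ok, visited = simulate(policy, start)
            if not ok:
                n_fail += 1
                failed_visits.extend(visited)
        failures_per_round.append(n_fail)
        if n_fail == 0:
            break
        # Query the demonstrator only on states seen in failed rollouts.
        for state in failed_visits:
            policy[state] = expert_action(state)
    return policy, failures_per_round


# Initial demonstration covers only the right-hand side of the track.
initial_demos = {6: -1, 5: -1, 4: -1}
policy, failures = improve_policy(initial_demos, start_states=[0, 6])
print(failures)  # [1, 1, 1, 0]
```

In this toy setting the initial demonstrations never visit the cells left of the goal, so the policy stalls there until labels for exactly those cells are added, one newly reached cell per round. Failures occur only in the simulator, and the demonstrator is queried only for states where the current policy was shown to fail.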

Files

EValueAction_a_proposal_for_po... (pdf)
(pdf | 2.67 MB)
- Embargo expired on 22-02-2024
License info not available