Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning

Conference paper (2022)

Authors

M. Lodel Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

B.F. Ferreira de Brito Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

A. Serra Gomez Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

L. Ferranti Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

R Babuska Czech Technical University, Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

R. Babuska Czech Technical University, Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

J. Alonso-Mora Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

Javier Alonso-Mora Learning & Autonomous Control - Mechanical, Maritime and Materials Engineering

Research Group

Learning & Autonomous Control (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI: https://doi.org/10.1109/ICRA46639.2022.9812190

Safety Navigation Uncertainty Planning Monte Carlo methods Reinforcement learning Trajectory planning

To reference this document use:

http://resolver.tudelft.nl/uuid:df38e036-9dee-4b21-b6d8-a5f823af19a5

More Info

expand_more

Published Date

2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Learning & Autonomous Control

Abstract

Search missions require motion planning and navigation methods for information gathering that continuously replan based on new observations of the robot's surroundings. Current methods for information gathering, such as Monte Carlo Tree Search, are capable of reasoning over long horizons, but they are computationally expensive. An alternative for fast online execution is to train, offline, an information gathering policy, which indirectly reasons about the information value of new observations. However, these policies lack safety guarantees and do not account for the robot dynamics. To overcome these limitations we train an information-aware policy via deep reinforcement learning, that guides a receding-horizon trajectory optimization planner. In particular, the policy continuously recommends a reference viewpoint to the local planner, such that the resulting dynamically feasible and collision-free trajectories lead to observations that maximize the information gain and reduce the uncertainty about the environment. In simulation tests in previously unseen environments, our method consistently outperforms greedy next-best-view policies and achieves competitive performance compared to Monte Carlo Tree Search, in terms of information gains and coverage time, with a reduction in execution time by three orders of magnitude.

Files

Where_to_Look_Next_Learning_Vi... (pdf)

(pdf | 0.556 Mb)

- Embargo expired in 12-01-2023

Unknown license