Persistent self-supervised learning

From stereo to monocular vision for obstacle avoidance

Journal Article (2018)

Author(s)

K.G. van Hecke (TU Delft - Control & Simulation)

G. C. H. E. de Croon (TU Delft - Control & Simulation)

L.J.P. van der Maaten (TU Delft - Pattern Recognition and Bioinformatics)

Daniel Hennes (European Space Agency (ESA))

Dario Izzo (European Space Agency (ESA))

DOI related publication

https://doi.org/10.1177/1756829318756355

Robotics, Stereo vision, Monocular depth estimation, Persistent self-supervised learning

To reference this document use:

https://resolver.tudelft.nl/uuid:295dfbba-6e4f-473c-81cd-4f83ae5b9601

More Info

Publication Year

2018

Language

English

Issue number

2

Volume number

10

Pages (from-to)

186-206

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Self-supervised learning is a reliable learning mechanism in which a robot uses an original, trusted sensor cue for training to recognize an additional, complementary sensor cue. For the first time in self-supervised learning, we study how a robot’s learning behavior should be organized so that the robot can keep performing its task if the original cue becomes unavailable. We study this persistent form of self-supervised learning in the context of a flying robot that has to avoid obstacles based on distance estimates from the visual cue of stereo vision. Over time, it also learns to estimate distances based on monocular appearance cues. A strategy is introduced that has the robot switch from flight based on stereo to flight based on monocular vision, with stereo vision used purely as “training wheels” to avoid imminent collisions. This strategy is shown to be an effective approach to the “feedback-induced data bias” problem, which is also experienced in learning from demonstration. Both simulations and real-world experiments with a stereo-vision-equipped ARDrone2 show the feasibility of this approach, with the robot successfully using monocular vision to avoid obstacles in a 5 × 5 m room. The experiments show the potential of persistent self-supervised learning as a robust learning approach to enhance the capabilities of robots. Moreover, the abundant training data coming from the robot’s own sensors make it possible to gather the large data sets necessary for deep learning approaches.

Files

45169434_1756829318756355.pdf

(pdf | 1.87 Mb)