Persistent self-supervised learning

From stereo to monocular vision for obstacle avoidance

Journal Article (2018)
Author(s)

K.G. van Hecke (TU Delft - Control & Simulation)

G. C. H. E. de Croon (TU Delft - Control & Simulation)

L.J.P. van der Maaten (TU Delft - Pattern Recognition and Bioinformatics)

Daniel Hennes (European Space Agency (ESA))

Dario Izzo (European Space Agency (ESA))

Copyright
© 2018 K.G. van Hecke, G.C.H.E. de Croon, L.J.P. van der Maaten, Daniel Hennes, Dario Izzo
DOI related publication
https://doi.org/10.1177/1756829318756355
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 K.G. van Hecke, G.C.H.E. de Croon, L.J.P. van der Maaten, Daniel Hennes, Dario Izzo
Issue number
2
Volume number
10
Pages (from-to)
186-206
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Self-supervised learning is a reliable learning mechanism in which a robot uses an original, trusted sensor cue for training to recognize an additional, complementary sensor cue. We study for the first time in self-supervised learning how a robot’s learning behavior should be organized, so that the robot can keep performing its task in the case that the original cue becomes unavailable. We study this persistent form of self-supervised learning in the context of a flying robot that has to avoid obstacles based on distance estimates from the visual cue of stereo vision. Over time it will learn to also estimate distances based on monocular appearance cues. A strategy is introduced that has the robot switch from flight based on stereo to flight based on monocular vision, with stereo vision purely used as “training wheels” to avoid imminent collisions. This strategy is shown to be an effective approach to the “feedback-induced data bias” problem as also experienced in learning from demonstration. Both simulations and real-world experiments with a stereo vision equipped ARDrone2 show the feasibility of this approach, with the robot successfully using monocular vision to avoid obstacles in a 5 × 5 m room. The experiments show the potential of persistent self-supervised learning as a robust learning approach to enhance the capabilities of robots. Moreover, the abundant training data coming from the own sensors allow to gather large data sets necessary for deep learning approaches.