How Do Neural Networks Estimate Optical Flow A Neuropsychology-Inspired Study

Journal Article (2022)
Author(s)

D.B. de Jong (TU Delft - Education AE)

Federico Paredes-Vallés (TU Delft - Control & Simulation)

Research Group
Control & Simulation
Copyright
© 2022 D.B. de Jong, Federico Paredes-Vallés, G.C.H.E. de Croon
DOI related publication
https://doi.org/10.1109/TPAMI.2021.3083538
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 D.B. de Jong, Federico Paredes-Vallés, G.C.H.E. de Croon
Research Group
Control & Simulation
Issue number
11
Volume number
44
Pages (from-to)
8290-8305
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

End-to-end trained convolutional neural networks have led to a breakthrough in optical flow estimation. The most recent advances focus on improving the optical flow estimation by improving the architecture and setting a new benchmark on the publicly available MPI-Sintel dataset. Instead, in this article, we investigate how deep neural networks estimate optical flow. A better understanding of how these networks function is important for (i) assessing their generalization capabilities to unseen inputs, and (ii) suggesting changes to improve their performance. For our investigation, we focus on FlowNetS, as it is the prototype of an encoder-decoder neural network for optical flow estimation. Furthermore, we use a filter identification method that has played a major role in uncovering the motion filters present in animal brains in neuropsychological research. The method shows that the filters in the deepest layer of FlowNetS are sensitive to a variety of motion patterns. Not only do we find translation filters, as demonstrated in animal brains, but thanks to the easier measurements in artificial neural networks, we even unveil dilation, rotation, and occlusion filters. Furthermore, we find similarities in the refinement part of the network and the perceptual filling-in process which occurs in the mammal primary visual cortex.

Files

How_Do_Neural_Networks_Estimat... (pdf)
(pdf | 3.83 Mb)
- Embargo expired in 01-07-2023
License info not available