Color Equivariant Object Detection
Integrating Color Equivariance into the Faster R-CNN Object Detection Architecture
D.L. Lieuw A Soe (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Jan van Gemert – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
This paper studies the effect of integrating color equivariance and invariance into object detection, in particular into the Faster R-CNN architecture. To better understand the influence of this integration, we introduce modifications to the traditional convolutional layers of the standard Faster R-CNN model. By employing group theory in a similar way as Group Equivariant Convolutional Networks (G-CNNs), we replace the convolution operations with operations that are equivariant to hue transformations. The modified models are tested on several different datasets in which variations and imbalances in color distributions are present. Our toy experiments demonstrate that the replacement of the convolutional layers can lead to significant improvements in performance, especially in scenarios where the data contains a substantial amount of color variation. The findings of this work suggest that incorporating color equivariance and invariance into the design of convolutional layers can enhance object detection, proposing interesting possibilities for future research on real-world tasks.