Scale Learning in Scale-Equivariant Convolutional Networks


Abstract

In real-life scenarios, objects of the same category vary considerably in size and are not placed at a fixed distance from the camera. As a result, objects take up an arbitrary number of pixels in the image. Vanilla CNNs are by design only translation-equivariant and thus have to learn separate filters for scaled variants of the same objects. Recently, scale-equivariant approaches have been developed that share features across a set of pre-determined fixed scales, which we refer to as the internal scales. Existing work gives little guidance on how best to choose the internal scales when the underlying distribution of object sizes in the dataset, the scale distribution, is known. In this work, we develop a model of how the features at different internal scales are used for samples containing differently-sized objects. The proposed model returns internal scales comparable to the best-performing ones for data scale distributions of various widths. In most cases, however, the scale distribution is not known. In contrast to previous scale-equivariant methods, we do not treat the internal scales as a fixed set but directly optimise them with respect to the loss, removing the need for prior knowledge about the data scale distribution. We parameterise the internal scales by the smallest scale, which we refer to as σ_basis, and the Internal Scale Range (ISR), which models the ratio between the smallest and largest scale. By varying the ISR, we learn the range of scales the model is equivariant to. We show that our method can learn the internal scales on various data scale distributions and adapts the internal scales better than other parameterisations. Finally, we compare our scale-learning approach and the other parameterisations to current state-of-the-art scale-equivariant approaches on the MNIST-Scale dataset.
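To make the σ_basis/ISR parameterisation concrete, the sketch below shows one plausible reading of it: both quantities are kept positive via a log-space parameterisation and optimised by gradient descent alongside the network weights. The geometric spacing of the intermediate scales, the class name, and all initial values are assumptions for illustration; the abstract does not specify these details.

```python
import torch

class LearnableScales(torch.nn.Module):
    """Hypothetical sketch of learnable internal scales (not the authors' code).

    The internal scales are parameterised by the smallest scale sigma_basis
    and the Internal Scale Range (ISR), the ratio between the largest and
    smallest scale. Both are learned directly with respect to the loss.
    """

    def __init__(self, num_scales=4, init_sigma=1.0, init_isr=2.0):
        super().__init__()
        # Log-space parameterisation keeps both quantities strictly positive.
        self.log_sigma_basis = torch.nn.Parameter(torch.log(torch.tensor(init_sigma)))
        self.log_isr = torch.nn.Parameter(torch.log(torch.tensor(init_isr)))
        self.num_scales = num_scales

    def forward(self):
        sigma_basis = self.log_sigma_basis.exp()
        isr = self.log_isr.exp()
        # Assumed geometric spacing: sigma_i = sigma_basis * isr**(i / (N - 1)),
        # so the scales run from sigma_basis to sigma_basis * isr.
        exponents = torch.arange(self.num_scales) / (self.num_scales - 1)
        return sigma_basis * isr ** exponents
```

Under these assumptions, `LearnableScales(num_scales=4)()` yields four scales spanning [σ_basis, σ_basis · ISR], and the gradients flowing into `log_sigma_basis` and `log_isr` shift and stretch that range during training.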
