Do More Elaborate Search Strategies Lead to Better Neural Architecture Search Performance?

Master Thesis (2020)

Author(s)

T. den Ottelander (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Peter A.N. Bosman – Mentor (TU Delft - Algorithmics)

Mathijs M. de Weerdt – Graduation committee member (TU Delft - Algorithmics)

Jan Gemert – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

A. Dushatskiy – Graduation committee member (TU Delft - Algorithmics)

M. Virgolin – Graduation committee member (TU Delft - Algorithmics)

Faculty

Electrical Engineering, Mathematics and Computer Science

Keywords

Optimization, AutoML, Convolutional Neural Network, NAS, Neural Architecture Search, Local Search, Baseline, MO-GOMEA

To reference this document use:

https://resolver.tudelft.nl/uuid:40d97097-b72a-40cb-ad01-78b982a41668

Publication Year

2020

Language

English

Graduation Date

28-10-2020

Awarding Institution

Delft University of Technology

Programme

Computer Science | Data Science and Technology

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Computer vision tasks, such as supervised image classification, are tackled effectively by convolutional neural networks, provided that the architecture, which defines the structure of the network, is set correctly. Neural Architecture Search (NAS) is a relatively young and increasingly popular field concerned with automatically optimizing the architecture of neural networks. Previous work shows that, although a recent trend has been to develop increasingly complex search strategies for NAS, several of these strategies do not significantly outperform simple approaches, such as random sampling from the search space, on single-objective NAS tasks. Additionally, proper ablation studies are often missing. It is therefore uncertain at best which mechanisms an algorithm needs in order to achieve excellent NAS performance. In the first part of this thesis, Local Search (LS) and a differently biased form of random search are proposed for multi-objective (MO) NAS. The multi-objective version of NAS has been studied less, and understanding the trade-off between multiple objectives for architectures is arguably more interesting. We find that very simple algorithms can achieve search performance close to that of state-of-the-art evolutionary algorithms (EAs), while outperforming plain random search. Additionally, we find that the quality of the set of architectures found by LS is similar to that of the sets found by the EAs when compared on test accuracy. Nevertheless, of the compared search strategies, the Multi-Objective Gene-pool Optimal Mixing Evolutionary Algorithm (MO-GOMEA), a state-of-the-art model-based EA, achieves the best performance. In the second part of this thesis, we explore which mechanisms are essential for MO-GOMEA to achieve excellent search performance on NAS search spaces. We find that the automatic population-sizing scheme of MO-GOMEA offers welcome anytime performance, but that objective-space clustering has only a small beneficial impact. The number of clusters can be set arbitrarily. Special (extreme) clusters that optimize for one objective only can be enabled according to the practitioner's preference, resulting in different search behaviors. The improvement in performance gained by automatically detecting and exploiting dependencies within architectures is limited: this model-based aspect of MO-GOMEA seems helpful only for finding highly accurate networks.
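
To make the two simple baselines named in the abstract concrete, the following is a minimal Python sketch of multi-objective random search and a basic multi-objective local search over a discrete encoding. It is not the thesis's implementation: the toy encoding (N_VARS, N_OPTIONS), the stand-in evaluate function, and the random weighted-sum acceptance rule in local_search are all assumptions chosen only for illustration, and the thesis's actual LS variant and search spaces may differ.

```python
import random

N_VARS, N_OPTIONS = 6, 3  # hypothetical toy encoding: 6 positions, 3 choices each


def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all objectives maximized)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def update_archive(archive, solution, objectives):
    """Return the archive of non-dominated (solution, objectives) pairs after
    offering it one new candidate."""
    if any(dominates(o, objectives) or o == objectives for _, o in archive):
        return archive  # candidate is dominated or duplicates a stored point
    kept = [(s, o) for s, o in archive if not dominates(objectives, o)]
    return kept + [(solution, objectives)]


def evaluate(arch):
    """Stand-in for training a network (assumption): the first objective grows
    with larger choices, the second penalizes them, so a trade-off exists."""
    return (sum(arch), -sum(v * v for v in arch))


def random_search(budget, rng):
    """Sample architectures uniformly and keep the non-dominated ones."""
    archive = []
    for _ in range(budget):
        arch = tuple(rng.randrange(N_OPTIONS) for _ in range(N_VARS))
        archive = update_archive(archive, arch, evaluate(arch))
    return archive


def local_search(budget, rng):
    """Restarted first-improvement local search. Each restart draws a random
    weight and accepts single-variable changes that improve the weighted sum;
    every evaluated architecture is offered to the archive."""
    archive, evals = [], 0
    while evals < budget:
        w = rng.random()

        def score(obj):
            return w * obj[0] + (1 - w) * obj[1]

        current = [rng.randrange(N_OPTIONS) for _ in range(N_VARS)]
        cur_obj = evaluate(current)
        evals += 1
        archive = update_archive(archive, tuple(current), cur_obj)
        improved = True
        while improved and evals < budget:
            improved = False
            for i in rng.sample(range(N_VARS), N_VARS):  # random variable order
                for v in range(N_OPTIONS):
                    if v == current[i] or evals >= budget:
                        continue
                    neighbour = current[:i] + [v] + current[i + 1:]
                    obj = evaluate(neighbour)
                    evals += 1
                    archive = update_archive(archive, tuple(neighbour), obj)
                    if score(obj) > score(cur_obj):
                        current, cur_obj, improved = neighbour, obj, True
    return archive


rs_front = random_search(200, random.Random(0))
ls_front = local_search(200, random.Random(0))
```

Both functions spend the same evaluation budget and return an archive of mutually non-dominated architectures, which is the kind of result set a multi-objective comparison would then score, for example by hypervolume.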

Files

MScThesis_TomDenOttelander.pdf

(pdf | 18.6 Mb)

License info not available