The Current State of the Art in Multi-Label Image Classification Applied on LEGO Bricks

None, None

The Current State of the Art in Multi-Label Image Classification Applied on LEGO Bricks

Bachelor Thesis (2020)

Author(s)

Nishad Tahur (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Jan van Gemert – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Attila Lengyel – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Ricardo Marroquim – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty

Electrical Engineering, Mathematics and Computer Science

Deep Learning Multi-label classification LEGO recognition

To reference this document use:

https://resolver.tudelft.nl/uuid:f594616d-d8c9-47af-bcba-46fc23699fd0

More Info

expand_more

Publication Year

2020

Language

English

Graduation Date

22-06-2020

Awarding Institution

Delft University of Technology

Project

['CSE3000 Research Project', 'Can deep learning recognize Lego pieces?']

Programme

['Computer Science and Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper shows how the current state of the art in image classification performs on LEGO bricks. Currently the standard image classification models with deep learning are single label image classifiers. In this paper we will convert them to work on multi-label images and subsequently evaluate how well they perform. We show how well the classifiers will work on three different types of datasets. Experiments will be conducted on these three types of datasets to compare the performance of three different multi-label image classifiers. The main research question accompanying this paper is ``How well does the state of the art in image classification work on LEGO bricks?''. Three subquestions are set up to answer this question. The first will regard the existence of the image classifiers. The second subquestion will regard how big the influence is of real life aspects, such as deterioration of the LEGO bricks. The final subquestion will be about the performance on the datasets. After answering these questions and conducting the experiments, we came to the conclusion that the ResNext model performed the best on almost all of the categories. Based on the numbers of the results we can also conclude that the models should perform well with multi-label images of LEGO bricks.

Files

Final_Paper_4681517_Nishad_Tah... (pdf)

(pdf | 0.424 Mb)

License info not available