Advancing deep learning-based detection of floating litter using a novel open dataset

None, None; None, None; None, None; None, None; None, None

Advancing deep learning-based detection of floating litter using a novel open dataset

Journal Article (2023)

Author(s)

T. Jia (TU Delft - Sanitary Engineering)

A.J. Vallendar (TU Delft - Sanitary Engineering, Noria Sustainable Innovators)

Rinze de Vries (Noria Sustainable Innovators)

Z. Kapelan (TU Delft - Sanitary Engineering)

R. Taormina (TU Delft - Sanitary Engineering)

Research Group

Sanitary Engineering

Copyright

DOI related publication

https://doi.org/10.3389/frwa.2023.1298465

Computer vision Artificial intelligence Pollution Plastics Image classification Environmental monitoring

To reference this document use:

https://resolver.tudelft.nl/uuid:1a8d5f22-0243-4849-b82e-f3ff887d02b7

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Research Group

Sanitary Engineering

Volume number

5

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Supervised Deep Learning (DL) methods have shown promise in monitoring the floating litter in rivers and urban canals but further advancements are hard to obtain due to the limited availability of relevant labeled data. To address this challenge, researchers often utilize techniques such as transfer learning (TL) and data augmentation (DA). However, there is no study currently reporting a rigorous evaluation of the effectiveness of these approaches for floating litter detection and their effects on the models' generalization capability. To overcome the problem of limited data availability, this work introduces the “TU Delft—Green Village” dataset, a novel labeled dataset of 9,473 camera and phone images of floating macroplastic litter and other litter items, captured using experiments in a drainage canal of TU Delft. We use the new dataset to conduct a thorough evaluation of the detection performance of five DL architectures for multi-class image classification. We focus the analysis on a systematic evaluation of the benefits of TL and DA on model performances. Moreover, we evaluate the generalization capability of these models for unseen litter items and new device settings, such as increasing the cameras' height and tilting them to 45°. The results obtained show that, for the specific problem of floating litter detection, fine-tuning all layers is more effective than the common approach of fine-tuning the classifier alone. Among the tested DA techniques, we find that simple image flipping boosts model accuracy the most, while other methods have little impact on the performance. The SqueezeNet and DenseNet121 architectures perform the best, achieving an overall accuracy of 89.6 and 91.7%, respectively. We also observe that both models retain good generalization capability which drops significantly only for the most complex scenario tested, but the overall accuracy raises significantly to around 75% when adding a limited amount of images to training data, combined with flipping augmentation. The detailed analyses conducted here and the released open source dataset offer valuable insights and serve as a precious resource for future research.

Files

Frwa_05_1298465.pdf

(pdf | 3.37 Mb)