Towards Robust Object Detection in Unseen Catheterization Laboratories

None, None

Towards Robust Object Detection in Unseen Catheterization Laboratories

Master Thesis (2024)

Author(s)

Z. Wang (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

J. Dauwels – Mentor (TU Delft - Signal Processing Systems)

Rick Butler – Mentor (TU Delft - Medical Instruments & Bio-Inspired Technology)

J. van Den Dobbelsteen – Graduation committee member (TU Delft - Medical Instruments & Bio-Inspired Technology)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Object Detection Catheterization Laboratory Domain Generalization

To reference this document use:

https://resolver.tudelft.nl/uuid:c4b2d25c-0b2e-41e1-9d2b-32b9e0c88bb2

More Info

expand_more

Publication Year

2024

Language

English

Copyright

Graduation Date

29-01-2024

Awarding Institution

Delft University of Technology

Programme

['Electrical Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep-learning-based object detectors, while offering exceptional performance, are data-dependent and can suffer from generalization issues. In this thesis, we investigated deep neural networks for detecting people and medical instruments in the vision-based workflow analysis system inside Catheterization Laboratories (Cath Labs). The central problem explored in this thesis is the fact that the performance of the detector can degrade drastically if it is trained and tested on data from different Cath Labs.

Our research aimed to investigate the underlying causes of this specific performance degradation and find solutions to mitigate this issue. We employed the YOLOv8 object detector and created datasets from clinical procedures recorded at Reinier de Graaf Hospital (RdGG) and Philips Best Campus, supplemented with publicly accessible images. An aggregated version of object detection metrics was created for multi-camera system evaluation. Through a series of experiments complemented by data visualization, we discovered that the performance degradation primarily stems from data distribution shifts in the feature space. Notably, the object detector trained on non-sensitive online images can generalize to unseen Cath Labs, outperforming the model trained on a procedure recording from a different Cath Lab. The detector trained on the online images achieved an mAP@0.5 of 0.517 on the RdGG dataset. Furthermore, by switching to the most suitable camera for each object, the multi-camera system can further improve detection performance significantly. An aggregated 1-camera mAP@0.5 of 0.679 is achieved for single-object classes on the RdGG dataset.

Files

MSc_Zipeng_Wang_Cath_Lab2.pdf

(pdf | 11.5 Mb)

License info not available