Towards Robust Object Detection in Unseen Catheterization Laboratories

Conference Paper (2024)
Author(s)

Zipeng Wang (Student TU Delft)

Rick Butler (TU Delft - Mechanical Engineering)

John van den Dobbelsteen (TU Delft - Mechanical Engineering)

Benno Hendriks (Philips Research Laboratories, TU Delft - Mechanical Engineering)

Maarten van der Elst (TU Delft - Mechanical Engineering, Reinier de Graaf Group)

Justin Dauwels (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Signal Processing Systems
DOI related publication
https://doi.org/10.1109/MeMeA60663.2024.10596906 Final published version
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Signal Processing Systems
ISBN (print)
979-8-3503-0800-6
ISBN (electronic)
979-8-3503-0799-3
Event
2024 IEEE International Symposium on Medical Measurements and Applications, MeMeA 2024 (2024-06-26 - 2024-06-28), Eindhoven, Netherlands
Downloads counter
271
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep learning-based object detectors, while offering exceptional performance, are data-dependent and can suffer from generalization issues. In this work, we investigated deep neural networks for detecting people and medical instruments for the vision-based workflow analysis system inside Catheterization Laboratories (Cath Labs). The central problem explored in this paper is the fact that the performance of the detector can degrade drastically if it is trained and tested on data from different Cath Labs. Our research aimed to investigate the underlying causes of this specific performance degradation and find solutions to mitigate this issue. We employed the YOLOv8 object detector and created datasets from clinical procedures recorded at Reinier de Graaf Hospital (RdGG) and Philips Best Campus, supplemented with publicly accessible images. Through a series of experiments complemented by data visualization, we discovered that the performance degradation primarily stems from data distribution shifts in the feature space. Notably, the object detector trained on non-sensitive online images can generalize to unseen Cath Labs, outperforming the model trained on a procedure recording from a different Cath Lab. The detector trained on the online images achieved an mAP@0.5 of 0.517 on the RdGG dataset. Furthermore, by switching to the most suitable camera for each object in the Cath Lab, the multi-camera system can further improve the detection performance significantly. An aggregated L-camera mAP@0.5 of 0.679 is achieved for single-object classes on the RdGG dataset.

Files

Towards_Robust_Object_Detectio... (pdf)
(pdf | 2.45 Mb)
- Embargo expired in 29-01-2025
License info not available