Reducing annotation effort in patient-ventilator asynchrony detection with distance-based clustering

Journal Article (2026)
Author(s)

Lars van de Kamp (Eindhoven University of Technology, Demcon Life Sciences and Health)

Dolf Weller (Maasstad Ziekenhuis)

Rick Thijssen (Eindhoven University of Technology)

Bram Hunnekens (Demcon Life Sciences and Health)

Tom Bakkes (Eindhoven University of Technology)

Simona Turco (Eindhoven University of Technology)

Corstiaan den Uil (Maasstad Ziekenhuis)

Tom Oomen (TU Delft - Mechanical Engineering, Eindhoven University of Technology)

Nathan van de Wouw (Eindhoven University of Technology)

Research Group
Team Jan-Willem van Wingerden
DOI related publication
https://doi.org/10.1109/OJCSYS.2026.3685094 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Research Group
Team Jan-Willem van Wingerden
Journal title
IEEE Open Journal of Control Systems
Volume number
5
Pages (from-to)
289-302
Downloads counter
7
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Objective: This study aims to reduce expert annotation effort in detecting patient-ventilator asynchrony (PVA) by introducing a semi-supervised learning framework for time series classification. Methods and procedures: We propose a model-independent framework that integrates hierarchical clustering and dynamic time-warping (DTW) for efficient data selection and label projection. The framework includes five steps: data collection, selection, annotation, projection, and model training. It is validated using a fully labeled dataset from Fondazione I.R.C.C.S. Policlinico San Matteo and applied to an unlabeled dataset from Maasstad Hospital, where annotation consistency and label quality are analyzed. Results: The framework reduces annotation effort by over 75% while closely resembling classification performance. On the San Matteo dataset, the model trained with projected labels achieved performance close to that of a fully supervised model. The method effectively captured rare PVA types and improved macro-averaged F1 scores compared to random sampling. On the Maasstad dataset, despite annotation inconsistencies, the framework demonstrated moderate detection performance (75% micro-averaged F1 score) using labels from a single clinical expert. Conclusion: Our semi-supervised framework enables scalable and efficient annotation of clinical time series data, maintaining model accuracy with minimal expert input. It is robust across datasets and adaptable to varying signal quality and annotation consistency.