A semi-supervised autoencoder framework for joint generation and classification of breathing

Journal Article (2021)
Author(s)

Oscar Pastor-Serrano (TU Delft - RST/Medical Physics & Technology)

Danny Lathouwers (TU Delft - RST/Reactor Physics and Nuclear Materials)

Zoltan Perkó (TU Delft - RST/Reactor Physics and Nuclear Materials)

Research Group
RST/Medical Physics & Technology
Copyright
© 2021 O. Pastor Serrano, D. Lathouwers, Z. Perko
DOI related publication
https://doi.org/10.1016/j.cmpb.2021.106312
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 O. Pastor Serrano, D. Lathouwers, Z. Perko
Research Group
RST/Medical Physics & Technology
Volume number
209
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Background and objective: One of the main problems with biomedical signals is the limited amount of patient-specific data and the significant amount of time needed to record the sufficient number of samples needed for diagnostic and treatment purposes. In this study, we present a framework to simultaneously generate and classify biomedical time series based on a modified Adversarial Autoencoder (AAE) algorithm and one-dimensional convolutions. Our work is based on breathing time series, with specific motivation to capture breathing motion during radiotherapy lung cancer treatments. Methods: First, we explore the potential in using the Variational Autoencoder (VAE) and AAE algorithms to model breathing signals from individual patients. We then extend the AAE algorithm to allow joint semi-supervised classification and generation of different types of signals within a single framework. To simplify the modeling task, we introduce a pre-processing and post-processing compressing algorithm that transforms the multi-dimensional time series into vectors containing time and position values, which are transformed back into time series through an additional neural network. Results: The resulting models are able to generate realistic and varied samples of breathing. By incorporating 4% and 12% of the labeled samples during training, our model outperforms other purely discriminative networks in classifying breathing baseline shift irregularities from a dataset completely different from the training set, achieving an average macro F1-score of 94.91% and 96.54%, respectively. Conclusion: To our knowledge, the presented framework is the first approach that unifies generation and classification within a single model for this type of biomedical data, enabling both computer aided diagnosis and augmentation of labeled samples within a single framework.