A semi-supervised autoencoder framework for joint generation and classification of breathing

None, None; None, None; None, None

A semi-supervised autoencoder framework for joint generation and classification of breathing

Journal Article (2021)

Author(s)

Oscar Pastor-Serrano (TU Delft - RST/Medical Physics & Technology)

Danny Lathouwers (TU Delft - RST/Reactor Physics and Nuclear Materials)

Zoltán Perkó (TU Delft - RST/Reactor Physics and Nuclear Materials)

Research Group

RST/Medical Physics & Technology

DOI related publication

https://doi.org/10.1016/j.cmpb.2021.106312

Deep learning Convolutional neural network Semi-supervised learning Respiratory motion Breathing signals Probabilistic autoencoder

To reference this document use:

https://resolver.tudelft.nl/uuid:01eb8725-c489-4481-ae21-a4e84ce128c4

More Info

expand_more

Publication Year

2021

Language

English

Research Group

RST/Medical Physics & Technology

Volume number

209

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Background and objective: One of the main problems with biomedical signals is the limited amount of patient-specific data and the significant amount of time needed to record the sufficient number of samples needed for diagnostic and treatment purposes. In this study, we present a framework to simultaneously generate and classify biomedical time series based on a modified Adversarial Autoencoder (AAE) algorithm and one-dimensional convolutions. Our work is based on breathing time series, with specific motivation to capture breathing motion during radiotherapy lung cancer treatments. Methods: First, we explore the potential in using the Variational Autoencoder (VAE) and AAE algorithms to model breathing signals from individual patients. We then extend the AAE algorithm to allow joint semi-supervised classification and generation of different types of signals within a single framework. To simplify the modeling task, we introduce a pre-processing and post-processing compressing algorithm that transforms the multi-dimensional time series into vectors containing time and position values, which are transformed back into time series through an additional neural network. Results: The resulting models are able to generate realistic and varied samples of breathing. By incorporating 4% and 12% of the labeled samples during training, our model outperforms other purely discriminative networks in classifying breathing baseline shift irregularities from a dataset completely different from the training set, achieving an average macro F1-score of 94.91% and 96.54%, respectively. Conclusion: To our knowledge, the presented framework is the first approach that unifies generation and classification within a single model for this type of biomedical data, enabling both computer aided diagnosis and augmentation of labeled samples within a single framework.

Files

1_s2.0_S0169260721003862_main.... (pdf)

(pdf | 3.8 Mb)