The Effect of Cross-Domain Class Imbalance on Distribution Alignment

Master Thesis (2025)
Author(s)

J. Luu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

J.H. Krijthe – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

G. van Tulder – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

E. Demirović – Graduation committee member (TU Delft - Algorithmics)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
12-11-2025
Awarding Institution
Delft University of Technology
Programme
['Computer Science']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Statistical distribution alignment methods for domain adaptation assume similar class distributions across domains, but this assumption cannot always be guaranteed in medical imaging data. This research investigates the effect of cross-domain class imbalance on statistical distribution alignment in unsupervised domain adaptation for medical image classification. Our experiments demonstrate that statistical distribution alignment using MMD performs reliably under mild domain shifts but struggles when both severe cross-domain class imbalance and complex domain shifts are present. To address this, we implement class-conditioned domain alignment with a new weighted minibatch sampling method. Under conditions of extreme domain shift and severe cross-domain class imbalance, combining statistical distribution alignment with more complex sampling strategies results in small improvements compared to alignment with random sampling, suggesting that class-conditioned distribution alignment offers limited practical benefits. The model appears robust to label noise, but since the performance gains are tiny, the choice of sampling strategy could have limited influence on overall performance. In our experiments, we employ the CHECK and OAI hip X-ray datasets to investigate binary osteoarthritis classification under varying levels of domain shift and cross-domain class imbalance.

Files

License info not available