Domain Adaptation for Enhancing Visual Hand Landmark Prediction AI in Infrared Imaging

Bachelor Thesis (2025)
Author(s)

V. Sachkov (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Z.Y. Lin – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

T.C. Markhorst – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

J.C. van Gemert – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

K. Liang – Graduation committee member (TU Delft - Cyber Security)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
30-01-2025
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this
work, we investigate how domain adaptation techniques can improve the
performance of hand landmark detection models originally trained on RGB images
when deployed on infrared (IR) data. Our motivation stems from a medical use
case in Nepal, where clinicians require reliable temperature estimation at hand
keypoints to detect early signs of leprosy. We evaluate three methods on a small
IR dataset (80 labeled images & 5000 unlabeled frames): a shallow
adaptation (AdaBN), a deep alignment approach (Deep CORAL), and a test-time
subspace alignment method (SSA). Our experiments show that while AdaBN and SSA
yield moderate improvements, Deep CORAL achieves stronger gains through
targeted training of specific model components. The combination of these
methods produces superior results, yielding an 11% improvement in percentage of
correct keypoints (PCK@0.05) on our custom annotated IR dataset. These findings
demonstrate that combining lightweight and deep domain adaptation approaches
can effectively enhance IR hand landmark detection accuracy without requiring
large labeled datasets, enabling practical deployment for clinical thermal
imaging in resource-limited settings.



Files

FPTemp.pdf
(pdf | 2.78 Mb)
License info not available