EAD-GAN

A Generative Adversarial Network for Disentangling Affine Transforms in Images

Journal Article (2022)
Author(s)

Letao Liu (Nanyang Technological University)

Xudong Jiang (Nanyang Technological University)

Martin Saerbeck (TÜV SÜD PSB)

J. Dauwels (TU Delft - Signal Processing Systems)

Research Group
Signal Processing Systems
DOI
https://doi.org/10.1109/TNNLS.2022.3195533
Publication Year
2022
Language
English
Issue number
3
Volume number
35
Pages (from-to)
3652-3662
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This article proposes the explicit affine disentangled generative adversarial network (EAD-GAN), a generative adversarial network that explicitly disentangles affine transforms in a self-supervised manner. We propose an affine transform regularizer that forces the InfoGAN to exhibit the explicit properties of an affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike existing approaches, the representations learned by the proposed EAD-GAN have a clear physical meaning: transforms such as rotation, horizontal and vertical zooms, skews, and translations are explicitly learned from the training data. We can therefore set each transform parameter individually to generate specifically transformed data with the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. On the dSprites dataset, EAD-GAN outperforms the state-of-the-art methods by a large margin: for example, it achieves MIG and DCI scores of 0.59 and 0.96, respectively, compared with 0.37 and 0.71 for the state-of-the-art methods.
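To make the parameter-inference step concrete, the sketch below (a minimal NumPy illustration, not the authors' implementation; compose_affine and fit_affine are hypothetical names) composes a 2-D affine matrix from explicit parameters, loosely mirroring the two-matrix decomposition described in the abstract, and recovers the six affine coefficients from point correspondences by the least-squares method.

import numpy as np

def compose_affine(theta, sx, sy, kx, ky, tx, ty):
    # Hypothetical sketch: build a 3x3 affine matrix as a rotation matrix
    # times a zoom/skew/translation matrix (one possible two-matrix split).
    rot = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                    [np.sin(theta),  np.cos(theta), 0.0],
                    [0.0,            0.0,           1.0]])
    zst = np.array([[sx,  kx,  tx],
                    [ky,  sy,  ty],
                    [0.0, 0.0, 1.0]])
    return rot @ zst

def fit_affine(src, dst):
    # Least-squares fit of the six affine coefficients (a..f) mapping
    # src points to dst points: x' = a*x + b*y + c, y' = d*x + e*y + f.
    n = src.shape[0]
    A = np.zeros((2 * n, 6))
    A[0::2, 0:2] = src; A[0::2, 2] = 1.0   # equations for x'
    A[1::2, 3:5] = src; A[1::2, 5] = 1.0   # equations for y'
    b = dst.reshape(-1)                    # interleaved x0', y0', x1', ...
    params, *_ = np.linalg.lstsq(A, b, rcond=None)
    a_, b_, c_, d_, e_, f_ = params
    return np.array([[a_, b_, c_], [d_, e_, f_], [0.0, 0.0, 1.0]])

# Usage: round-trip a known transform through the least-squares solver.
M = compose_affine(theta=0.3, sx=1.2, sy=0.9, kx=0.1, ky=0.0, tx=2.0, ty=-1.0)
pts = np.random.default_rng(0).uniform(-1.0, 1.0, size=(50, 2))
pts_h = np.c_[pts, np.ones(50)]          # homogeneous coordinates
dst = (M @ pts_h.T).T[:, :2]
M_hat = fit_affine(pts, dst)
assert np.allclose(M, M_hat, atol=1e-8)

Since the correspondences here are generated by an exact affine map, the least-squares solution recovers the matrix exactly; in the paper's setting, the same normal-equation machinery yields the best-fit transform parameters from the encoder's noisy predictions.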