EAD-GAN

A Generative Adversarial Network for Disentangling Affine Transforms in Images

Journal Article (2022)
Author(s)

Letao Liu (Nanyang Technological University)

Xudong Jiang (Nanyang Technological University)

Martin Saerbeck (TÜV SÜD PSB)

J. Dauwels (TU Delft - Signal Processing Systems)

Research Group
Signal Processing Systems
DOI
https://doi.org/10.1109/TNNLS.2022.3195533
Publication Year
2022
Language
English
Issue number
3
Volume number
35
Pages (from-to)
3652-3662
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This article proposes the explicit affine disentangled generative adversarial network (EAD-GAN), a generative adversarial network that explicitly disentangles affine transforms in a self-supervised manner. We propose an affine transform regularizer that forces the InfoGAN to exhibit the explicit properties of an affine transform. To facilitate training an affine transform encoder, we decompose the affine matrix into two separate matrices and infer the explicit transform parameters by the least-squares method. Unlike existing approaches, the representations learned by the proposed EAD-GAN have a clear physical meaning: transforms such as rotation, horizontal and vertical zooms, skews, and translations are explicitly learned from the training data. We can therefore set each transform parameter individually to generate specifically transformed data with the learned network. We show that the proposed EAD-GAN successfully disentangles these attributes on the MNIST, CelebA, and dSprites datasets. On the dSprites dataset, EAD-GAN outperforms the state-of-the-art methods by a large margin: for example, it achieves MIG and DCI scores of 0.59 and 0.96, respectively, compared with 0.37 and 0.71 for the state-of-the-art methods.
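To make the parameter-inference step concrete, the sketch below (a minimal NumPy illustration, not the authors' implementation; compose_affine and fit_affine are hypothetical names) composes a 2-D affine matrix from explicit parameters, loosely mirroring the two-matrix decomposition described in the abstract, and recovers the six affine coefficients from point correspondences by the least-squares method.

import numpy as np

def compose_affine(theta, sx, sy, kx, ky, tx, ty):
    # Hypothetical sketch: build a 3x3 affine matrix as a rotation matrix
    # times a zoom/skew/translation matrix (one possible two-matrix split).
    rot = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                    [np.sin(theta),  np.cos(theta), 0.0],
                    [0.0,            0.0,           1.0]])
    zst = np.array([[sx,  kx,  tx],
                    [ky,  sy,  ty],
                    [0.0, 0.0, 1.0]])
    return rot @ zst

def fit_affine(src, dst):
    # Least-squares fit of the six affine coefficients (a..f) mapping
    # src points to dst points: x' = a*x + b*y + c, y' = d*x + e*y + f.
    n = src.shape[0]
    A = np.zeros((2 * n, 6))
    A[0::2, 0:2] = src; A[0::2, 2] = 1.0   # equations for x'
    A[1::2, 3:5] = src; A[1::2, 5] = 1.0   # equations for y'
    b = dst.reshape(-1)                    # interleaved x0', y0', x1', ...
    params, *_ = np.linalg.lstsq(A, b, rcond=None)
    a_, b_, c_, d_, e_, f_ = params
    return np.array([[a_, b_, c_], [d_, e_, f_], [0.0, 0.0, 1.0]])

# Usage: round-trip a known transform through the least-squares solver.
M = compose_affine(theta=0.3, sx=1.2, sy=0.9, kx=0.1, ky=0.0, tx=2.0, ty=-1.0)
pts = np.random.default_rng(0).uniform(-1.0, 1.0, size=(50, 2))
pts_h = np.c_[pts, np.ones(50)]          # homogeneous coordinates
dst = (M @ pts_h.T).T[:, :2]
M_hat = fit_affine(pts, dst)
assert np.allclose(M, M_hat, atol=1e-8)

Since the correspondences here are generated by an exact affine map, the least-squares solution recovers the matrix exactly; in the paper's setting, the same normal-equation machinery yields the best-fit transform parameters from the encoder's noisy predictions.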