Beyond Local Nash Equilibria for Adversarial Networks

Conference Paper (2019)
Author(s)

FA Oliehoek (TU Delft - Interactive Intelligence)

Rahul Savani (University of Liverpool)

Jose Gallego (Universiteit van Amsterdam)

Elise van der van der Pol (Universiteit van Amsterdam)

Roderich Gross (University of Sheffield)

Research Group
Interactive Intelligence
Copyright
© 2019 F.A. Oliehoek, Rahul Savani, Jose Gallego, Elise van der Pol, Roderich Groß
DOI related publication
https://doi.org/10.1007/978-3-030-31978-6_7
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 F.A. Oliehoek, Rahul Savani, Jose Gallego, Elise van der Pol, Roderich Groß
Research Group
Interactive Intelligence
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care   Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Pages (from-to)
73-89
ISBN (print)
978-3-030-31977-9
ISBN (electronic)
978-3-030-31978-6
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Save for some special cases, current training methods for Generative Adversarial Networks (GANs) are at best guaranteed to converge to a ‘local Nash equilibrium’ (LNE). Such LNEs, however, can be arbitrarily far from an actual Nash equilibrium (NE), which implies that there are no guarantees on the quality of the found generator or classifier. This paper proposes to model GANs explicitly as finite games in mixed strategies, thereby ensuring that every LNE is an NE. We use the Parallel Nash Memory as a solution method, which is proven to monotonically converge to a resource-bounded Nash equilibrium. We empirically demonstrate that our method is less prone to typical GAN problems such as mode collapse and produces solutions that are less exploitable than those produced by GANs and MGANs.

Files

978_3_030_31978_6_7.pdf
(pdf | 2.88 Mb)
- Embargo expired in 25-03-2020
License info not available