Beyond Local Nash Equilibria for Adversarial Networks

None, None; None, None; None, None; None, None; None, None

Beyond Local Nash Equilibria for Adversarial Networks

Conference Paper (2019)

Author(s)

FA Oliehoek (TU Delft - Interactive Intelligence)

Rahul Savani (University of Liverpool)

Jose Gallego (Universiteit van Amsterdam)

Elise van der van der Pol (Universiteit van Amsterdam)

Roderich Gross (University of Sheffield)

Research Group

Interactive Intelligence

Copyright

DOI related publication

https://doi.org/10.1007/978-3-030-31978-6_7

To reference this document use:

https://resolver.tudelft.nl/uuid:6954eb47-2b2c-4501-872a-d8e621e19846

More Info

expand_more

Publication Year

2019

Language

English

Copyright

Research Group

Interactive Intelligence

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en

Pages (from-to)

73-89

ISBN (print)

978-3-030-31977-9

ISBN (electronic)

978-3-030-31978-6

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Save for some special cases, current training methods for Generative Adversarial Networks (GANs) are at best guaranteed to converge to a ‘local Nash equilibrium’ (LNE). Such LNEs, however, can be arbitrarily far from an actual Nash equilibrium (NE), which implies that there are no guarantees on the quality of the found generator or classifier. This paper proposes to model GANs explicitly as finite games in mixed strategies, thereby ensuring that every LNE is an NE. We use the Parallel Nash Memory as a solution method, which is proven to monotonically converge to a resource-bounded Nash equilibrium. We empirically demonstrate that our method is less prone to typical GAN problems such as mode collapse and produces solutions that are less exploitable than those produced by GANs and MGANs.

Files

978_3_030_31978_6_7.pdf

(pdf | 2.88 Mb)

- Embargo expired in 25-03-2020

License info not available