How to beat a Bayesian adversary

Journal Article (2025)
Author(s)

Zihan Ding (Princeton University)

Kexin Jin (Princeton University)

Jonas Latz (The University of Manchester)

C. Liu (TU Delft - Applied Probability)

Research Group
Applied Probability
DOI (related publication)
https://doi.org/10.1017/S0956792525000105
Publication Year
2025
Language
English
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deep neural networks and other modern machine learning models are often susceptible to adversarial attacks. Indeed, an adversary can often change a model's prediction through a small, directed perturbation of the model's input, which is an issue in safety-critical applications. Adversarially robust machine learning is usually based on a minmax optimisation problem that minimises the machine learning loss under maximisation-based adversarial attacks. In this work, we study adversaries that determine their attack using a Bayesian statistical approach rather than maximisation. The resulting Bayesian adversarial robustness problem is a relaxation of the usual minmax problem. To solve it, we propose Abram, a continuous-time particle system designed to approximate the gradient flow corresponding to the underlying learning problem. We show that Abram approximates a McKean-Vlasov process and justify its use by giving assumptions under which the McKean-Vlasov process finds the minimiser of the Bayesian adversarial robustness problem. We discuss two ways to discretise Abram and demonstrate its suitability in benchmark adversarial deep learning experiments.
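
For orientation: adversarially robust learning is typically posed as a minmax problem, and a Bayesian adversary replaces the inner maximisation by an average over a Gibbs-type distribution that concentrates on high-loss perturbations. The sketch below uses standard illustrative notation (loss \ell, model f_\theta, perturbation budget \varepsilon, temperature \tau, reference measure \mu) rather than the paper's own definitions, and only indicates how the relaxation arises.

% Usual minmax adversarial robustness problem (illustrative notation):
\min_{\theta}\; \mathbb{E}_{(x,y)}\Big[\max_{\|\delta\|\le\varepsilon} \ell\big(f_\theta(x+\delta),y\big)\Big]
% Bayesian relaxation (sketch): the inner maximisation becomes an expectation
% over a Gibbs-type distribution on perturbations that favours high-loss attacks.
\;\rightsquigarrow\;
\min_{\theta}\; \mathbb{E}_{(x,y)}\Big[\mathbb{E}_{\delta\sim\pi_{\theta,x,y}} \ell\big(f_\theta(x+\delta),y\big)\Big],
\qquad
\pi_{\theta,x,y}(\mathrm{d}\delta)\;\propto\;\exp\!\big(\ell(f_\theta(x+\delta),y)/\tau\big)\,\mu(\mathrm{d}\delta).

As the temperature \tau decreases to zero, the Gibbs distribution concentrates on worst-case perturbations and one typically recovers the usual minmax problem; for positive \tau the inner problem is a smoother relaxation, which is the regime in which a particle approximation of the associated gradient flow, such as Abram, can be applied.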