Multi-objective differential evolution in the generation of adversarial examples

None, None; None, None; None, None

Multi-objective differential evolution in the generation of adversarial examples

Journal Article (2024)

Author(s)

A.J. Bartlett (TU Delft - Multimedia Computing)

CCS Liem (TU Delft - Multimedia Computing)

Annibale Panichella (TU Delft - Software Engineering)

Multimedia Computing

DOI related publication

https://doi.org/10.1016/j.scico.2024.103169

Software testing Differential evolution Adversarial examples Deep learning search-based software engineering

To reference this document use:

https://resolver.tudelft.nl/uuid:33fb2c05-f46a-4066-8fc2-93c691980a68

More Info

expand_more

Publication Year

2024

Language

English

Multimedia Computing

Volume number

238

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Adversarial examples remain a critical concern for the robustness of deep learning models, showcasing vulnerabilities to subtle input manipulations. While earlier research focused on generating such examples using white-box strategies, later research focused on gradient-based black-box strategies, as models' internals often are not accessible to external attackers. This paper extends our prior work by exploring a gradient-free search-based algorithm for adversarial example generation, with particular emphasis on differential evolution (DE). Building on top of the classic DE operators, we propose five variants of gradient-free algorithms: a single-objective approach (GADE), two multi-objective variations (NSGA-IIDE and MOEA/DDE), and two many-objective strategies (NSGA-IIIDE and AGE-MOEADE). Our study on five canonical image classification models shows that whilst GADE variant remains the fastest approach, NSGA-IIDE consistently produces more minimal adversarial attacks (i.e., with fewer image perturbations). Moreover, we found that applying a post-process minimization to our adversarial images, would further reduce the number of changes and overall delta variation (image noise).

Files

1-s2.0-S0167642324000923-main.... (pdf)

(pdf | 2.41 Mb)