S. Koffas | TU Delft Repository

Towards Backdoor Stealthiness in Model Parameter Space

Conference paper (2025) - Xiaoyun Xu , Zhuoran Liu , Stefanos Koffas , Stjepan Picek

Backdoor attacks maliciously inject covert functionality into machine learning models, representing a security threat. The stealthiness of backdoor attacks is a critical research direction, focusing on adversaries' efforts to enhance the resistance of backdoor attacks against def ...

Backdoor attacks maliciously inject covert functionality into machine learning models, representing a security threat. The stealthiness of backdoor attacks is a critical research direction, focusing on adversaries' efforts to enhance the resistance of backdoor attacks against defense mechanisms. Recent research on backdoor stealthiness focuses mainly on indistinguishable triggers in input space and inseparable backdoor representations in feature space, aiming to circumvent backdoor defenses that examine these respective spaces. However, existing backdoor attacks are typically designed to resist a specific type of backdoor defense without considering the diverse range of defense mechanisms. Based on this observation, we pose a natural question: Are current backdoor attacks truly a real-world threat when facing diverse practical defenses? To answer this question, we examine 12 common backdoor attacks that focus on input-space or feature-space stealthiness and 17 diverse representative defenses. Surprisingly, we reveal a critical blind spot that backdoor attacks designed to be stealthy in input and feature spaces can be mitigated by examining backdoored models in parameter space. To investigate the underlying causes behind this common vulnerability, we study the characteristics of backdoor attacks in the parameter space. Notably, we find that input- and feature-space attacks introduce prominent backdoor-related neurons in parameter space, which are not thoroughly considered by current backdoor attacks. Taking comprehensive stealthiness into account, we propose a novel supply-chain attack called Grond. Grond limits the parameter changes by a simple yet effective module, Adversarial Backdoor Injection (ABI), which adaptively increases the parameter-space stealthiness during the backdoor injection. Extensive experiments demonstrate that Grond outperforms all 12 backdoor attacks against state-of-the-art (including adaptive) defenses on CIFAR10, GTSRB, and a subset of ImageNet. Additionally, we show that ABI consistently improves the effectiveness of common backdoor attacks.

EmoBack

Backdoor Attacks Against Speaker Identification Using Emotional Prosody

Conference paper (2024) - Coen Schoof , Stefanos Koffas , Mauro Conti , Stjepan Picek

Speaker identification (SI) determines a speaker's identity based on their utterances. Previous work indicates that SI deep neural networks (DNNs) are vulnerable to backdoor attacks that embed a backdoor functionality in a DNN causing incorrect outputs during inference when a tri ...

ELMs Under Siege

A Study on Backdoor Attacks on Extreme Learning Machines

Conference paper (2024) - Behrad Tajalli , Stefanos Koffas , Gorka Abad , Stjepan Picek

Due to their computational efficiency and speed during training and inference, extreme learning machines are suitable for simple learning tasks on lightweight datasets. Examples of their real-world applications include healthcare and edge devices, where security concerns are cruc ...

Unveiling the Threat

Investigating Distributed and Centralized Backdoor Attacks in Federated Graph Neural Networks

Journal article (2024) - Jing Xu , Stefanos Koffas , Stjepan Picek

Graph neural networks (GNNs) have gained significant popularity as powerful deep learning methods for processing graph data. However, centralized GNNs face challenges in data-sensitive scenarios due to privacy concerns and regulatory restrictions. Federated learning has emerged a ...

Backdoors on Manifold Learning

Conference paper (2024) - Christina Kreza , Stefanos Koffas , Behrad Tajalli , Mauro Conti , Stjepan Picek

Recently, attackers have targeted machine learning systems, introducing various attacks. The backdoor attack is popular in this field and is usually realized through data poisoning. To the best of our knowledge, we are the first to investigate whether the backdoor attacks remain ...

Toward Stealthy Backdoor Attacks Against Speech Recognition via Elements of Sound

Journal article (2024) - Hanbo Cai , Pengcheng Zhang , Hai Dong , Yan Xiao , Stefanos Koffas , Yiming Li

Deep neural networks (DNNs) have been widely and successfully adopted and deployed in various applications of speech recognition. Recently, a few works revealed that these models are vulnerable to backdoor attacks, where the adversaries can implant malicious prediction behaviors ...

Beyond PhantomSponges

Enhancing Sponge Attack on Object Detection Models

Conference paper (2024) - Coen Schoof , Stefanos Koffas , Mauro Conti , Stjepan Picek

Given today's ongoing deployment of deep learning models, ensuring their security against adversarial attacks has become paramount. This paper introduces an enhanced version of the PhantomSponges attack by Shapira et al. The attack exploits the non-maximum suppression (NMS) algor ...

Going in Style

Audio Backdoors Through Stylistic Transformations

Conference paper (2023) - Stefanos Koffas , Luca Pajola , Stjepan Picek , Mauro Conti

This work explores stylistic triggers for backdoor attacks in the audio domain: dynamic transformations of malicious samples through guitar effects. We first formalize stylistic triggers – currently missing in the literature. Second, we explore how to develop stylistic triggers i ...

A Systematic Evaluation of Backdoor Attacks in Various Domains

Book chapter (2023) - Stefanos Koffas , Behrad Tajalli , Jing Xu , Mauro Conti , Stjepan Picek

Deep learning found its place in various real-world applications, where many also have security requirements. Unfortunately, as these systems become more pervasive, understanding how they fail becomes more challenging. While there are multiple failure modes in machine learning, o ...

Watermarking Graph Neural Networks based on Backdoor Attacks

Conference paper (2023) - Jing Xu , Stefanos Koffas , Oǧuzhan Ersoy , Stjepan Picek

Graph Neural Networks (GNNs) have achieved promising performance in various real-world applications. Building a powerful GNN model is not a trivial task, as it requires a large amount of training data, powerful computing resources, and human expertise. Moreover, with the developm ...

Backdoor Pony

Evaluating backdoor attacks and defenses in different domains

Journal article (2023) - Arthur Mercier , Nikita Smolin , Oliver Sihlovec , Stefanos Koffas , Stjepan Picek

Outsourced training and crowdsourced datasets lead to a new threat for deep learning models: the backdoor attack. In this attack, the adversary inserts a secret functionality in a model, activated through malicious inputs. Backdoor attacks represent an active research area due to ...

Dynamic Backdoors with Global Average Pooling

Conference paper (2022) - Stefanos Koffas , Stjepan Picek , Mauro Conti

Outsourced training and machine learning as a service have resulted in novel attack vectors like backdoor attacks. Such attacks embed a secret functionality in a neural network activated when the trigger is added to its input. In most works in the literature, the trigger is stati ...

On the Effect of Clock Frequency on Voltage and Electromagnetic Fault Injection

Conference paper (2022) - Stefanos Koffas , Praveen Kumar Vadnala

We investigate the influence of clock frequency on the success rate of a fault injection attack. In particular, we examine the success rate of voltage and electromagnetic fault attacks for varying clock frequencies. Using three different tests that cover different components of a ...

Can You Hear It? Backdoor Attacks via Ultrasonic Triggers

Conference paper (2022) - Stefanos Koffas , Jing Xu , Mauro Conti , Stjepan Picek

This work explores backdoor attacks for automatic speech recognition systems where we inject inaudible triggers. By doing so, we make the backdoor attack challenging to detect for legitimate users and, consequently, potentially more dangerous. We conduct experiments on two versio ...

More is Better (Mostly): On the Backdoor Attacks in Federated Graph Neural Networks

Conference paper (2022) - J. Xu , R. Wang , S. Koffas , K. Liang , S. Picek

Graph Neural Networks (GNNs) are a class of deep learning-based methods for processing graph domain information. GNNs have recently become a widely used graph analysis method due to their superior ability to learn representations for complex graph data. Due to privacy concerns an ...