Towards Benchmarking the Robustness of Neuro-Symbolic Learning against Data Poisoning Backdoor Attacks
Evaluating the Robustness of Logic Tensor Networks under BadNet attacks
M.C. Guranda (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Kaitai Liang – Mentor (TU Delft - Cyber Security)
A. Agiollo – Mentor (TU Delft - Cyber Security)
A. Hanjalic – Graduation committee member (TU Delft - Intelligent Systems)
Abstract
Neural networks have become standard solutions in many safety-critical real-world applications, such as healthcare. Yet their vulnerability to backdoor attacks remains a concern: these attacks modify a small portion of the training data or the model to implant hidden behaviors that activate only in the presence of an attacker-chosen trigger. Neuro-symbolic (NeSy) models, which integrate neural networks with symbolic reasoning, have been proposed as more robust and explainable alternatives, but their resilience against backdoor attacks has not yet been examined. This research investigates the robustness of Logic Tensor Networks (LTNs), a representative class of NeSy models, against BadNet attacks, a simple and stealthy class of data poisoning backdoor attacks. Through empirical evaluations, we analyze how a stronger emphasis on symbolic reasoning affects LTN robustness, measuring the attack success rate (ASR) across different LTN configurations and BadNet attack settings. Our findings aim to provide a first insight into the vulnerability of NeSy systems to backdoor attacks.
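To make the threat model concrete, the sketch below illustrates a BadNet-style poisoning step and the ASR metric the abstract refers to. It is a minimal illustration under assumed conventions, not the thesis's exact setup: the patch trigger in the image corner, the 5% poison rate, and the function names (apply_badnet_trigger, poison_dataset, attack_success_rate) are all illustrative assumptions.

```python
import numpy as np

def apply_badnet_trigger(image, trigger_value=1.0, patch_size=3):
    """Stamp a small bright patch (the backdoor trigger) in the
    bottom-right corner of an image, BadNet-style. Patch location,
    size, and value are illustrative assumptions."""
    poisoned = image.copy()
    poisoned[-patch_size:, -patch_size:] = trigger_value
    return poisoned

def poison_dataset(images, labels, target_label, poison_rate=0.05, rng=None):
    """Stamp the trigger onto a small random fraction of training
    images and relabel them to the attacker's target class."""
    rng = rng or np.random.default_rng(0)
    images, labels = images.copy(), labels.copy()
    n_poison = int(len(images) * poison_rate)
    idx = rng.choice(len(images), size=n_poison, replace=False)
    for i in idx:
        images[i] = apply_badnet_trigger(images[i])
        labels[i] = target_label
    return images, labels

def attack_success_rate(model_predict, test_images, test_labels, target_label):
    """ASR: fraction of triggered test inputs (excluding inputs whose
    true class is already the target) that the backdoored model
    misclassifies as the attacker's target label."""
    mask = test_labels != target_label
    triggered = np.stack([apply_badnet_trigger(x) for x in test_images[mask]])
    preds = model_predict(triggered)
    return float(np.mean(preds == target_label))
```

A successful BadNet attack yields a high ASR on triggered inputs while leaving accuracy on clean inputs largely unchanged, which is what makes the attack stealthy.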