Watermarking Graph Neural Networks based on Backdoor Attacks

None, None; None, None; None, None; None, None

Watermarking Graph Neural Networks based on Backdoor Attacks

Conference Paper (2023)

Author(s)

Jing Xu (TU Delft - Cyber Security)

S. Koffas (TU Delft - Cyber Security)

Oǧuzhan Ersoy (Radboud Universiteit Nijmegen)

Stjepan Picek (Radboud Universiteit Nijmegen)

Research Group

Cyber Security

Copyright

DOI related publication

https://doi.org/10.1109/EuroSP57164.2023.00072

To reference this document use:

https://resolver.tudelft.nl/uuid:fa8734e9-ba92-472b-9eab-65f3c5a24817

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Research Group

Cyber Security

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. @en

Pages (from-to)

1179-1197

ISBN (electronic)

978-1-6654-6512-0

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Graph Neural Networks (GNNs) have achieved promising performance in various real-world applications. Building a powerful GNN model is not a trivial task, as it requires a large amount of training data, powerful computing resources, and human expertise. Moreover, with the development of adversarial attacks, e.g., model stealing attacks, GNNs raise challenges to model authentication. To avoid copyright infringement on GNNs, verifying the ownership of the GNN models is necessary.This paper presents a watermarking framework for GNNs for both graph and node classification tasks. We 1) design two strategies to generate watermarked data for the graph classification task and one for the node classification task, 2) embed the watermark into the host model through training to obtain the watermarked GNN model, and 3) verify the ownership of the suspicious model in a black-box setting. The experiments show that our framework can verify the ownership of GNN models with a very high probability (up to 99%) for both tasks. We also explore our watermarking mechanism against an adaptive attacker with access to partial knowledge of the watermarked data. Finally, we experimentally show that our watermarking approach is robust against a state-of-the-art model extraction technique and four state-of-the-art defenses against backdoor attacks.

Files

Watermarking_Graph_Neural_Netw... (pdf)

(pdf | 1.42 Mb)

- Embargo expired in 31-01-2024

License info not available