Are BERT-based fact-checking models robust against adversarial attack?
E.E. Afriat (TU Delft - Electrical Engineering, Mathematics and Computer Science)
Abstract
We examine the vulnerability of BERT-based fact-checking models to adversarial attacks. We implement a gradient-based adversarial attack strategy based on HotFlip, which swaps individual tokens of the input, and apply it to a pre-trained ExPred model for fact-checking. We find that gradient-based adversarial attacks are ineffective against ExPred. However, uncertainty about the semantic similarity between the adversarial examples generated by our implementation and the original inputs casts doubt on these results.
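To make the attack strategy concrete, the sketch below implements a HotFlip-style single-token swap: it takes the gradient of the loss with respect to the input embeddings and selects the position/replacement pair that a first-order approximation predicts will increase the loss the most. The generic bert-base-uncased classifier from Hugging Face and the hotflip_one_token helper are illustrative assumptions; they are not the ExPred model or code evaluated in this thesis.

# A minimal sketch of a HotFlip-style, gradient-based single-token swap.
# Assumptions not taken from the thesis: a generic Hugging Face BERT
# classifier stands in for ExPred, and its classification head is
# untrained here, so the output is illustrative only.
import torch
from transformers import BertForSequenceClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
model.eval()

def hotflip_one_token(text: str, label: int) -> str:
    """Swap the one token whose replacement most increases the loss,
    scored with the first-order approximation used by HotFlip."""
    enc = tokenizer(text, return_tensors="pt")
    input_ids = enc["input_ids"]

    # Embed the input manually so we can take gradients w.r.t. it.
    embed_matrix = model.get_input_embeddings().weight      # (vocab, dim)
    inputs_embeds = embed_matrix[input_ids].detach().requires_grad_(True)

    loss = model(inputs_embeds=inputs_embeds,
                 attention_mask=enc["attention_mask"],
                 labels=torch.tensor([label])).loss
    loss.backward()
    grad = inputs_embeds.grad[0]                            # (seq_len, dim)

    with torch.no_grad():
        # First-order estimate of the loss change when position i is
        # flipped to word w: (e_w - e_i) . grad_i, for all i and w at once.
        scores = grad @ embed_matrix.T                      # (seq_len, vocab)
        scores -= (grad * inputs_embeds[0]).sum(dim=1, keepdim=True)

        # Never "swap" a token for itself, and leave [CLS]/[SEP] alone.
        seq_len = input_ids.size(1)
        scores[torch.arange(seq_len), input_ids[0]] = -float("inf")
        scores[0] = scores[-1] = -float("inf")

        pos = scores.max(dim=1).values.argmax().item()
        new_id = int(scores[pos].argmax())

    flipped = input_ids.clone()
    flipped[0, pos] = new_id
    return tokenizer.decode(flipped[0], skip_special_tokens=True)

print(hotflip_one_token("the earth orbits the sun", label=1))

As in HotFlip, a full attack would apply such swaps greedily or with beam search until the model's prediction flips; the single-swap version above only illustrates the gradient-based scoring step.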