Uncovering phishing attacks using principles of persuasion analysis

Journal Article (2024)
Author(s)

Lázaro Bustio-Martínez (Universidad Iberoamericana, Optics and Electronic)

Vitali Herrera-Semenets (Advanced Technologies Application Center)

Juan-Luis García-Mendoza (Université Sorbonne Paris Nord)

Miguel Ángel Álvarez-Carmona (Centro de Investigacion en Matematicas, CIMAT)

Jorge Ángel González-Ordiano (Universidad Iberoamericana)

Luis Zúñiga-Morales (Universidad Iberoamericana)

José Emilio Quiróz-Ibarra (Universidad Iberoamericana)

Pedro Antonio Santander-Molina (Pontificia Universidad Católica de Valparaíso)

Jan van den Berg (TU Delft - Cyber Security)

Research Group
Cyber Security
DOI related publication
https://doi.org/10.1016/j.jnca.2024.103964
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Cyber Security
Volume number
230
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

With the rising of Internet in early ’90s, many fraudulent activities have migrated from physical to digital: one of them is phishing. Phishing is a deceptive practice focused on exploiting the human factor, which is the most vulnerable aspect of any security process. In this scam, social engineering techniques are extensively utilized, specifically focusing on the principles of persuasion, to deceive individuals into disclosing sensitive information or engaging in malicious actions. This research explores the use of message subjectivity for detecting phishing attacks. It does so by assessing the impact of various data representations and classifiers on automatically identifying principles of persuasion. Furthermore, it investigates how these detected principles of persuasion can be leveraged for identifying phishing attacks. The experiments conducted revealed that there is no universal solution for data representation and classifier selection to effectively detect all principles of persuasion. Instead, a tailored combination of data representation and classifiers is required for detecting each principle. The Machine Learning models created automatically detect principles of persuasion with confidence levels ranging from 0.7306 to 0.8191 for AUC-ROC. Next, principles of persuasion detected are used for phishing detection. This study also emphasizes the need for user-friendly and comprehensible models. To validate the proposal presented, several families of classifiers were tested, but among all of them, tree-based models (and Random Forest in particular) stand out as preferred option. These models achieve similar level of effectiveness as alternative methods while offering improved clarity and user-friendliness, with an AUC-ROC of 0.859842.

Files

License info not available

Download not available