A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models

Review (2024)
Author(s)

Erik Derner (ELLIS Alicante, Czech Technical University)

Kristina Batistic (Independent researcher)

Jan Zahalka (Czech Technical University)

Robert Babuška (TU Delft - Learning & Autonomous Control, Czech Technical University)

Research Group
Learning & Autonomous Control
DOI
https://doi.org/10.1109/ACCESS.2024.3450388
Publication Year
2024
Language
English
Volume number
12
Pages (from-to)
126176-126187
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

As large language models (LLMs) permeate an increasing number of applications, assessing their associated security risks becomes ever more necessary. The potential for exploitation by malicious actors, ranging from disinformation to data breaches and reputation damage, is substantial. This paper addresses a gap in current research by focusing specifically on the security risks posed by LLMs within the prompt-based interaction scheme, which extends beyond the widely covered ethical and societal implications. Our work proposes a taxonomy of security risks along the user-model communication pipeline and categorizes the attacks by target and attack type alongside the commonly used confidentiality, integrity, and availability (CIA) triad. The taxonomy is reinforced with specific attack examples to showcase the real-world impact of these risks. Through this taxonomy, we aim to inform the development of robust and secure LLM applications, enhancing their safety and trustworthiness.
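
For illustration only, the sketch below shows one possible way to represent such a taxonomy as a data structure in Python: each attack entry is classified by its target in the user-model pipeline, its attack type, and the CIA property it threatens. The class names, field names, and example entries are hypothetical and are not taken from the paper; the actual taxonomy and its categories are defined in the publication itself.

```python
from dataclasses import dataclass
from enum import Enum


class Target(Enum):
    """Where in the user-model communication pipeline the attack is aimed (illustrative values)."""
    USER = "user"
    MODEL = "model"
    THIRD_PARTY = "third party"


class CIAProperty(Enum):
    """Security property of the CIA triad that the attack threatens."""
    CONFIDENTIALITY = "confidentiality"
    INTEGRITY = "integrity"
    AVAILABILITY = "availability"


@dataclass
class AttackEntry:
    """One taxonomy entry: an attack classified by target, attack type, and CIA property."""
    name: str
    target: Target
    attack_type: str          # e.g. "prompt injection", "data extraction" (illustrative labels)
    cia_property: CIAProperty


# Hypothetical example entries; the paper's taxonomy defines the authoritative categories.
taxonomy = [
    AttackEntry("training data extraction", Target.MODEL,
                "prompt-based extraction", CIAProperty.CONFIDENTIALITY),
    AttackEntry("prompt injection via retrieved content", Target.USER,
                "prompt injection", CIAProperty.INTEGRITY),
]

for entry in taxonomy:
    print(f"{entry.name}: target={entry.target.value}, "
          f"type={entry.attack_type}, CIA={entry.cia_property.value}")
```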