R. Baumgartner | TU Delft Repository

PDFA Distillation via String Probability Queries

Other (2024) - R. Baumgartner (author) , Sicco Verwer (author)

Probabilistic deterministic finite automata (PDFA) are discrete event systems modeling conditional probabilities over languages: Given an already seen sequence of tokens they return the probability of tokens of interest to appear next. These types of models have gained interest i ...

PDFA Distillation with Error Bound Guarantees

Conference paper (2024) - R. Baumgartner (author) , Sicco Verwer (author)

Active learning algorithms to infer probabilistic finite automata (PFA) have gained interest recently, due to their ability to provide surrogate models for some types of neural networks. However, recent approaches either cannot guarantee determinism, which makes the automaton har ...

Learning state machines from data streams: A generic strategy and an improved heuristic

Conference paper (2023) - R. Baumgartner (author) , Sicco Verwer (author)

State machines models are models that simulate the behavior of discrete event systems, capable of representing systems such as software systems, network interactions, and control systems, and have been researched extensively. The nature of most learning algorithms however is the ...

SoK

Explainable Machine Learning for Computer Security Applications

Conference paper (2023) - Azqa Nadeem (author) , D.A. Vos (author) , C.S. Cao (author) , Luca Pajola (author) , S. Dieck (author) , R. Baumgartner (author) , S.E. Verwer (author)

Explainable Artificial Intelligence (XAI) aims to improve the transparency of machine learning (ML) pipelines. We systematize the increasingly growing (but fragmented) microcosm of studies that develop and utilize XAI methods for defensive and offensive cybersecurity tasks. We id ...

Learning state machines via efficient hashing of future traces

Other (2022) - R. Baumgartner (author) , Sicco Verwer (author)

State machines are popular models to model and visualize discrete systems such as software systems, and to represent regular grammars. Most algorithms that passively learn state machines from data assume all the data to be available from the beginning and they load this data into ...