QV

Q.W. Voet

info

Please Note

1 records found

Do Agent Architectures Matter for Crypto CTFs?

An Empirical Evaluation of LLM Architectures on the AICrypto Benchmark

Large Language Models (LLMs) demonstrate gold-medal performance in pure mathematics but continue to struggle in professional Capture-The-Flag (CTF) cybersecurity competitions, where the goal is to obtain a flag string as proof. While models can solve textbook equations, the itera ...