The advent of transformer architectures revolutionized natural language processing, particularly with the popularity of decoder-only transformers for text generation tasks like GPT models. However, the autoregressive nature of these models limits their inference speed, which is crucial for real-time applications and resource-constrained environments. Memory bandwidth is a significant bottleneck in autoregressive decoding, where repeatedly loading large key and value tensors dominates the cost. The Multi-Query Attention (MQA) architecture was proposed to reduce memory access by shrinking the key-value cache, improving inference speed at the cost of generation quality. Grouped-Query Attention (GQA) was introduced to mitigate this quality decline, serving as an interpolation between Multi-Head Attention (MHA) and MQA. We explore the trade-offs between inference speed and quality in decoder-only models by experimenting with various proportions of query groups relative to attention heads during pre-training. Additionally, we investigate the impact of reducing the size of key and value vectors compared to GQA and explore a hybrid method combining GQA with shortened key-value vectors. This study aims to expand the list of possible trade-offs and help select an optimal architecture based on specific needs.
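To make the MHA/GQA/MQA interpolation concrete, the following is a minimal sketch of grouped-query attention, assuming PyTorch; the function name, tensor shapes, and head counts are illustrative and not the implementation used in this work. Setting num_kv_heads equal to num_heads recovers MHA, while num_kv_heads = 1 recovers MQA; intermediate values give GQA, with the key-value cache shrinking in proportion to num_kv_heads.

    # Minimal grouped-query attention sketch (assumes PyTorch >= 2.0).
    import torch
    import torch.nn.functional as F

    def grouped_query_attention(q, k, v, num_heads, num_kv_heads):
        # q: (batch, seq, num_heads * head_dim)
        # k, v: (batch, seq, num_kv_heads * head_dim) -- the smaller KV cache
        b, s, _ = q.shape
        head_dim = q.shape[-1] // num_heads
        q = q.view(b, s, num_heads, head_dim).transpose(1, 2)      # (b, h, s, d)
        k = k.view(b, s, num_kv_heads, head_dim).transpose(1, 2)   # (b, g, s, d)
        v = v.view(b, s, num_kv_heads, head_dim).transpose(1, 2)
        # Each group of query heads shares one key/value head.
        group_size = num_heads // num_kv_heads
        k = k.repeat_interleave(group_size, dim=1)                  # (b, h, s, d)
        v = v.repeat_interleave(group_size, dim=1)
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return out.transpose(1, 2).reshape(b, s, num_heads * head_dim)

    # Hypothetical example: 8 query heads sharing 2 key/value heads.
    q = torch.randn(1, 16, 8 * 64)
    k = torch.randn(1, 16, 2 * 64)
    v = torch.randn(1, 16, 2 * 64)
    print(grouped_query_attention(q, k, v, num_heads=8, num_kv_heads=2).shape)
    # torch.Size([1, 16, 512])

In this sketch, only k and v would be stored in the decoding cache, so moving from 8 key-value heads to 2 cuts the cached tensor size by a factor of four, which is the memory-bandwidth saving that motivates MQA and GQA.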