LK
L. Kesküll
1 records found
1
Pushing the Limits of the Compressive Memory Introduced in Infini-Attention
Architectural Decisions for Language Modelling with (Small) Transformers
Transformers are a type of neural network archi- tecture used in natural language processing. They excel in tasks such as translation, text generation, and language modeling by capturing long-range de- pendencies. Increasing input sequence length en- hances performance but at a h
...