Exploring Neural IR in Europeana
Suhaib Basir (Infinite Analytics)
Mónica Marrero (Europeana Foundation)
Julián Urbano (TU Delft - Multimedia Computing)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Europeana is the leading digital library of Europe’s cultural heritage, providing access to over 60 million items in more than 40 languages. Its search infrastructure relies on Solr and BM25 over the items’ metadata, thus depending heavily on keyword matching and resource-intensive treatments such as translation and multilingual metadata enrichment. This paper explores the application of Neural Information Retrieval (NIR) approaches in Europeana, focusing on multilinguality. We created a dataset for comparative evaluation, and show that while NIR demonstrates strong potential for multilingual search, challenges remain regarding its performance, particularly for entity-centric queries. This work also highlights the need for more reliable evaluation data.