Search results

Searched for: collection%253Air

(1 - 2 of 2)

document: Statistical Significance Testing in Information Retrieval: An Empirical Analysis of Type I, Type II and Type III Errors
Urbano, Julián (author), De Lima, H.A. (author), Hanjalic, A. (author)
Statistical significance testing is widely accepted as a means to assess how well a difference in effectiveness reflects an actual difference between systems, as opposed to random noise because of the selection of topics. According to recent surveys on SIGIR, CIKM, ECIR and TOIS papers, the t-test is the most popular choice among IR researchers....
conference paper 2019

document: A New Perspective on Score Standardization
Urbano, Julián (author), De Lima, H.A. (author), Hanjalic, A. (author)
In test collection based evaluation of IR systems, score standardization has been proposed to compare systems across collections and minimize the effect of outlier runs on specific topics. The underlying idea is to account for the difficulty of topics, so that systems are scored relative to it. Webber et al. first proposed standardization...
conference paper 2019

Source URL (retrieved on 2024-05-21 17:45): https://repository.tudelft.nl/islandora/search/collection%253Air?collection=research&%3Bf%5B0%5D=mods_name_personal_author_namePart_family_ss%3A%22Savenije%22&%3Bdisplay=tud_default&f%5B0%5D=mods_subject_topic_ss%3A%22Simulation%22&f%5B1%5D=mods_subject_topic_ss%3A%22Type%5C%20I%5C%20and%5C%20Type%5C%20II%5C%20errors%22