Optimizing Pharmacotherapy Exam Items and Assessing Student Proficiency

A Comparative Analysis between Item Response Theory and Classical Test Theory

Master Thesis (2023)
Author(s)

P. Nie (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Nestor Parolya – Mentor (TU Delft - Statistics)

N.V. Budko – Graduation committee member (TU Delft - Numerical Analysis)

Tim Preijers – Graduation committee member (Erasmus MC)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Ping Nie
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Ping Nie
Graduation Date
24-11-2023
Awarding Institution
Delft University of Technology
Programme
Applied Mathematics
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper primarily employs Item Response Theory (IRT) to estimate item characteristics and the proficiency levels of students as reflected in the exam results. The process includes the application of algorithms for item characteristic parameter estimation and the utilization of statistical techniques for item selection from an item pool. Additionally, Classical Test Theory (CTT) is also used to gain insights into the item characteristics. The mathematical frameworks behind every algorithm and model will be introduced in detail. Our ultimate objective is to create an item bank that unifies all items from various exam versions onto a common scale. The technique to put items from different versions on the same scale is the test equating technique, and it will also be described in detail.

After a comprehensive analysis to determine the appropriate model within the framework of Item Response Theory (IRT), the Rasch model has been selected for all versions of the exams. One item from version 9 has been removed from the item pool due to its unsatisfactory item fit. Based on the Item-Map plot, the conclusion can be made that all three exam versions appear to be relatively easy for students to answer, as evidenced by the high bar of the exams. Ultimately, the item bank has been successfully established through the application of two test-equating methods, with results indicating its reliability.

Files

Master_Thesis_Ping_Nie.pdf
(pdf | 0.599 Mb)
License info not available