Feature Engineering for Second Language Acquisition Modeling

Conference Paper (2018)
Author(s)

Guanliang Chen (TU Delft - Web Information Systems)

Claudia Hauff (TU Delft - Web Information Systems)

Geert-Jan Houben (TU Delft - Web Information Systems)

Research Group
Web Information Systems
More Info
expand_more
Publication Year
2018
Language
English
Research Group
Web Information Systems
Pages (from-to)
356-364
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Knowledge tracing serves as a keystone in delivering personalized education. However, few works attempted to model students’ knowledge state in the setting of Second Language Acquisition. The Duolingo Shared Task on Second Language Acquisition Modeling (Settles et al., 2018) provides students’ trace data that we extensively analyze and engineer features from for the task of predicting whether a student will correctly solve a vocabulary exercise. Our analyses of students’ learning traces reveal that factors like exercise format and engagement impact their exercise performance to a large extent. Overall, we extracted 23 different features as input to a Gradient Tree Boosting framework, which resulted in an AUC score of between 0.80 and 0.82 on the official test set.

Files

Chen.slam18.pdf
(pdf | 0.408 Mb)
License info not available