Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis

Journal Article (2018)
Author(s)

Ahmad Alwosheel (TU Delft - Technology, Policy and Management)

Sander van Cranenburgh (TU Delft - Technology, Policy and Management)

Caspar G. Chorus (TU Delft - Technology, Policy and Management)

Research Group
Transport and Logistics
DOI related publication
https://doi.org/10.1016/j.jocm.2018.07.002 Final published version
More Info
expand_more
Publication Year
2018
Language
English
Research Group
Transport and Logistics
Volume number
28
Pages (from-to)
167-182
Downloads counter
431
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Artificial Neural Networks (ANNs) are increasingly used for discrete choice analysis. But, at present, it is unknown what sample size requirements are appropriate when using ANNs in this particular context. This paper fills this knowledge gap: we empirically establish a rule-of-thumb for ANN-based discrete choice analysis based on analyses of synthetic and real data. To investigate the effect of complexity of the data generating process on the minimum required sample size, we conduct extensive Monte Carlo analyses using a series of different model specifications with different levels of model complexity, including RUM and RRM models, with and without random taste parameters. Based on our analyses we advise to use a minimum sample size of fifty times the number of weights in the ANN; it should be noted, that the number of weights is generally much larger than the number of parameters in a discrete choice model. This rule-of-thumb is considerably more conservative than the rule-of-thumb that is most often used in the ANN community, which advises to use at least ten times the number of weights.

Files

1_s2.0_S1755534518300058_main.... (pdf)
(pdf | 1.94 Mb)
- Embargo expired in 12-01-2019
License info not available