Credit scoring for small medium enterprises using transaction data

None, None

Credit scoring for small medium enterprises using transaction data

Master Thesis (2018)

Author(s)

W.J. Verkade (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

P. Cirillo – Mentor

G. Jongbloed – Mentor

Roald Waaijer – Mentor

Faculty

Electrical Engineering, Mathematics and Computer Science

Monitoring Credit scoring Transaction data Default classification Relational neighbour Hierarchical logistic regression

To reference this document use:

https://resolver.tudelft.nl/uuid:6ed89f2f-2c5f-4b85-859b-47a244da609b

More Info

expand_more

Publication Year

2018

Language

English

Graduation Date

17-04-2018

Awarding Institution

Delft University of Technology

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Managing credit risk is a vital part of financial institutions. While the research into credit risk models is extensive, transaction data is a relatively untapped data source in these models. We investigate the explanatory value of transaction data for the Bank by developing default classification models for their small medium enterprises (SME) portfolio. We develop measures that summarize the transaction behaviour on a client level for different time windows. Variables that are included into traditional models are positive income shocks, balance returns, zero transactions (indicating rejected direct debits), and relative cash
expenditure. By combining these variables with client characteristics and loan behaviour information, we develop a hierarchical logistic regression model which has a good overall classification performance, reflected by an area under curve (AUC) of 0.850. Tolerating 2 out of 3 false warnings, the model identifies more than 50% of the defaults on average. We investigate relational classification methods, which classify clients according to similarity in terms of their transaction behaviour. The relational neighbour classifier achieves an AUC
of 0.768, using similarity between to clients that are determined according to a flexible weight function of the number of shared entities. By combining this approach with the aggregated transaction variables, we develop a model which is solely based on transaction data. The strong performance of this model is reflected by an AUC of 0.804, illustrating the effectiveness of transaction data in default classification.

Files

Credit_scoring_for_SME_using_t... (pdf)

(pdf | 0.955 Mb)

License info not available