Data science as knowledge creation a framework for synergies between data analysts and domain professionals

Journal Article (2021)
Author(s)

H.G. van der Voort (TU Delft - Organisation & Governance)

Sabine van Bulderen (IVO Rechtspraak)

Scott Cunningham (University of Strathclyde)

Marijn Janssen (TU Delft - Information and Communication Technology)

Research Group
Organisation & Governance
Copyright
© 2021 H.G. van der Voort, Sabine van Bulderen, Scott Cunningham, M.F.W.H.A. Janssen
DOI related publication
https://doi.org/10.1016/j.techfore.2021.121160
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 H.G. van der Voort, Sabine van Bulderen, Scott Cunningham, M.F.W.H.A. Janssen
Research Group
Organisation & Governance
Volume number
173
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The road from data generation to data use is commonly approached as a data-driven, functional process in which domain expertise is integrated as an afterthought. In this contribution we complement this functional view with an institutional view, that takes data analysis and domain professionalism as complementary (yet fallible) knowledge sources. We developed a framework that identifies and amplifies synergies between data analysts and domain professionals instead of taking one of them (i.e. data analytics) at the centre of the analytical process. The framework combines the often-cited CRISP-DM framework with a knowledge creation framework. The resulting framework is used in a data science project at a Dutch inspectorate that seeks to use data for risk-based inspection. The findings show first support of our framework. They also show that whereas more complex models have a higher predictive power, simpler models are sometimes preferred as they have the potential to create more synergies between inspectors and data analyst. Another issue driven by the integrated framework is about who of the involved actors should own the predictive model: data analysts or inspectors.