Print Email Facebook Twitter Questions for Data Scientists in Software Engineering: A Replication Title Questions for Data Scientists in Software Engineering: A Replication Author Huijgens, H.K.M. (TU Delft Software Engineering) Rastogi, A. (TU Delft Software Engineering) Mulders, E.A. (TU Delft Electrical Engineering, Mathematics and Computer Science) Gousios, G. (TU Delft Software Engineering) van Deursen, A. (TU Delft Software Technology) Contributor Devanbu, Prem (editor) Cohen, Myra (editor) Zimmermann, Thomas (editor) Faculty Electrical Engineering, Mathematics and Computer Science Department Software Technology Date 2020 Abstract In 2014, a Microsoft study investigated the sort of questions that data science applied to software engineering should answer. This resulted in 145 questions that developers considered relevant for data scientists to answer, thus providing a research agenda to the community. Fast forward to five years, no further studies investigated whether the questions from the software engineers at Microsoft hold for other software companies, including software-intensive companies with different primary focus (to which we refer as software-defined enterprises). Furthermore, it is not evident that the problems identified five years ago are still applicable, given the technological advances in software engineering. This paper presents a study at ING, a software-defined enterprise in banking in which over 15,000 IT staff provides in-house software solutions. This paper presents a comprehensive guide of questions for data scientists selected from the previous study at Microsoft along with our current work at ING. We replicated the original Microsoft study at ING, looking for questions that impact both software companies and software-defined enterprises and continue to impact software engineering. We also add new questions that emerged from differences in the context of the two companies and the five years gap in between. Our results show that software engineering questions for data scientists in the software-defined enterprise are largely similar to the software company, albeit with exceptions. We hope that the software engineering research community builds on the new list of questions to create a useful body of knowledge. Subject Data ScienceSoftware AnalyticsSoftware Engineering To reference this document use: http://resolver.tudelft.nl/uuid:3585b229-84a0-4c26-b52f-e18c7a79782d DOI https://doi.org/10.1145/3368089.3409717 Publisher Association for Computing Machinery (ACM), New York, NY, USA ISBN 978-1-4503-7043-1 Source Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering Series ESEC/FSE 2020 Part of collection Institutional Repository Document type conference paper Rights © 2020 H.K.M. Huijgens, A. Rastogi, E.A. Mulders, G. Gousios, A. van Deursen Files PDF esec_fse_questions_with_a ... pendix.pdf 1.17 MB Close viewer /islandora/object/uuid:3585b229-84a0-4c26-b52f-e18c7a79782d/datastream/OBJ/view