On the Difficulty of Identifying Incident-Inducing Changes

Conference Paper (2024)

Author(s)

Eileen Kapel (ING Analytics, TU Delft - Software Engineering)

Luis Cruz (TU Delft - Software Engineering)

D. Spinellis (TU Delft - Software Engineering)

Arie van Deursen (TU Delft - Software Engineering)

Research Group

Software Engineering

DOI related publication

https://doi.org/10.1145/3639477.3639755

Change management, Traceability, Incident management

To reference this document use:

https://resolver.tudelft.nl/uuid:53be214a-4e74-4c13-9aca-e9381e35b0fe

Publication Year

2024

Language

English

Pages (from-to)

36-46

ISBN (electronic)

9798400705007

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Effective change management is crucial for businesses heavily reliant on software and services to minimise incidents induced by changes. Unfortunately, in practice it is often difficult to effectively use artificial intelligence for IT Operations (AIOps) to enhance service management, primarily due to inadequate data quality. Establishing reliable links between changes and the induced incidents is crucial for identifying patterns, improving change deployment, identifying high-risk changes, and enhancing incident response. In this research, we investigate the enhancement of traceability between changes and incidents through AIOps methods. Our approach involves a close examination of incident-inducing changes, the replication of methods linking incidents to the changes that caused them, introducing an adapted method, and demonstrating its results using historical data and practical evaluations. Our findings reveal that incident-inducing changes exhibit different characteristics dependent on context. Furthermore, a significant disparity exists between assessments based on historical data and real-world observation, with an increased occurrence of false positives when identifying links between unlabeled changes and incidents. This study highlights the complex nature of identifying links between changes and incidents, emphasising the contextual influence on AIOps method effectiveness. While we are actively working on improving the quality of current data through AIOps approaches, it remains apparent that further measures are necessary to address issues like data imbalances and promote a postmortem culture that brings attention to the value of properly administrating tickets. A better overview of change failure rates contributes to improved risk compliance and reliable change management.

Files

3639477.3639755.pdf

(pdf | 0.847 Mb)

- Embargo expired on 02-12-2024

License info not available