Application of clustering algorithms for dimensionality reduction in infrastructure resilience prediction models

Journal Article (2024)
Author(s)

S. Balakrishnan (TU Delft - Technology, Policy and Management)

Beatrice Cassottana (Singapore-ETH Centre)

Arun Verma (National University of Singapore)

Research Group
Transport and Logistics
DOI related publication
https://doi.org/10.1080/15732479.2024.2366958 Final published version
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Transport and Logistics
Pages (from-to)
1-13
Downloads counter
137
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Recent studies increasingly adopt simulation-based machine learning (ML) models to analyse critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the ’curse of dimensionality’. A clustering-based method is presented that simultaneously minimises the problem of high-dimensionality and improves the prediction accuracy of ML models developed for resilience analysis in large-scale interdependent infrastructure networks. The methodology has three parts: (a) generation of simulation dataset, (b) network component clustering, and (c) dimensionality reduction and development of prediction models. First, an interdependent infrastructure simulation model simulates the network-wide consequences of various disruptive events. The component-level features are extracted from the simulated data. Next, clustering algorithms are used to derive the cluster-level features by grouping component-level features based on their topological and functional characteristics. Finally, ML algorithms are used to develop models that predict the network-wide impacts of disruptive events using the cluster-level features. The applicability of the method is demonstrated using an interdependent power-water-transport testbed. The proposed method can be used to develop decision-support tools for post-disaster recovery of infrastructure networks.