Application of clustering algorithms for dimensionality reduction in infrastructure resilience prediction models

Journal Article (2024)
Author(s)

Srijith Balakrishnan (TU Delft - Transport and Logistics)

Beatrice Cassottana (Singapore-ETH Centre)

Arun Verma (National University of Singapore)

Research Group
Transport and Logistics
DOI related publication
https://doi.org/10.1080/15732479.2024.2366958
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Transport and Logistics
Pages (from-to)
1-13
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Recent studies increasingly adopt simulation-based machine learning (ML) models to analyse critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the ’curse of dimensionality’. A clustering-based method is presented that simultaneously minimises the problem of high-dimensionality and improves the prediction accuracy of ML models developed for resilience analysis in large-scale interdependent infrastructure networks. The methodology has three parts: (a) generation of simulation dataset, (b) network component clustering, and (c) dimensionality reduction and development of prediction models. First, an interdependent infrastructure simulation model simulates the network-wide consequences of various disruptive events. The component-level features are extracted from the simulated data. Next, clustering algorithms are used to derive the cluster-level features by grouping component-level features based on their topological and functional characteristics. Finally, ML algorithms are used to develop models that predict the network-wide impacts of disruptive events using the cluster-level features. The applicability of the method is demonstrated using an interdependent power-water-transport testbed. The proposed method can be used to develop decision-support tools for post-disaster recovery of infrastructure networks.