Assessing the Optimality of Decentralized Inspection and Maintenance Policies for Stochastically Degrading Engineering Systems
Prateek Bhustali (TU Delft - Architectural Technology)
C. Andriotis (TU Delft - Architectural Technology)
Abstract
Long-term inspection and maintenance (I&M) planning, a multi-stage stochastic optimization problem, can be efficiently formulated as a partially observable Markov decision process (POMDP). However, within this context, single-agent approaches do not scale well for large multi-component systems, since the joint state, action, and observation spaces grow exponentially with the number of components. To alleviate this curse of dimensionality, cooperative decentralized approaches, known as decentralized POMDPs, are often adopted and solved using multi-agent deep reinforcement learning (MADRL) algorithms. This paper examines the performance of centralized versus decentralized MADRL formulations for I&M planning of multi-component systems. To this end, we set up a comprehensive computational experimental program focused on k-out-of-n system configurations, a common and broadly applicable archetype of deteriorating engineering systems, highlighting the strengths and pathologies of MADRL when optimizing global returns under varying degrees of decentralization relaxation.
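
The sketch below is not from the paper; it is a minimal illustration of two points made in the abstract: the exponential growth of the joint state and action spaces with the number of components, and the k-out-of-n structure function used to define system survival. All numbers and names (component counts, damage states, action sets) are hypothetical placeholders.

```python
from itertools import product

# Hypothetical per-component model (not from the paper):
N_COMPONENTS = 5      # n components in the system
K_REQUIRED = 3        # system functions if >= k components function
DAMAGE_STATES = 4     # per-component damage levels: 0 (intact) .. 3 (failed)
ACTIONS = 3           # per-component actions: do-nothing, repair, replace

# Joint state/action spaces grow exponentially with the number of components,
# which is the scaling bottleneck for single-agent (centralized) POMDP solvers.
joint_states = DAMAGE_STATES ** N_COMPONENTS
joint_actions = ACTIONS ** N_COMPONENTS
print(f"|S_joint| = {joint_states}, |A_joint| = {joint_actions}")

def system_functions(component_states, k=K_REQUIRED, failed=DAMAGE_STATES - 1):
    """k-out-of-n structure function: the system survives if at least k
    components are not in the failed damage state."""
    working = sum(1 for s in component_states if s != failed)
    return working >= k

# Enumerating the joint state space is only feasible for tiny n, which is
# exactly why decentralized (per-component agent) formulations are attractive.
survival_states = sum(
    system_functions(s) for s in product(range(DAMAGE_STATES), repeat=N_COMPONENTS)
)
print(f"{survival_states} of {joint_states} joint states are system-functional")
```

For the small hypothetical numbers above the enumeration runs instantly, but the same counts reach millions of joint states for n around 10, which is the dimensionality argument motivating the decentralized POMDP formulations studied in the paper.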