Inference and maintenance planning of monitored structures through Markov chain Monte Carlo and deep reinforcement learning

Conference Paper (2023)
Author(s)

Christos Lathourakis (TU Delft - Mechanics and Physics of Structures)

C.P. Andriotis (TU Delft - Architectural Technology)

Alice Cicirello (TU Delft - Mechanics and Physics of Structures)

Research Group
Mechanics and Physics of Structures
Publication Year
2023
Language
English
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

A key computational challenge in maintenance planning for deteriorating structures is to concurrently secure (i) optimality of decisions over long planning horizons, and (ii) accuracy of real-time parameter updates in high-dimensional stochastic spaces. Both are often encumbered by the presence of discretized continuous-state models that describe the underlying deterioration processes, and by the emergence of combinatorial decision spaces in multi-component environments. Recent advances in Deep Reinforcement Learning (DRL) formulations for inspection and maintenance planning provide powerful frameworks to efficiently handle near-optimal decision-making in immense state and action spaces without the need for offline system knowledge. Moreover, Bayesian Model Updating (BMU), aided by advanced sampling methods, allows us to address the dimensionality and accuracy issues associated with discretized degradation processes. Building upon these concepts, in this work we develop a joint framework coupling DRL, specifically deep Q-learning and actor-critic algorithms, with BMU through Hamiltonian Monte Carlo. Single- and multi-component systems are examined, and it is shown that the proposed methodology yields reduced lifelong maintenance costs and policies of higher fidelity and sophistication compared to traditional optimized time- and condition-based maintenance strategies.
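The abstract describes the coupling only at a high level; the sketch below is a minimal, hypothetical illustration of such a loop, not the authors' implementation. It updates a one-dimensional deterioration-rate parameter with a basic Hamiltonian Monte Carlo transition and feeds the inferred rate to a small tabular Q-learning agent standing in for the deep Q-network. All model choices here (Gaussian prior and measurement noise, five damage levels, repair and failure costs) are assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Bayesian Model Updating via Hamiltonian Monte Carlo (1-D illustration) ---
# Assumed model: measurements y ~ N(theta, sigma_y), prior theta ~ N(mu0, sigma0).
mu0, sigma0, sigma_y = 0.05, 0.02, 0.01   # hypothetical prior and noise levels

def log_post_and_grad(theta, y):
    """Log posterior (up to a constant) and its gradient for the Gaussian model."""
    lp = -0.5 * ((theta - mu0) / sigma0) ** 2 - 0.5 * np.sum(((y - theta) / sigma_y) ** 2)
    grad = -(theta - mu0) / sigma0**2 + np.sum(y - theta) / sigma_y**2
    return lp, grad

def hmc_sample(theta, y, eps=1e-3, n_leapfrog=20):
    """One HMC transition: leapfrog integration followed by a Metropolis accept step."""
    p = rng.normal()
    lp, g = log_post_and_grad(theta, y)
    h0 = -lp + 0.5 * p**2                 # initial Hamiltonian
    th = theta
    p += 0.5 * eps * g                    # initial half step for momentum
    for _ in range(n_leapfrog):
        th += eps * p
        lp, g = log_post_and_grad(th, y)
        p += eps * g
    p -= 0.5 * eps * g                    # correct the last update back to a half step
    h1 = -lp + 0.5 * p**2
    return th if np.log(rng.uniform()) < h0 - h1 else theta

# --- Q-learning over a discretized damage state, driven by the inferred rate ---
# Hypothetical setup: 5 damage levels, actions {0: do nothing, 1: repair}.
n_states, n_actions, gamma = 5, 2, 0.95
repair_cost, failure_cost = 1.0, 20.0
Q = np.zeros((n_states, n_actions))

def step(state, action, rate):
    """Environment transition under the currently inferred deterioration rate."""
    if action == 1:                                        # repair resets the component
        return 0, -repair_cost
    nxt = min(state + rng.poisson(max(rate, 0.0) * 10), n_states - 1)
    return nxt, -(failure_cost if nxt == n_states - 1 else 0.0)

theta, alpha, eps_greedy = mu0, 0.1, 0.1
for episode in range(2000):
    state = 0
    # Assimilate new (simulated) monitoring data with a few HMC transitions.
    y = rng.normal(0.06, sigma_y, size=3)
    for _ in range(10):
        theta = hmc_sample(theta, y)
    for t in range(30):
        action = rng.integers(n_actions) if rng.uniform() < eps_greedy else int(np.argmax(Q[state]))
        nxt, reward = step(state, action, theta)
        Q[state, action] += alpha * (reward + gamma * Q[nxt].max() - Q[state, action])
        state = nxt

print("Greedy maintenance action per damage level:", Q.argmax(axis=1))
```

In the paper the tabular value function would be replaced by deep Q-learning or actor-critic networks, and the single scalar parameter by a high-dimensional deterioration model, but the alternation shown here (sample the posterior, then act and learn under the updated belief) conveys the basic structure of the coupling.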