Obey validity limits of data-driven models through topological data analysis and one-class classification

None, None; None, None; None, None; None, None; None, None

Obey validity limits of data-driven models through topological data analysis and one-class classification

Journal Article (2021)

Author(s)

Artur M. Schweidtmann (TU Delft - Applied Sciences, RWTH Aachen University)

Jana M. Weber (University of Cambridge)

Christian Wende (RWTH Aachen University)

Linus Netze (RWTH Aachen University)

Alexander Mitsos (RWTH Aachen University, JARA-FIT, Forschungszentrum Jülich)

Research Group

ChemE/Product and Process Engineering

Machine-learning Deterministic global optimization One-class support vector machine Persistent homology Topological data analysis

DOI related publication

https://doi.org/10.1007/s11081-021-09608-0 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:ad8a12b2-4d35-473b-91a1-fbcb61269475

More Info

expand_more

Publication Year

2021

Language

English

Research Group

ChemE/Product and Process Engineering

Issue number

2

Volume number

23

Pages (from-to)

855-876

Downloads counter

275

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Data-driven models are becoming increasingly popular in engineering, on their own or in combination with mechanistic models. Commonly, the trained models are subsequently used in model-based optimization of design and/or operation of processes. Thus, it is critical to ensure that data-driven models are not evaluated outside their validity domain during process optimization. We propose a method to learn this validity domain and encode it as constraints in process optimization. We first perform a topological data analysis using persistent homology identifying potential holes or separated clusters in the training data. In case clusters or holes are identified, we train a one-class classifier, i.e., a one-class support vector machine, on the training data domain and encode it as constraints in the subsequent process optimization. Otherwise, we construct the convex hull of the data and encode it as constraints. We finally perform deterministic global process optimization with the data-driven models subject to their respective validity constraints. To ensure computational tractability, we develop a reduced-space formulation for trained one-class support vector machines and show that our formulation outperforms common full-space formulations by a factor of over 3000, making it a viable tool for engineering applications. The method is ready-to-use and available open-source as part of our MeLOn toolbox (https://git.rwth-aachen.de/avt.svt/public/MeLOn).

Files

Schweidtmann2021_Article_ObeyV... (pdf)

(pdf | 2.34 Mb)