Flowsheet generation through hierarchical reinforcement learning and graph neural networks

Journal Article (2022)
Author(s)

L. Stops (TU Delft - ChemE/Product and Process Engineering)

Roel Leenhouts (Student TU Delft)

Q. Gao (TU Delft - ChemE/Product and Process Engineering)

A.M. Schweidtmanna (TU Delft - ChemE/Product and Process Engineering)

Research Group
ChemE/Product and Process Engineering
Copyright
© 2022 L. Stops, Roel Leenhouts, Q. Gao, A.M. Schweidtmann
DOI related publication
https://doi.org/10.1002/aic.17938
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 L. Stops, Roel Leenhouts, Q. Gao, A.M. Schweidtmann
Research Group
ChemE/Product and Process Engineering
Issue number
1
Volume number
69
Pages (from-to)
14
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Process synthesis experiences a disruptive transformation accelerated by artificial intelligence. We propose a reinforcement learning algorithm for chemical process design based on a state-of-the-art actor-critic logic. Our proposed algorithm represents chemical processes as graphs and uses graph convolutional neural networks to learn from process graphs. In particular, the graph neural networks are implemented within the agent architecture to process the states and make decisions. We implement a hierarchical and hybrid decision-making process to generate flowsheets, where unit operations are placed iteratively as discrete decisions and corresponding design variables are selected as continuous decisions. We demonstrate the potential of our method to design economically viable flowsheets in an illustrative case study comprising equilibrium reactions, azeotropic separation, and recycles. The results show quick learning in discrete, continuous, and hybrid action spaces. The method is predestined to include large action-state spaces and an interface to process simulators in future research.