Flowsheet Recognition using Deep Convolutional Neural Networks

Book Chapter (2022)
Author(s)

Lukas Balhorn (TU Delft - ChemE/Product and Process Engineering)

Qinghe Gao (TU Delft - ChemE/Product and Process Engineering)

Dominik Goldstein (RWTH Aachen University)

Artur Schweidtmann (TU Delft - ChemE/Product and Process Engineering)

Research Group
ChemE/Product and Process Engineering
Copyright
© 2022 L. Schulze Balhorn, Q. Gao, Dominik Goldstein, A.M. Schweidtmann
DOI related publication
https://doi.org/10.1016/B978-0-323-85159-6.50261-X
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 L. Schulze Balhorn, Q. Gao, Dominik Goldstein, A.M. Schweidtmann
Research Group
ChemE/Product and Process Engineering
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
Pages (from-to)
1567-1572
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Flowsheets are the most important building blocks to define and communicate the structure of chemical processes. Gaining access to large data sets of machine-readable chemical flowsheets could significantly enhance process synthesis through artificial intelligence. A large number of these flowsheets are publicly available in the scientific literature and patents but hidden among innumerable other figures. Therefore, an automatic program is needed to recognize flowsheets. In this paper, we present a deep convolutional neural network (CNN) that can identify flowsheets within images from literature. We use a transfer learning approach to initialize the CNN's parameter. The CNN reaches an accuracy of 97.9% on an independent test set. The presented algorithm can be combined with publication mining algorithms to enable an autonomous flowsheet mining. This will eventually result in big chemical process databases.

Files

1_s2.0_B978032385159650261X_ma... (pdf)
(pdf | 2.38 Mb)
- Embargo expired in 01-07-2023
License info not available