Digitization of chemical process flow diagrams using deep convolutional neural networks

Journal Article (2023)
Author(s)

Maximilian F. Theisen (TU Delft - ChemE/Product and Process Engineering)

K.F. Nishizaki Flores (TU Delft - ChemE/Product and Process Engineering)

L. Schulze Balhorn (TU Delft - ChemE/Product and Process Engineering)

Artur Schweidtmanna (TU Delft - ChemE/Product and Process Engineering)

Research Group
ChemE/Product and Process Engineering
Copyright
© 2023 M.F. Theisen, K.F. Nishizaki Flores, L. Schulze Balhorn, A.M. Schweidtmann
DOI related publication
https://doi.org/10.1016/j.dche.2022.100072
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 M.F. Theisen, K.F. Nishizaki Flores, L. Schulze Balhorn, A.M. Schweidtmann
Research Group
ChemE/Product and Process Engineering
Volume number
6
Pages (from-to)
11
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Advances in deep convolutional neural networks led to breakthroughs in many computer vision applications. In chemical engineering, a number of tools have been developed for the digitization of Process and Instrumentation Diagrams. However, there is no framework for the digitization of process flow diagrams (PFDs). PFDs are difficult to digitize because of the large variability in the data, e.g., there are multiple ways to depict unit operations in PFDs. We propose a two-step framework for digitizing PFDs: (i) unit operations are detected using a deep learning powered object detection model, (ii) the connectivities between unit operations are detected using a pixel-based search algorithm. To ensure robustness, we collect and label over 1000 PFDs from diversified sources including various scientific journals and books. To cope with the high intra-class variability in the data, we define 47 distinct classes that account for different drawing styles of unit operations. Our algorithm delivers accurate and robust results on an independent test set. We report promising results for line and unit operation detection with an Average Precision at 50 percent (AP50) of 88% and an Average Precision (AP) of 68% for the detection of unit operations.