Machine Learning-Based Predictions of Henry Coefficients for Long-Chain Alkanes in One-Dimensional Zeolites

Application to Hydroisomerization

Journal Article (2025)
Author(s)

S. Sharma (Eindhoven University of Technology, TU Delft - Engineering Thermodynamics)

Ping Yang (University of Massachusetts Amherst)

Yachan Liu (University of Massachusetts Amherst)

K.R. Rossi (TU Delft - Team Kevin Rossi)

Peng Bai (University of Massachusetts Amherst)

Marcello Rigutto (Shell Global Solutions International B.V.)

Erik Zuidema (Shell Global Solutions International B.V.)

Umang Agarwal (Shell Global Solutions International B.V.)

Richard Baur (Shell Global Solutions International B.V.)

Sofía Calero (Eindhoven University of Technology)

David Dubbeldam (Universiteit van Amsterdam)

T.J.H. Vlugt (TU Delft - Engineering Thermodynamics)

Research Group
Engineering Thermodynamics
DOI related publication
https://doi.org/10.1021/acs.jpcc.5c03868
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Engineering Thermodynamics
Issue number
40
Volume number
129
Pages (from-to)
18234-18249
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Shape-selective adsorption in zeolites plays a pivotal role in catalytic hydroisomerization of long-chain alkanes, a key process in producing sustainable aviation fuels from Fischer–Tropsch products. Accurately predicting adsorption behavior for the large number of alkane isomers in different zeolite frameworks is computationally intensive. To address this, we have developed a machine learning framework that rapidly and accurately predicts Henry coefficients of linear (C1–C30) and branched (C4–C20) alkanes in one-dimensional zeolites. Using descriptors based on chain length, branching patterns, and molecular graphs, we evaluate multiple ML models, including Random Forest, XGBoost, CatBoost, TabPFN, and D-MPNN in MTT-, MTW-, MRE-, and AFI-type zeolites. TabPFN and D-MPNN offer the highest predictive accuracy. Active learning further boosts model performance by efficiently selecting diverse and structurally informative isomers. We also uncover activity cliffs, where small changes in molecular structure lead to sharp variations in adsorption, and demonstrate that targeted oversampling of these cases improves model robustness. Finally, we combine the ML-predicted Henry coefficients with gas-phase thermodynamics to compute reaction equilibrium distributions for C16 hydroisomerization. This integrated, data-driven approach enables efficient screening and design of shape-selective zeolite catalysts, thereby reducing the need for costly simulations