Hydrothermal carbonization (HTC) is a promising process for biomass valorization; however, optimizing HTC conditions and characterizing hydrochar through experimental methods remain costly and time-consuming. Artificial intelligence (AI) and its related machine learning (ML) tech
...
Hydrothermal carbonization (HTC) is a promising process for biomass valorization; however, optimizing HTC conditions and characterizing hydrochar through experimental methods remain costly and time-consuming. Artificial intelligence (AI) and its related machine learning (ML) techniques provide an efficient and cost-effective alternative, enabling efficient optimization and predictive analysis without extensive experimental tests. In this study, an analysis-ready database (ARD) comprising 544 data points was constructed using the authors' previous research (41) and scattered data compiled from the literature (503). An ensemble of eight diversified machine learning (ML) models was developed using biomass-agnostic properties and process conditions to predict hydrochar properties including elemental analysis, proximate analysis, and hydrochar yield. Tailor-made decision fusion models were developed for each target by merging the outputs of the best-performing models. Furthermore, a set of interpretable machine learning (IML) and explainable AI (XAI) techniques, leveraging feature importance analysis and SHapley Additive exPlanations (SHAP) values, indicated that biomass fixed carbon (FC) content and process temperature are the most influencing features, which were then considered as inputs for the ensemble models. The decision fusion models achieved high accuracy for target prediction, surpassing the currently used models in the literature, with adjusted R2 values ranging from 0.98 to 1.0. For outputs that are typically difficult to predict, such as hydrochar yield, the model achieved an adjusted R2 of 0.98, representing over a 5 % improvement compared to the best-performing model reported in the literature. All predictions were derived from white-box modeling by incorporating XAI into the ensemble-based learning, ensuring greater model transparency and interpretability.