Interpretable machine learning for predicting and evaluating hydrogen production from supercritical water gasification of coal

  • Jianghua Tian
  • , Runqiu Dong
  • , Hanbing Jia
  • , Zhiyong Peng
  • , Zhigang Liu
  • , Le Wang
  • , Lei Yi
  • , Jialing Xu
  • , Hui Jin
  • , Bin Chen
  • , Liejin Guo

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Coal Supercritical Water Gasification (SCWG) process optimization by Machine Learning (ML) models is a promising strategy to conserve experimental resources. However, the lack of diversity in ML models and the neglect of their interpretability in existing works may limit the development of coal SCWG technology. This paper systematically collected 233 experimental results (1631 data points) to develop five ML models to analyze coal SCWG: Support Vector Regression (SVR), AdaBoost Regression (ABR), Decision Tree (DT), Random Forest (RF) Regression and Gradient Boosting Regression (GBR). The DT and GBR were found to have more robust predictive ability among the five models due to their superior performance in Mean Square Error (MSE), coefficient of determination (R2) and Mean Absolute Error (MAE). Temperature (TEMP) and Residence Time (RT) are the main controlling factors in determining gas production by analyzing the results based on SHapley Additive exPlanations (SHAP) values. There is a significant positive correlation between TEMP and RT and gas production. The SHAP values of the GBR model can well interpret the mechanism of the influence of coal SCWG parameters, especially the Concentration (CR) is negatively correlated with the gasification yields of H2, CO, and CO2, while it is positively correlated with the gas yield of CH4. Combining with the model predictive ability (MSE of 0.54, R2 of 0.97, MAE of 0.19) of the model and the interpretability of the mechanism, the GBR model may be a superior tool to assist the coal SCWG technology. The error analysis and catalyst were input into the GBR model as characteristic parameters to further enhance its robustness. Compared to the kinetic model, the GBR model improved the accuracy and generalization ability of the four-gas yield prediction by expanding the input parameters (TEMP, RT, CR, error, catalyst type and concentration). This work would be of great value in the prediction and optimization of the coal SCWG process.

Original languageEnglish
Article number136173
JournalFuel
Volume404
DOIs
StatePublished - 15 Jan 2026

Keywords

  • Coal
  • Machine learning
  • Optimization
  • Prediction and interpretation
  • Supercritical water gasification

Fingerprint

Dive into the research topics of 'Interpretable machine learning for predicting and evaluating hydrogen production from supercritical water gasification of coal'. Together they form a unique fingerprint.

Cite this