Guoqiang Wang | TU Delft Repository

Zeeman: A Deep Learning Framework for Regional Atmospheric Chemistry Forecasting

Journal article (2026) - Mijie Pang, Jianbing Jin, Arjo Segers, Hai Xiang Lin, Guoqiang Wang, Hong Liao, Wei Han

Atmospheric chemistry encapsulates the emission of various pollutants, complex chemical reactions, and meteorology-dominated transport, which together form a dynamic system that governs air quality. While deep learning (DL) models have shown promise in capturing intricate patterns for forecasting individual atmospheric components, such as PM2.5 and ozone, the critical interactions among multiple pollutants and the combined influence of emissions and meteorology are often overlooked. This study introduces Zeeman, a DL-based framework for atmospheric chemistry forecasting. Our model effectively captures the nuanced relationships among these constituents while achieving a 68.5-fold increase in computational speed compared to a traditional numerical model. Evaluations demonstrate that our approach rivals the numerical model, offering an efficient solution for atmospheric chemistry forecasting. In the future, this model could be further integrated with data assimilation techniques to facilitate efficient and accurate atmospheric emission estimation and concentration forecasting. ...

Sequential hierarchical Bayesian model and particle filter estimation with two-step RJMCMC resampling

Journal article (2026) - Yue Huan, Guoqiang Wang, Hai Xiang Lin

Data assimilation (DA) combines numerical model simulations with observed data to obtain the best possible description of a dynamical system and its uncertainty. Incorrect modeling assumptions can lead to filter divergence, making model identification an important issue in the field of DA. Variations in dynamic model structures can result in differences in parameter dimensions, complicating the resampling step in particle filters (PFs). To meet this challenge, this paper proposes the Sequential Hierarchical Bayesian Model (SHBM), which integrates the evolution model and the observation model from the DA scheme with the hierarchical parameter model. A two-step resampling method is also proposed to estimate the SHBM: the first step uses the resampling scheme of the bootstrap filter to resample new particles based on weights, which may produce some duplicate particles; the second step uses Reversible Jump Markov Chain Monte Carlo (RJMCMC) methods to draw new particles from the target distribution. This approach ensures particle diversity, with the first step aimed at avoiding particle degeneracy and the second step at preventing sample impoverishment. The performance on the advection equation example and the Lorenz 96 example demonstrates the effectiveness of the proposed method. ...
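As a rough illustration of the first resampling step only, the sketch below shows weight-based multinomial resampling as commonly used in a bootstrap filter, together with the effective sample size often used to diagnose degeneracy. The function names and the toy particles are hypothetical, and the RJMCMC second step is not reproduced here because the abstract does not give its details.

```python
import random


def effective_sample_size(weights):
    """Return 1 / sum(w_i^2) for normalized weights; low values signal degeneracy."""
    total = sum(weights)
    return 1.0 / sum((w / total) ** 2 for w in weights)


def bootstrap_resample(particles, weights, rng):
    """Draw len(particles) particles with probability proportional to weight.

    Heavily weighted particles are likely to be duplicated, which is the
    sample impoverishment that a second (move) step would need to address.
    """
    n = len(particles)
    resampled = rng.choices(particles, weights=weights, k=n)
    # After resampling, all particles carry equal weight.
    return resampled, [1.0 / n] * n


# Toy example (assumed values): one particle dominates the weights.
rng = random.Random(0)
particles = [0.1, 0.5, 0.9, 1.3]
weights = [0.97, 0.01, 0.01, 0.01]

new_particles, new_weights = bootstrap_resample(particles, weights, rng)
print("ESS before resampling:", effective_sample_size(weights))
print("Resampled particles:", new_particles)
```

With weights this skewed, most resampled particles are expected to be copies of the dominant one, which is why a diversity-restoring second step is attractive.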

A Bayesian non-parametric approach to dynamic conditional angular correlation model with application to portfolio optimization

Journal article (2026) - Zhangshuang Sun, Qian Li, Haixiang Lin, Guoqiang Wang

In financial time series analysis, the dynamic conditional correlation model is the most popular method for estimating the conditional covariance matrix, which represents financial risk and is critical for risk management, portfolio optimization, and asset pricing. Traditional covariance matrix estimation is often constrained by rigid parameter settings and the assumption of a normal distribution, leading to estimation biases when market data are not normally distributed. To address these limitations, this paper proposes a Bayesian Non-parametric Dynamic Conditional Angular Correlation model based on the Fractionally Integrated GARCH model (BNDCAC-FIGARCH) that incorporates an asymmetric parameter and the Student’s t-distribution to increase adaptability and flexibility. Simulation experiments demonstrate that when the overall correlation path follows a sine or ramp function, our model provides more accurate estimates, showcasing its effectiveness and stability. Empirical studies use real stock market data, including DAX 40, FTSE 100, SSE 50, and CSI 100, to construct optimized portfolios. The results demonstrate the superiority of the proposed model in terms of both portfolio returns and the reduction of parameter uncertainty. Furthermore, the results indicate that CSI 100 exhibits weaker asymmetry than the other indices, likely due to its higher liquidity and a more accurate reflection of improved economic conditions resulting from national policies. ...

A Transformer-based agent model of GEOS-Chem v14.2.2 for informative prediction of PM2.5 and O3 levels to future emission scenarios: TGEOS v1.0

Journal article (2026) - Dehao Li, Jianbing Jin, Guoqiang Wang, Mijie Pang, Weihong Zhang, Hong Liao

Efficient and informative air quality modeling in future emission scenarios is vital for effective formulation of emission reduction policies. Traditional chemical transport models (CTMs) struggle with the computational demands required for timely predictions. While advanced emulator techniques greatly accelerate the CTM simulation process, they fall short in providing comprehensive estimates of future air quality due to their limited model structure. Additionally, these emulators often have difficulty simultaneously accounting for varying emission variables and the effects of regional transport, which limits their applicability and undermines prediction accuracy. In this study, an informative future air quality prediction model “TGEOS v1.0” based on the Transformer framework is developed as an efficient agent model of GEOS-Chem v14.2.2. TGEOS is able to efficiently estimate key statistical indicators of PM2.5 and O3 concentrations under future emission scenarios and capture potential extreme pollution events, taking approximately 2.51 s to execute a one-year estimation. The model incorporates sectoral emissions of up to 26 distinct species as well as the impacts of regional emissions and meteorology on pollutant concentrations, enhancing its versatility and predictive accuracy. The spatial and probability distributions predicted by TGEOS are in good agreement with GEOS-Chem, with the correlation coefficients for PM2.5 and O3 exceeding 0.98 in high-pollution months. Compared with other machine learning models, TGEOS, based on the Transformer framework, showcases superior performance, underscoring the potential of the Transformer framework in air quality modeling. ...

An AUC-based multi-kernel weighted support vector machine ensemble algorithm for breast cancer diagnosis

Journal article (2025) - Mushuang Cheng, Lintong Liu, Haixiang Lin, Guoqiang Wang

Machine learning algorithms have demonstrated outstanding performance for disease diagnosis. Kernel function selection plays a crucial role in effectively transforming the nonlinear nature of input data. To enhance breast cancer diagnosis, we propose a novel ensemble algorithm, namely, AUC-Ada- (Formula presented.) MKL-WSVM, which integrates Weighted Support Vector Machines (WSVM), AdaBoost, and Multi-Kernel Learning (MKL). This ensemble algorithm introduces two main innovations. First, it simultaneously updates the weights of training samples and the combined kernel function during classification. Second, it incorporates an AUC-based approach to adjust training sample weights, effectively controlling the growth rate of misclassified sample weights in AdaBoost. Experimental results are provided to demonstrate the effectiveness of our method, which achieves an AUC score of 97.21% and an accuracy of 97.64% on the WDBC dataset, and an AUC of 97.53% and an accuracy of 97.46% on the WBC dataset. Comparative analysis further confirms that our ensemble algorithm outperforms four benchmark models in classification accuracy. ...
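For readers unfamiliar with the two ingredients named above, the following minimal sketch shows a pairwise AUC computation and the standard (textbook) AdaBoost sample-weight update. It is not the AUC-based adjustment proposed in the paper, whose exact form the abstract does not give; the function names and toy data are assumptions made for illustration.

```python
import math


def auc_score(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def adaboost_update(weights, correct):
    """Standard AdaBoost reweighting for one round.

    `correct[i]` is True when the weak learner classified sample i correctly.
    Misclassified samples are scaled up by exp(alpha), correct ones down by
    exp(-alpha), and the result is renormalized to sum to 1.
    Assumes the weighted error is strictly between 0 and 1.
    """
    total = sum(weights)
    err = sum(w for w, c in zip(weights, correct) if not c) / total
    alpha = 0.5 * math.log((1.0 - err) / err)
    raw = [w * math.exp(-alpha if c else alpha) for w, c in zip(weights, correct)]
    norm = sum(raw)
    return [w / norm for w in raw], alpha


# Toy data (assumed): four samples, one misclassified by the weak learner.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
auc = auc_score(labels, scores)

new_weights, alpha = adaboost_update([0.25] * 4, [True, True, True, False])
print("AUC:", auc)
print("Updated weights:", new_weights)
```

In the standard update the misclassified samples end up holding half of the total weight after each round, which illustrates the rapid weight growth that the abstract says the AUC-based adjustment is meant to control.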

Machine learning based bias correction for numerical chemical transport models

Journal article (2021) - Min Xu, Jianbing Jin, Guoqiang Wang, Arjo Segers, Tuo Deng, Hai Xiang Lin

Air quality warning and forecasting systems are usually based on numerical chemical transport models (CTMs). These dynamic models make predictions by simulating the life cycles of atmospheric components, including emission, transport and removal. However, the accuracy of these CTMs is still limited because of many imperfections, e.g., uncertainties in input sources such as emission inventories, wind fields and boundary conditions, as well as insufficient knowledge about the atmospheric dynamics themselves. All of these bias the CTM prediction persistently, that is, in a systematic way. In this paper, an approach based on machine learning is applied to predict the model bias in the CTM. It is then combined with the CTM to formulate a hybrid forecast system. To our knowledge, it is the first time that machine learning methods are used in this way. The hybrid system is tested on fine particulate matter (PM2.5) prediction in Shanghai, China. The results showed that machine learning can be an effective tool to improve the accuracy of CTM prediction. In the case of short-term PM2.5 forecasts (forecast length less than 12 h), the root mean square error, mean absolute error, mean absolute percentage error and air quality rank prediction accuracy all show that the forecast skill is remarkably improved, while for long-term prediction, improvement is not ensured. ...
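The hybrid idea described above can be sketched as follows: fit a model to the CTM's past bias, then add the predicted bias to new CTM forecasts. This sketch swaps in a one-feature ordinary least squares fit as a stand-in for the machine learning model, defines bias as observation minus CTM (an assumed sign convention), and uses synthetic numbers rather than any data from the paper.

```python
import math


def fit_linear_bias(features, biases):
    """Ordinary least squares for bias ~ a * feature + b (stand-in for the ML model)."""
    n = len(features)
    mean_x = sum(features) / n
    mean_y = sum(biases) / n
    sxx = sum((x - mean_x) ** 2 for x in features)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(features, biases))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


def rmse(pred, obs):
    """Root mean square error."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))


def mae(pred, obs):
    """Mean absolute error."""
    return sum(abs(p - o) for p, o in zip(pred, obs)) / len(obs)


def mape(pred, obs):
    """Mean absolute percentage error, in percent (observations must be nonzero)."""
    return 100.0 * sum(abs((p - o) / o) for p, o in zip(pred, obs)) / len(obs)


# Synthetic training data (assumed): the CTM underestimates PM2.5 by 20 %.
ctm_train = [30.0, 45.0, 60.0, 80.0]
obs_train = [36.0, 54.0, 72.0, 96.0]
bias_train = [o - c for o, c in zip(obs_train, ctm_train)]

# Learn the bias as a function of the CTM output itself (hypothetical feature choice).
a, b = fit_linear_bias(ctm_train, bias_train)

# Hybrid forecast = CTM forecast + predicted bias.
ctm_test = [50.0, 70.0]
obs_test = [60.0, 84.0]
hybrid = [c + (a * c + b) for c in ctm_test]

print("RMSE CTM   :", rmse(ctm_test, obs_test))
print("RMSE hybrid:", rmse(hybrid, obs_test))
print("MAE hybrid :", mae(hybrid, obs_test))
print("MAPE hybrid:", mape(hybrid, obs_test))
```

A real system would use richer predictors, such as meteorology and recent observations, and a more flexible learner; the point here is only the structure of adding a learned correction to the CTM output and scoring it with the error metrics named in the abstract.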