Data-Driven Weather Forecasting in Singapore Using Markov Chains and Naïve Bayes

Conference Paper (2026)
Author(s)

E. Gong (Westminster School )

D. Wang (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Statistics
DOI related publication
https://doi.org/10.1109/ISBDAS69350.2026.11485004 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Research Group
Statistics
Pages (from-to)
273-277
Publisher
IEEE
ISBN (print)
979-8-3315-7219-8
ISBN (electronic)
979-8-3315-7218-1
Event
2026 9th International Symposium on Big Data and Applied Statistics (ISBDAS) (2026-03-06 - 2026-03-08), Guangzhou, China
Downloads counter
4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Singapore’s tropical climate poses challenges for weather forecasting. This paper proposes a hybrid model combining first-order Markov Chains with Gaussian Naïve Bayes. Markov Chains capture temporal dependencies through transition probabilities, serving as dynamic priors for Naïve Bayes classification incorporating temperature, humidity, dew point, visibility, wind direction, cloud cover, and a monsoon phase index. Using daily data from Singapore (2020-2025) with an 80/20 split, the hybrid model achieved 84.3% (two-state) and 81.2% (four-state) accuracy, outperforming the standalone Markov Chain $(79.8 \% / 74.6 \%)$. Comparison against Logistic Regression, Random Forest, and XGBoost shows competitive performance with lower computational cost. Per-class evaluation using precision, recall, and F1-score reveals limitations for rare events. The approach provides a lightweight, interpretable framework for tropical weather forecasting.

Files

Taverne
warning

File under embargo until 24-08-2026