Assessing the Performance of Ensemble and Regularized Models for Daily Rainfall Forecasting in Singapore
DOI:
https://doi.org/10.60084/ijds.v3i2.360Keywords:
Tropical rainfall, Hydrometeorology, Ensemble learning, Predictive modeling, Flood risk assessmentAbstract
This study benchmarks ensemble and regularized machine learning models for daily rainfall forecasting using meteorological data from forty-four observation stations across Singapore. The country’s highly variable tropical climate and frequent short-duration rainfall events pose major challenges for urban flood mitigation and operational forecasting. To address this, three algorithms—Lasso Regression, XGBoost Regression, and Gradient Boosting Regression—were developed and evaluated through a systematic comparison of predictive performance. Each model was trained using data from 1980–2023 and validated on independent observations from 2024–2025. The input variables included sub-hourly rainfall intensity, temperature, and wind-related parameters processed through a standardized data-cleaning and imputation pipeline. Results show that XGBoost achieved the most consistent and accurate predictions, with superior performance under both normal and heavy rainfall conditions. Statistical tests confirmed that the improvement was significant compared to Lasso and Gradient Boosting. These findings demonstrate the effectiveness of ensemble-based approaches for enhancing the reliability of data-driven rainfall forecasting in tropical urban environments and support their integration into early warning and hydrological risk management systems.
Downloads
References
- IPCC. (2021). Climate Change 2021: The Physical Science Basis, Cambridge University Press, Cambridge, United Kingdom.
- Trenberth, K. (2011). Changes in Precipitation with Climate Change, Climate Research, Vol. 47, No. 1, 123–138. doi:10.3354/cr00953.
- Roth, M., and Chow, W. T. L. (2012). A Historical Review and Assessment of Urban Heat Island Research in Singapore, Singapore Journal of Tropical Geography, Vol. 33, No. 3, 381–397. doi:10.1111/sjtg.12003.
- Chow, W. T. L., Cheong, B. D., and Ho, B. H. (2016). A Multimethod Approach towards Assessing Urban Flood Patterns and Its Associated Vulnerabilities in Singapore, Advances in Meteorology, Vol. 2016, 1–11. doi:10.1155/2016/7159132.
- Singapore, M. S. (2025). Climate Data and Monitoring.
- Guo, Y., Fu, Q., Leung, L. R., Na, Y., and Lu, R. (2023). Trends in Warm Season Mesoscale Convective Systems Over Asia in 2001–2020, Journal of Geophysical Research: Atmospheres, Vol. 128, No. 17. doi:10.1029/2023JD038969.
- Tangang, F., Chung, J. X., Juneng, L., Supari, Salimun, E., Ngai, S. T., Jamaluddin, A. F., Mohd, M. S. F., Cruz, F., Narisma, G., Santisirisomboon, J., Ngo-Duc, T., Van Tan, P., Singhruck, P., Gunawan, D., Aldrian, E., Sopaheluwakan, A., Grigory, N., Remedio, A. R. C., Sein, D. V., Hein-Griggs, D., McGregor, J. L., Yang, H., Sasaki, H., and Kumar, P. (2020). Projected Future Changes in Rainfall in Southeast Asia Based on CORDEX–SEA Multi-Model Simulations, Climate Dynamics, Vol. 55, Nos. 5–6, 1247–1267. doi:10.1007/s00382-020-05322-2.
- World Meteorological Organization. (2016). Guidelines on Climate Data RescueGeneva, Switzerland, World Meteorological Organization.
- El-Shafie, A., Noureldin, A., Taha, M., Hussain, A., and Mukhlisin, M. (2012). Dynamic versus Static Neural Network Model for Rainfall Forecasting at Klang River Basin, Malaysia, Hydrology and Earth System Sciences, Vol. 16, No. 4, 1151–1169. doi:10.5194/hess-16-1151-2012.
- Cong, T., Ha, T.-H.-G., Vu, G.-L., Bui, H.-N., Dao, N.-Q.-H., Le, M.-H., Dinh, C.-M., Cong, M.-P., and Doan, Q.-V. (2025). Advancing Subseasonal Extreme Rainfall Forecasting in Vietnam Using Machine Learning, SOLA, Vol. 21, 2025–034. doi:10.2151/sola.2025-034.
- Akbar, A. A., Darmawan, Y., Wibowo, A., and Rahmat, H. K. (2024). Accuracy Assessment of Monthly Rainfall Predictions Using Seasonal ARIMA and Long Short-Term Memory (LSTM), Journal of Computer Science and Engineering (JCSE), Vol. 5, No. 2, 100–115.
- Chen, T., and Guestrin, C. (2016). Xgboost: A Scalable Tree Boosting System, Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, 785–794.
- Friedman, J. H. (2001). Greedy Function Approximation: A Gradient Boosting Machine., The Annals of Statistics, Vol. 29, No. 5. doi:10.1214/aos/1013203451.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Musliadi Musliadi, Muhammad Zulkarnaini, Asalul Musaffa, Yolanda Yolanda

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.




















