Using Explainable Machine Learning Forecasts to Discover Subseasonal Drivers of High Summer Temperatures in Western and Central Europe

Chiem van Straaten, Kirien Whan, Dim Coumou, Bart van den Hurk, Maurice Schmeits

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Reliable subseasonal forecasts of high summer temperatures would be very valuable for society. Although state-of-the-art numerical weather prediction (NWP) models have become much better in representing the relevant sources of predictability like land and sea surface states, the subseasonal potential is not fully realized. Complexities arise because drivers depend on the state of other drivers and on interactions over multiple time scales. This study applies statistical modeling to ERA5 data, and explores how nine potential drivers, interacting on eight time scales, contribute to the subseasonal predictability of high summer temperatures in western and central Europe. Features and target temperatures are extracted with two variations of hierarchical clustering, and are fitted with a machine learning (ML) model based on random forests. Explainable AI methods show that the ML model agrees with physical understanding. Verification of the forecasts reveals that a large part of predictability comes from climate change, but that reliable and valuable subseasonal forecasts are possible in certain windows, like forecasting monthly warm anomalies with a lead time of 15 days. Contributions of each driver confirm that there is a transfer of predictability from the land and sea surface state to the atmosphere. The involved time scales depend on lead time and the forecast target. The explainable AI methods also reveal surprising driving features in sea surface temperature and 850 hPa temperature, and rank the contribution of snow cover above that of sea ice. Overall, this study demonstrates that complex statistical models, when made explainable, can complement research with NWP models, by diagnosing drivers that need further understanding and a correct numerical representation, for better future forecasts.
Original languageEnglish
Pages (from-to)1115–1134
Number of pages20
JournalMonthly Weather Review
Volume150
Issue number5
Early online date1 May 2022
DOIs
Publication statusPublished - 20 May 2022

Funding

Acknowledgments. This study is part of the open research programme Aard-en Levenswetenschappen, Project ALWOP.395, which is financed by the Dutch Research Council (NWO). We thank maintainers and funders of the BAZIS cluster at VU Amsterdam for computational resources. We thank Kees Kok, Kate Saunders (TU Delft), Jasper Velthoen (TU Delft), and Sem Vijverberg (VU Amsterdam) for useful discussions. We thank two anonymous reviewers for comments that improved the quality of this manuscript. We thank Michael Scheuerer for further helpful comments and his role as editor. Acknowledgments. This study is part of the open research programme Aard- en Levenswetenschappen, Project ALWOP.395, which is financed by the Dutch Research Council (NWO). We thank maintainers and funders of the BAZIS cluster at VU Amsterdam for computational resources. We thank Kees Kok, Kate Saunders (TU Delft), Jasper Velthoen (TU Delft), and Sem Vijverberg (VU Amsterdam) for useful discussions. We thank two anonymous reviewers for comments that improved the quality of this manuscript. We thank Michael Scheuerer for further helpful comments and his role as editor.

FundersFunder number
Nederlandse Organisatie voor Wetenschappelijk Onderzoek
open research programme Aard- en Levenswetenschappen

    Keywords

    • Statistical forecasting
    • Subseasonal variability
    • Machine learning
    • ; Model interpretation and visualization

    Fingerprint

    Dive into the research topics of 'Using Explainable Machine Learning Forecasts to Discover Subseasonal Drivers of High Summer Temperatures in Western and Central Europe'. Together they form a unique fingerprint.

    Cite this