TY - JOUR
T1 - The plumbing of land surface models: benchmarking model performance
AU - Best, M.J.
AU - Abramowitz, G.
AU - Johnson, H.R.
AU - Pitman, A.J.
AU - Balsamo, G.
AU - Boone, A.
AU - Cuntz, M.
AU - Decharme, B.
AU - Dirmeyer, P.A.
AU - Dong, J.
AU - Ek, M.
AU - Guo, Z.
AU - Haverd, V.
AU - van den Hurk, B.J.J.M.
AU - Nearing, G.S.
AU - Pak, B.
AU - Peters-Lidard, C.
AU - Santanello, J.A.
AU - Stevens, L.
AU - Vuichard, N.
PY - 2015
Y1 - 2015
N2 - The Protocol for the Analysis of Land Surface Models (PALS) Land Surface Model Benchmarking Evaluation Project (PLUMBER) was designed to be a land surface model (LSM) benchmarking intercomparison. Unlike the traditionalmethods of LSMevaluation or comparison, benchmarking uses a fundamentally different approach in that it sets expectations of performance in a range ofmetrics a priori-before model simulations are performed. This can lead to very different conclusions about LSM performance. For this study, both simple physically basedmodels and empirical relationships were used as the benchmarks. Simulations were performed with 13 LSMs using atmospheric forcing for 20 sites, and thenmodel performance relative to these benchmarks was examined. Results show that even for commonly used statistical metrics, the LSMs' performance varies considerably when compared to the different benchmarks. All models outperform the simple physically based benchmarks, but for sensible heat flux the LSMs are themselves outperformed by an out-of-sample linear regression against downward shortwave radiation.Whilemoisture information is clearly central to latent heat flux prediction, the LSMs are still outperformed by a three-variable nonlinear regression that uses instantaneous atmospheric humidity and temperature in addition to downward shortwave radiation. These results highlight the limitations of the prevailing paradigm of LSMevaluation that simply compares an LSMto observations and to other LSMs without a mechanism to objectively quantify the expectations of performance. The authors conclude that their results challenge the conceptual view of energy partitioning at the land surface.
AB - The Protocol for the Analysis of Land Surface Models (PALS) Land Surface Model Benchmarking Evaluation Project (PLUMBER) was designed to be a land surface model (LSM) benchmarking intercomparison. Unlike the traditionalmethods of LSMevaluation or comparison, benchmarking uses a fundamentally different approach in that it sets expectations of performance in a range ofmetrics a priori-before model simulations are performed. This can lead to very different conclusions about LSM performance. For this study, both simple physically basedmodels and empirical relationships were used as the benchmarks. Simulations were performed with 13 LSMs using atmospheric forcing for 20 sites, and thenmodel performance relative to these benchmarks was examined. Results show that even for commonly used statistical metrics, the LSMs' performance varies considerably when compared to the different benchmarks. All models outperform the simple physically based benchmarks, but for sensible heat flux the LSMs are themselves outperformed by an out-of-sample linear regression against downward shortwave radiation.Whilemoisture information is clearly central to latent heat flux prediction, the LSMs are still outperformed by a three-variable nonlinear regression that uses instantaneous atmospheric humidity and temperature in addition to downward shortwave radiation. These results highlight the limitations of the prevailing paradigm of LSMevaluation that simply compares an LSMto observations and to other LSMs without a mechanism to objectively quantify the expectations of performance. The authors conclude that their results challenge the conceptual view of energy partitioning at the land surface.
U2 - 10.1175/JHM-D-14-0158.1
DO - 10.1175/JHM-D-14-0158.1
M3 - Article
SN - 1525-7541
VL - 16
SP - 1425
EP - 1442
JO - Journal of Hydrometeorology
JF - Journal of Hydrometeorology
IS - 3
ER -