A generic forecast verification framework for administrative purposes

Mason, Simon J.; Weigel, Andreas

There are numerous reasons for calculating forecast verification scores, and considerable attention has been given to designing and analyzing the properties of scores that can be used for scientific purposes. Much less attention has been given to scores that may be useful for administrative reasons, such as communicating changes in forecast quality to bureaucrats and providing indications of forecast quality to the general public. The two-alternative forced choice (2AFC) test is proposed as a scoring procedure that is sufficiently generic to be usable on forecasts ranging from simple yes–no forecasts of dichotomous outcomes to forecasts of continuous variables, and can be used with deterministic or probabilistic forecasts without seriously reducing the more complex information when available. Although, as with any single verification score, the proposed test has limitations, it does have broad intuitive appeal in that the expected score of an unskilled set of forecasts (random guessing or perpetually identical forecasts) is 50%, and is interpretable as an indication of how often the forecasts are correct, even when the forecasts are expressed probabilistically and/or the observations are not discrete.


  • thumnail for Mason_SJ_&_AP_Weigel_2009_MWR_137_331.pdf Mason_SJ_&_AP_Weigel_2009_MWR_137_331.pdf application/pdf 203 KB Download File

Also Published In

Monthly Weather Review

More About This Work

Academic Units
International Research Institute for Climate and Society
Published Here
March 13, 2020