Skip to main content
Research article - Peer-reviewed, 2021

Multi-model evaluation of phenology prediction for wheat in Australia

Wallach, Daniel; Palosuo, Taru; Thorburn, Peter; Hochman, Zvi; Andrianasolo, Fety; Asseng, Senthold; Basso, Bruno; Buis, Samuel; Crout, Neil; Dumont, Benjamin; Ferrise, Roberto; Gaiser, Thomas; Gayler, Sebastian; Hiremath, Santosh; Hoek, Steven; Horan, Heidi; Hoogenboom, Gerrit; Huang, Mingxia; Jabloun, Mohamed; Jansson, Per-Erik; Jing, Qi; Justes, Eric; Kersebaum, Kurt Christian; Launay, Marie; Lewan, Elisabet; Luo, Qunying; Maestrini, Bernardo; Moriondo, Marco; Olesen, Jorgen Eivind; Padovan, Gloria; Poyda, Arne; Priesack, Eckart; Pullens, Johannes Wilhelmus Maria; Qian, Budong; Schuetze, Niels; Shelia, Vakhtang; Souissi, Amir; Specka, Xenia; Srivastava, Amit Kumar; Stella, Tommaso; Streck, Thilo; Trombi, Giacomo; Wallor, Evelyn; Wang, Jing; Weber, Tobias Kd; Weihermueller, Lutz; de Wit, Allard; Woehling, Thomas; Xiao, Liujun; Zhao, Chuang; Zhu, Yan; Seidel, Sabine J.
Show less authors


Predicting wheat phenology is important for cultivar selection, for effective crop management and provides a baseline for evaluating the effects of global change. Evaluating how well crop phenology can be predicted is therefore of major interest. Twenty-eight wheat modeling groups participated in this evaluation. Our target population was wheat fields in the major wheat growing regions of Australia under current climatic conditions and with current local management practices. The environments used for calibration and for evaluation were both sampled from this same target population. The calibration and evaluation environments had neither sites nor years in common, so this is a rigorous evaluation of the ability of modeling groups to predict phenology for new sites and weather conditions. Mean absolute error (MAE) for the evaluation environments, averaged over predictions of three phenological stages and over modeling groups, was 9 days, with a range from 6 to 20 days. Predictions using the multi-modeling group mean and median had prediction errors nearly as small as the best modeling group. About two thirds of the modeling groups performed better than a simple but relevant benchmark, which predicts phenology by assuming a constant temperature sum for each development stage. The added complexity of crop models beyond just the effect of temperature was thus justified in most cases. There was substantial variability between modeling groups using the same model structure, which implies that model improvement could be achieved not only by improving model structure, but also by improving parameter values, and in particular by improving calibration techniques.


Evaluation; Phenology; Wheat; Australia; Structure uncertainty; Parameter uncertainty

Published in

Agricultural and Forest Meteorology
2021, Volume: 298, article number: 108289
Publisher: ELSEVIER

    SLU Authors

    UKÄ Subject classification

    Agricultural Science
    Meteorology and Atmospheric Sciences

    Publication Identifiers


    Permanent link to this page (URI)