terclim by ICS banner
IVES 9 IVES Conference Series 9 Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante1, Greg Jones2 and Diego Tomasi3

1Department of Viticulture & Enology, California State University, Fresno, USA
2Abacela Vineyard and Winery, Roseburg, OR, USA
3CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Contact the author

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

Evoluzione stagionale delle temperature ed andamento della maturazione nel vitigno Aglianico: risultati di un quadriennio di osservazioni in Campania

In viticoltura, la comprensione dell’influenza della temperatura dell’aria sulla dinamica della maturazione assume importante rilievo in relazione all’ ottimizzazione dell’ epoca di raccolta da cui dipende in modo significativo la qualità del prodotto finale.

Impact of climate on berry weight dynamics of a wide range of Vitis vinifera cultivars 

In order to study the impact of climate change on Bordeaux grape varieties and to assess the behavior of candidate grape varieties potentially better adapted to the new climatic conditions, an experimental vineyard composed of 52 grape varieties was planted in 2009 at the INRAE Bordeaux Aquitaine center[1]. Among the many parameters studied since 2012, berry weight for each variety was measured weekly from mid-veraison to maturity, with four independent replicates. The kinetics obtained allowed to study berry growth, a key parameter in grape composition and yield.

Distinctive flavour or taint? The case of smoky characters in wine

Forest fires in the vicinity of vineyards have significantly increased in the last decade and are a concern for grapegrowers and winemakers in many wine producing countries. The fires cause smoke drift throughout vineyards which cannot be avoided and may result in the production of wines described as ‘smoke tainted’. Such wines are characterized by undesirable sensory characters described as ‘smoky’, ‘burnt’, ‘ash’ aromas and flavours, and also may cause a lingering, unpleasant ashy aftertaste [1; 2].

The Pampa and the vineyard: gaucho´s natural and symbolic aspects in the identity´s constitution of “Vinhos da Campanha”’s terroir – RS/Brasil

The wine region of “Vinhos da Campanha” is located in southern Brazil, on the Uruguay borderline. The colonization’s process in the region was characterized by territorial disputes between Portuguese

Historical zoning in the world

The study of the interaction between vineyards and the environment to establish the grapevines in the appropriate places has been applied in wine science for 5000 years. Advances in the field of the zoning have not been uniform in time, and have occupied a preferential place in the contributions of Roman writers of the 1st Century AC, the contemplations of Tokay (1700) and Porto (1756) and works of the second half of the 20th century. Zoning practices today integrate multidisciplinary methodologies (viticulture, enology, soils, climatology, cartography, statistics, computer science) and require further development for future application.