terclim by ICS banner
IVES 9 IVES Conference Series 9 Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante1, Greg Jones2 and Diego Tomasi3

1Department of Viticulture & Enology, California State University, Fresno, USA
2Abacela Vineyard and Winery, Roseburg, OR, USA
3CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Contact the author

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

Structural composition of polymeric polyphenols of red wine after long-term ageing: effect of vinification technology

Aged red wines possess phenolic composition very different from young ones due to the transformations among native grape phenolics and the formation of new polymeric polyphenols during aging process.

Exploring the impact of grape pressing on must and wine composition

Pressing has a relevant impact on the characteristics of the must and subsequently on white wines produced [1]. Therefore, the adequate management of pressing can lead to the desired extraction of phenols and other grape compounds (i.e. Organic acids), aromas and their precursors, allowing the production of balanced wines [2]. This aspect is especially important to sparkling wine where the acidity and pH, and the content of phenols affect its longevity and the expected sensory character.

Impact of smoke exposure on the chemical composition of grapes

Vineyard exposure to smoke can lead to grapes and wine which exhibit objectionable smoky and ashy aromas and flavours, more commonly known as ‘smoke taint’ [1, 2]. In the last decade, significant bushfires have occurred around the world, including near wine regions in Australia, Canada, South Africa and the USA, as a consequence of the warmer, drier conditions associated with climate change. Considerable research has subsequently been undertaken to determine the chemical, sensory and physiological consequences of grapevine exposure to smoke. The sensory attributes associated with smoke-tainted wine have been linked to the presence of several smoke-derived volatile phenols, such as guaiacols, syringols and cresols [2].

Impact of environmental conditions in vscs production during wine fermentation by Saccharomyces cerevisiae

The aroma of wine is one of the most important determinants of quality as it strongly influences the consumer’s acceptance or rejection. Among the thousands of molecules comprising the wine aroma, sulfur-containing compounds can be considered as a “double-edged sword”: some of them, deriving from varietal precursors provide fruity pleasant aromas, while other ones, produced by yeast metabolism are related to “unpleasant” aromas

Composition and molar mass distribution of different must and wine colloids

A major problem for winemakers is the formation of proteinaceous haze after bottling. Although the exact mechanisms remain unclear, this haze is formed by unfolding and agglomeration of grape proteins, being additionally influenced by numerous further factors.