terclim by ICS banner
IVES 9 IVES Conference Series 9 Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante1, Greg Jones2 and Diego Tomasi3

1Department of Viticulture & Enology, California State University, Fresno, USA
2Abacela Vineyard and Winery, Roseburg, OR, USA
3CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Contact the author

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

Capture depletion of grapevine DNA: an approach to advance the study of microbial community in wine

The use of next-generation sequencing (NGS) has helped understand microbial genetics in oenology. Current studies mainly focus on barcoded amplicon NGS but not shotgun sequencing, which is useful for functional analyses. Since the high percentage of grapevine DNA conceals the microbial DNA in must, the majority of sequencing data is wasted in bioinformatic analyses. Here we present capture depletion of grapevine whole genome DNA.

Oligosaccharides from Vitis vinifera grape seeds: a focus on gentianose as a novel bioactive compound

AIM. Grape seeds (Vitis vinifera) are among the main constituents of grape pomace, also exploited in ingredients for nutraceutics and cosmeceutics, particularly regarding the phenolic fraction. The macromolecules of grape/wine include polyphenols, proteins and polysaccharides.

«Promitheus» the new greek red wine grape arromatic variety

This paper presents is the create, the study and amplographic description the newGreek aromatic variety of red wine grapes “Promitheus”, created in 2012

SIP and save the planet: a sensory and consumer exploration of australian wines made from potentially drought-tolerant white wine grapes

In order to attenuate the effects of climate change on the ability to cultivate quality wine grape vines in Australia, it is essential to adapt to the projected less favourable Australian climate scenarios. One response may be to convert a portion of the current grapevine plantings to those varieties that demand less water and can tolerate increased heat. This investigation aimed to (i) generate sensory profiles and (ii) obtain knowledge about Australian wine consumers’ preferences and opinions of Australian wines made from potentially drought tolerant, white wine grape varieties not traditionally cultivated in Australia. A Rate-All-That-Apply (RATA) sensory panel (n = 49) generated sensory profiles of 44 commercial white wines made from 7 different white grape varieties (Arinto, Fiano, Garganega, Greco, Verdejo, Verdelho and Vermentino), plus two benchmark examples each of an Australian Riesling, Pinot Gris and Chardonnay wine.

Applications of a novel molecular phenology scale to align the stages of grape berry development

Phenology scales widely adopted by viticulturists (i.e., BBCH or modified E-L systems) are classification tools that describe seasonal and precisely recognized stages of fruit growth and development based on specific descriptors such as visual/physical traits or easy-to-measure compositional parameters.