terclim by ICS banner
IVES 9 IVES Conference Series 9 Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante1, Greg Jones2 and Diego Tomasi3

1Department of Viticulture & Enology, California State University, Fresno, USA
2Abacela Vineyard and Winery, Roseburg, OR, USA
3CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Contact the author

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

The ability of wine yeasts fermenting by the addition of exogenous biotin

Research is focused on the increase of the field of obtaining the wine yeast, under physical and chemical conditions. Study of different influences on yeast production is very important for the promotion

Characterization of commercial enological tannins and its effect on human saliva diffusion

Commercial oenological tannins (TECs) are widely used in the wine industry. TECs are rich in condensed tannins, hydrolyzable tannins or a mixture of both. Wine grapes are a important source of proanthocyanidins or condensed tannins while oak wood possess a high concentration of hydrolyzable tannins (Obreque-Slier et al., 2009). TECs contribute with the antioxidant capacity of wine, catalyze oxide-reduction reactions and participate in the removal of sulfur compounds and metals.

Key learnings about the chemical bases of wine uniqueness and quality, essential companions for future developments

This presentation aims to demonstrate that the value attributed to wine as we today know it is based on three factors: 1) sensory balance, 2) personality, and 3) bioactivity.

Sensory definition of green aroma concept in red French wines. Evidence for the contribution of novel volatile markers

The aromatic complexity of a wine results from the perception of the association of volatile molecules and each aroma can be categorized into different families. The “green” aromas family in red wines has retained our attention by its close link with the fruity perception. In that study, the “green” olfactory concept of red wines was considered through a strategy combining both sensory analysis and hyphenated chromatographic techniques including HPLC and MDGC (Multidimensional Gas Chromatography). The aromatic space of this concept was specified by lexical generation through a free association task on 22 selected wines by a panel of wine experts. Then, 70 French red wines were scored on the basis of the intensity of their “green” and “fruity” attributes.

Stomatal restrictions to photosynthesis in grapevine cultivars grown in a semiarid environment

Diurnal changes in the leaves of field-grown grapevine (Vitis vinifera L.) cultivars Syrah and Tempranillo were followed over summer 2009 with respect to gas exchanges. Net photosynthetic rate (AN) of both cultivars rapidly increased in the morning, decreasing slowly until the late afternoon, when reached the lowest values.