terclim by ICS banner
IVES 9 IVES Conference Series 9 Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante1, Greg Jones2 and Diego Tomasi3

1Department of Viticulture & Enology, California State University, Fresno, USA
2Abacela Vineyard and Winery, Roseburg, OR, USA
3CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Contact the author

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

Anthropogenic intervention in shaping Terroir in a California Pinot noir vineyard

In many vineyards optimal parcel size exceeds the geospatial complexity that exists in soils and topographic features that influence hydrological properties, sunlight interception and soil depth and texture (available water capacity).

Effect of pre-fermentative cold soaking and use of different enzymes on the chemical and sensory properties of Catarratto wines

The wine industry widely recognizes that early-harvested grapes or those with uneven ripeness at harvest can produce wines with an “unripe fruit” mouthfeel [1,2]. Despite this, it is still unknown which compounds cause these sensory flaws or the most effective winemaking techniques to address them.

Estimation of plant hydraulics of grapevine in various «terroirs» in the Canton of Vaud (Switzerland)

The study of the physiological behaviour of the grapevine (cv. Chasselas), and of plant hydraulics in particular, was conducted on various « terroirs » in the Canton of Vaud (Switzerland) between 2001 and 2003 by Agroscope Changins-Wädenswil ACW, in collaboration with the firm I. Letessier (SIGALES) in Grenoble and the Federal Polytechnic School of Lausanne (EPFL). An evaluation of the vine plant hydraulics was made by means of physiological indicators (leaf and stem water potentials, transpiration and leaf stomatal conductance, carbon isotope discrimination and a model of transpirable soil water), in relation to estimations of the soil water reservoir and climatic factors.

Investigation of the biostimulant activity of naringenin on anthocyanins biosynthesis: from an explanatory transcriptomic approach on Gamay callus towards a future vineyard application

Context and purpose of the study. Anthocyanins are essential phenolic compounds in red wine, contributing significantly to colour intensity, stability, and sensory quality.

Coming of age: do old vines actually produce berries with higher enological potential than young vines? A case study on the Riesling cultivar

Consumers and the wine industry tend to agree on the ability of old vines to produce fruit that allows the production of wine of superior character. However, despite past and ongoing research, objective evidence of this point of view is still debated and studies on robust, specifically dedicated plots are scarce. Thus the impact of grapevine age on berry oenological potential and wine quality remains an open question. To try to objectively address the issue, a unique vineyard was established at Geisenheim University, Germany. It was planted in 1971 with cv. Riesling grafted on 5C Teleki. In 1995 and 2012, several rows were uprooted and replanted with the same rootstock/scion combination, resulting in a vineyard with alternate rows of identical plant material, but with different planting dates. The parameters of technical maturity and grape composition at harvest were analyzed during seasons 2014, 2015, 2016 and 2017 combining HPLC and enzymatic methods. Separate micro-vinifications were made for each age group and wine composition was analyzed by a combination of 1H-NMR and SPE-GC-MS.