Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

Abstract

A large varietal collection of over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisin grapes, have been recorded at budbreak, flowering, veraison, and ripening since 1964. Despite the effort invested in maintaining such an extensive collection and acquiring data from it, the dataset has varying degrees of missing values depending on the variety and the year. Such gaps are ubiquitous in large, long-running phenology datasets. In this work, we evaluated four state-of-the-art methods for estimating missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated performance against the original values using the normalized root mean squared error. For the full dataset, we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputation of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
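The two evaluation schemes described above can be illustrated with a minimal sketch. This is not the authors' code. It assumes a variety-by-year table of day-of-year values with `None` for gaps, uses synthetic data rather than the Conegliano records, and substitutes a simple per-variety mean imputer for kNN, mice, MissForest, or BRITS. The abstract does not say how the error is normalized; dividing the RMSE by the standard deviation of the held-out true values is an assumption here. All function names are hypothetical.

```python
import math
import random
import statistics


def nrmse(true_vals, imputed_vals):
    """RMSE divided by the std. dev. of the true values (assumed normalization)."""
    mse = statistics.fmean((t - p) ** 2 for t, p in zip(true_vals, imputed_vals))
    return math.sqrt(mse) / statistics.pstdev(true_vals)


def row_mean_impute(table):
    """Stand-in imputer: fill each gap with the mean of that variety's observed years."""
    all_observed = [v for row in table for v in row if v is not None]
    global_mean = statistics.fmean(all_observed)
    filled = []
    for row in table:
        observed = [v for v in row if v is not None]
        # Fall back to the global mean when a variety has no observed values left.
        fill = statistics.fmean(observed) if observed else global_mean
        filled.append([fill if v is None else v for v in row])
    return filled


def monte_carlo_cv(table, impute, holdout_frac=0.10, repeats=1000, seed=0):
    """Repeatedly hide a random share of the observed cells and score their imputation."""
    rng = random.Random(seed)
    observed = [(i, j) for i, row in enumerate(table)
                for j, v in enumerate(row) if v is not None]
    n_test = max(2, round(holdout_frac * len(observed)))
    scores = []
    for _ in range(repeats):
        test_cells = rng.sample(observed, n_test)
        masked = [list(row) for row in table]
        for i, j in test_cells:
            masked[i][j] = None
        imputed = impute(masked)
        truth = [table[i][j] for i, j in test_cells]
        preds = [imputed[i][j] for i, j in test_cells]
        scores.append(nrmse(truth, preds))
    return statistics.fmean(scores)


def make_toy_table(n_varieties=8, n_years=20, seed=42):
    """Synthetic day-of-year table: variety offset + shared year effect + noise."""
    rng = random.Random(seed)
    year_effect = [rng.gauss(0, 5) for _ in range(n_years)]
    table = []
    for _ in range(n_varieties):
        offset = rng.gauss(0, 6)
        table.append([110 + offset + ye + rng.gauss(0, 2) for ye in year_effect])
    # Knock out some cells to mimic pre-existing gaps.
    for _ in range(15):
        table[rng.randrange(n_varieties)][rng.randrange(n_years)] = None
    return table


toy = make_toy_table()

# Scheme 1: 10% hold-out, repeated (fewer repeats than the 1000 in the text, for speed).
print("10% hold-out NRMSE:", monte_carlo_cv(toy, row_mean_impute, repeats=50))

# Scheme 2: increasing shares of randomly deleted values.
for frac in (0.1, 0.3, 0.5, 0.7):
    print(frac, monte_carlo_cv(toy, row_mean_impute, holdout_frac=frac, repeats=50))
```

Passing the imputer as a function keeps the evaluation loop independent of the method being compared, so any imputer with the same table-in, table-out shape could be scored the same way.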

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Luca Brillante¹, Greg Jones² and Diego Tomasi³

¹ Department of Viticulture & Enology, California State University, Fresno, USA
² Abacela Vineyard and Winery, Roseburg, OR, USA
³ CREA-VE Research Centre for Viticulture and Enology, Conegliano, Italy

Keywords

phenology, climate change, time series, imputation methods, recurrent neural networks

Tags

IVES Conference Series | Terclim 2022
