Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies, including High Performance Liquid Chromatography (HPLC), can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques, including UV-VIS and Near and Mid Infrared spectroscopy, are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Teroldego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150-fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and the Coefficient of Determination (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10%, we acknowledge that the repeatability of both the grape extraction method and the HPLC reference method was likely the major source of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins.
Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True Negative results for the Prediction data for all varieties. We conclude that the A-TEEM method requires minimal sample preparation, offers rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.
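
The evaluation quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names are hypothetical, the fusion step simply concatenates the two blocks without any block scaling, and REP% is assumed here to be RMSEP expressed as a percentage of the mean reference value, which is one common convention that the abstract does not spell out.

```python
import numpy as np

def fuse_blocks(absorbance, eem):
    """Concatenate absorbance spectra with unfolded EEMs, one row per sample.

    absorbance: (n_samples, n_wavelengths)
    eem:        (n_samples, n_excitation, n_emission)
    Assumption: plain concatenation; real multiblock fusion often scales blocks first.
    """
    absorbance = np.asarray(absorbance, dtype=float)
    eem = np.asarray(eem, dtype=float)
    unfolded = eem.reshape(eem.shape[0], -1)  # (n_samples, n_excitation * n_emission)
    return np.hstack([absorbance, unfolded])

def prediction_metrics(y_ref, y_pred):
    """Return RMSEP, REP% and R2P for a held-out prediction set."""
    y_ref = np.asarray(y_ref, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    resid = y_ref - y_pred
    rmsep = float(np.sqrt(np.mean(resid ** 2)))
    # Assumed definition: RMSEP relative to the mean reference value, in percent.
    rep_pct = 100.0 * rmsep / float(np.mean(y_ref))
    ss_res = float(np.sum(resid ** 2))
    ss_tot = float(np.sum((y_ref - y_ref.mean()) ** 2))
    return {"RMSEP": rmsep, "REP%": rep_pct, "R2P": 1.0 - ss_res / ss_tot}

def one_vs_rest_rates(labels_true, labels_pred):
    """Per-variety (true positive rate, true negative rate), one class vs. the rest.

    Needs at least two classes in labels_true, otherwise the negative set is empty.
    """
    labels_true = np.asarray(labels_true)
    labels_pred = np.asarray(labels_pred)
    rates = {}
    for cls in np.unique(labels_true):
        is_cls = labels_true == cls
        pred_cls = labels_pred == cls
        tpr = float(np.mean(pred_cls[is_cls]))    # correctly assigned to this variety
        tnr = float(np.mean(~pred_cls[~is_cls]))  # correctly kept out of this variety
        rates[str(cls)] = (tpr, tnr)
    return rates
```

Under these definitions, the reported 100% True Positive and True Negative result corresponds to every variety returning `(1.0, 1.0)` from `one_vs_rest_rates` on the Prediction data.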

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam Gilmore, Qiang Sui

Presenting author

Adam Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins – Tannins – Grape Variety

Tags

IVES Conference Series | WAC 2022

Related articles…

Climate change projections to support the transition to climate-smart viticulture

The Earth’s system is undergoing major changes through a wide range of spatial and temporal scales as a response to growing anthropogenic radiative forcing, which is pushing the whole system far beyond its natural variability. Sources of greenhouse gases largely exceed their sinks, thus leading to a strengthened greenhouse effect. More energy is thereby being supplied to the system, with inevitable shifts in climatic patterns and weather regimes. Over the last decades, these modifications have been manifested in the full statistical distributions of the atmospheric variables, with dramatic changes in the frequency and intensity of extremes. Natural hazards, such as severe droughts, floods, forest fires, or heatwaves, are being triggered by extreme atmospheric events worldwide, thus threatening human activities. Viticulture is not only exposed to changing climates but is also highly vulnerable, as grapevine phenology and physiological development are strongly controlled by atmospheric conditions. Therefore, the assessment of climate change projections for a given region is critical for climate change adaptation and risk reduction in viticulture. By adopting timely and suitable measures, the future sustainability and resiliency of the sector can be fostered. Climate-grapevine chain modelling is an essential tool for better planning and management. However, the accuracy of the resulting projections is limited by many uncertainties that must be duly taken into account when transferring knowledge to stakeholders and decision-makers. Climate-smart viticulture will comprise ensembles of locally tuned strategies, envisioning both adaptation and mitigation, assisted by emerging technologies and decision-support systems.

Evaluation of climate change impacts at the Portuguese Dão terroir over the last decades: observed effects on bioclimatic indices and grapevine phenology

In recent decades, growers in the Portuguese Dão winegrowing region (center of Portugal) have been experiencing changes in climate that are influencing grape phenology, berry health and ripening. To study the relationships between climate indices (CI), seasonal weather and grapevine phenology, this work analyzed long-term climate and phenological data collected at the experimental vineyard of the Portuguese Dão research centre between 1958 and 2019 (61 years) for the red variety Touriga Nacional. The trends over time for the classical temperature-based indices (Growing Season Temperature, GST; Growing Degree Days, GDD; Huglin Index, HI; and Cool Night Index, CI) presented a significantly positive slope, while the Dryness Index (DI) showed a negative trend over the last 61 years. Regarding grapevine phenology, an average advance of 4.5 days per decade in the harvest day was observed throughout the last 61 years. Consequently, the weather conditions during the ripening period have changed, showing an increasing trend over time in the average temperature (of higher magnitude in the maximum than in the minimum temperature) and a decrease in the accumulated rainfall. A regression analysis showed that ~50% of the year-to-year variability in harvest date was explained by the variability of the temperature-based indices. These observed effects of climate change on bioclimatic indices, and the corresponding advance of the harvest date, can still be considered advantageous for the Dão terroir, as they allow optimal berry ripening to be achieved before the common equinox rains and therefore avoid the potential negative impacts of rainfall on berry health and composition.
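
As a rough illustration of the two calculations behind these figures, the sketch below computes growing degree days and a least-squares trend expressed per decade. The function names, the 10 °C base temperature and the synthetic harvest series are assumptions for illustration only; none of the Dão data are reproduced here.

```python
import numpy as np

def growing_degree_days(tmax, tmin, base=10.0):
    """Sum of daily mean temperature above a base (assumed 10 degrees C)."""
    tmean = (np.asarray(tmax, dtype=float) + np.asarray(tmin, dtype=float)) / 2.0
    return float(np.sum(np.maximum(tmean - base, 0.0)))

def trend_per_decade(years, values):
    """Slope of an ordinary least-squares line, scaled from per year to per decade."""
    slope, _intercept = np.polyfit(
        np.asarray(years, dtype=float), np.asarray(values, dtype=float), 1
    )
    return 10.0 * float(slope)

# Synthetic example: a harvest day-of-year that advances by 0.45 days each year.
years = np.arange(1958, 2020)
harvest_doy = 280.0 - 0.45 * (years - 1958)
advance = trend_per_decade(years, harvest_doy)  # negative value means earlier harvest
```

A negative slope for harvest day-of-year corresponds to an earlier harvest, so an advance of 4.5 days per decade appears as a trend of about -4.5 in this convention.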

Adaptation to soil and climate through the choice of plant material

Choosing the rootstock, the scion variety and the training system best suited to the local soil and climate is the key to economically sustainable wine production. The choice of the rootstock/scion variety best adapted to the characteristics of the soil is essential but, by changing climatic conditions, ongoing climate change disrupts the fine-tuned local equilibrium. Higher temperatures induce shifts in developmental stages, with, on the one hand, increasing fears of spring frost damage and, on the other hand, ripening during the warmest periods in summer. Expected higher water demand and longer and more frequent drought events are also major concerns. The genetic control of the phenotypes, by genomic information but also by the epigenetic control of gene expression, offers many opportunities for adapting the plant material to the future. For complex traits, genomic selection is also a promising method for predicting phenotypes. However, ecophysiological modelling is necessary to better anticipate the phenotypes in unexplored climatic conditions. Genetic approaches applied to parameters of ecophysiological models rather than raw observed data are more than ever the basis for finding, or building, the ideal varieties of the future.

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne”, representing 37 organizations from the different wine regions, have been working increasingly closely over the last two decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid-1980s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. The INRAe Vassal collection plays a key role here as it holds the largest diversity available, along with a rich bibliography and herbariums, offering us the opportunity to document and double check the identity of a cultivar, consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, the IFV selection center takes over to initiate the process of selection and propagation. A specific focus on regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these were neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered positive qualities, not only in mitigating the effects of climate change, but also as an opportunity for restoring diversity…

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
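
The first evaluation scheme (repeated random hold-out of observed values) can be sketched as follows. This is an illustration under stated assumptions, not the study's code: a simple per-variety mean imputer stands in for kNN, mice, MissForest and BRITS, the function names are hypothetical, and the RMSE is normalized by the standard deviation of the held-out true values, which is only one of several conventions for a normalized RMSE.

```python
import numpy as np

def column_mean_impute(data):
    """Stand-in imputer: fill NaNs with the mean of observed values in the same column.

    Rows are assumed to be years and columns varieties.
    """
    filled = data.copy()
    col_means = np.nanmean(data, axis=0)
    rows, cols = np.where(np.isnan(data))
    filled[rows, cols] = col_means[cols]
    return filled

def monte_carlo_nrmse(data, impute, holdout_frac=0.10, n_repeats=100, seed=0):
    """Repeatedly hide a random fraction of observed values and score the imputer.

    Returns the mean and standard deviation of the normalized RMSE over repeats.
    Assumes each hold-out set has more than one distinct value (non-zero spread).
    """
    rng = np.random.default_rng(seed)
    observed = np.argwhere(~np.isnan(data))  # (row, col) of every known value
    n_hold = max(1, int(round(holdout_frac * len(observed))))
    scores = []
    for _ in range(n_repeats):
        picked = observed[rng.choice(len(observed), size=n_hold, replace=False)]
        r, c = picked[:, 0], picked[:, 1]
        masked = data.copy()
        truth = masked[r, c].copy()
        masked[r, c] = np.nan                # hide the held-out values
        estimate = impute(masked)[r, c]
        rmse = np.sqrt(np.mean((estimate - truth) ** 2))
        scores.append(rmse / np.std(truth))  # assumed normalization
    return float(np.mean(scores)), float(np.std(scores))
```

Any imputer that takes an array containing NaNs and returns a filled array of the same shape can be passed as `impute`, so the same loop could compare several methods on identical hold-out draws by reusing the seed.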