WAC 2022 banner
IVES 9 IVES Conference Series 9 WAC 9 WAC 2022 9 3 - WAC - Oral 9 Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies including High Performance Liquid Chromatography (HPLC) can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques including UV-VIS and Near and Mid Infrared spectroscopy are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Terlodego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150 fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and Coefficient of Variation (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10% we acknowledge that both the grape extraction method repeatability and HPLC reference method repeatability likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins. Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True negative results for the Prediction Data for all varieties. We conclude that the A-TEEM method requires a minimum of sample preparation and rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.

DOI:

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam, Gilmore, Qiang, Sui

Presenting author

Adam, Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Contact the author

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins- Tannins-Grape Variety

Tags

IVES Conference Series | WAC 2022

Citation

Related articles…

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Impact of climate change on the viticultural climate of the Protected Designation of Origin “Jumilla” (SE Spain)

Protected Designation of Origin “Jumilla” (PDO Jumilla) is located in the Spanish provinces of Albacete and Murcia, in the South-eastern part of the Iberian Peninsula, where most of the models predict a severe impact of climate change in next decades. PDO Jumilla covers an area of 247,054 hectares, of which more than 22,000 hectares

Grapevine xylem embolism resistance spectrum reveals which varieties have a lower mortality risk in a future dry climate

Wine growing regions have recently faced intense and frequent droughts that have led to substantial economical losses, and the maintenance of grapevine productivity under warmer and drier climate will rely notably on planting drought-resistant cultivars. Given that plant growth and yield depend on water transport efficiency and maintenance of photosynthesis, thus on the preservation of the vascular system integrity during drought, a better understanding of drought-related hydraulic traits that have a significant impact on physiological processes is urgently needed. We have worked towards this end by assessing vulnerability to xylem embolism in 30 grapevine commercial varieties encompassing red and white Vitis vinifera varieties, hybrid varieties characterized by a polygenic resistance for powdery and downy mildew, and commonly used rootstocks. These analyses further allowed a global assessment of wine regions with respect to their varietal diversity and resulting vulnerability to stem embolism. Hybrid cultivars displayed the highest vulnerability to embolism, while rootstocks showed the greatest resistance. Significant variability also arose among Vitis vinifera varieties, with Ψ12 and Ψ50 values ranging from -0.4 to -2.7 MPa and from -1.8 to -3.4 MPa, respectively. Cabernet franc, Chardonnay and Ugni blanc featured among the most vulnerable varieties while Pinot noir, Merlot and Cabernet Sauvignon ranked among the most resistant. In consequence, wine regions bearing a significant proportion of vulnerable varieties, such as Poitou-Charentes, France and Marlborough, New Zealand, turned out to be at greater risk under drought. These results highlight that grapevine varieties may not respond equally to warmer and drier conditions, outlining the importance to consider hydraulic traits associated with plant drought tolerance into breeding programmes and modeling simulations of grapevine yield maintenance under severe drought. They finally represent a step forward to advise the wine industry about which varieties and regions would have the lowest risk of drought-induced mortality under climate change.

Projected changes in vine phenology of two varieties with different thermal requirements cultivated in La Mancha DO (Spain) under climate change scenarios

The aim of this work was to analyze the phenology variability of Tempranillo and Chardonnay cultivars, related to the climatic characteristics in La Mancha Designation of Origin, and their potential changes under climate change scenarios. Phenological dates referred to budbreak, flowering, veraison and harvest were analyzed for the period 2000-2019. The weather conditions at daily time scale, recorded during the same period, were also evaluated. The thermal requirements to reach each of these phenological stages were calculated and expressed as the GDD accumulated from DOY=60. Changes in phenology were projected by 2050 and 2070 taking into account those values and the projected temperatures and precipitation, simulated under two Representative Concentration Pathway (RCP) scenarios –RCP4.5 and RCP8.5– using an ensemble of models. The average phenological dates during the period under study were, April 16th ± 6.6 days and April 5th ± 6.0 days for budbreak, May 31st ± 6.0 days and May 27th ± 5.3 days for flowering, July 26th ± 5.6 days and July 25th ± 5.8 days for veraison, and Ago 23rd ± 10.8 days and Ago 17th ± 9.0 days for harvest, respectively, for Tempranillo and Chardonnay. The projected changes in temperature imply an average change in the maximum growing season (April-August) temperatures of 1.2 and 1.9°C by 2050, and 1.6 and 2.6°C by 2070, under the RCP4.5 and RCP8.5 scenarios, respectively. A reduction in precipitation is predicted, which vary between 15% for 2050 under RCP4.5 scenario and up to 30% by 2070 under RCP8.5. The advance of the phenological dates for 2050, could be of 6, 7, 7, and 8 days for Tempranillo and 4, 6, 6 and 9 days for Chardonnay, respectively for budbreak, flowering, veraison and harvest under the RCP4.5 scenario. Under the RCP8.5 emission scenario, the advance could be up to 30% higher.

Ecophysiological performance of Vitis rootstocks under water stress

The use of rootstocks tolerant to soil water deficit is an interesting strategy to cope with limited water availability. Currently, several nurseries are breeding new genotypes, but the physiological basis of its responses under water stress are largely unknown. To this end, an ecophysiological assessment of the conventional 110-Richter (110R) and SO4, and the new M1 and M4 rootstocks was carried out in potted ungrafted plants. During one season, these Vitis genotypes were grown under greenhouse conditions and subjected to two water regimes, well-watered and water deficit. Water potentials of plants under water deficit down to < -1.4 MPa, and net photosynthesis (AN) <5 μmol m-2 s-1 did not cause leaf oxidative stress damage compared to well-watered conditions in any of the genotypes. The antioxidant capacity was sufficient to neutralize the mild oxidative stress suffered. Under both treatments, gravimetric differences in daily water use were observed among genotypes, leading to differences in the biomass of root, shoot and leaf. Under well-watered conditions, SO4 and 110R were the most vigorous and M1 and M4 the least. However, under water stress, SO4 exhibited the greatest reduction in biomass while M4 showed the lowest. Remarkably, under these conditions, SO4 reached the least negative stem water potential (Ψstem), while M1 reduced stomatal conductance (gs) and AN the most. In addition, SO4 and M1 genotypes also showed the highest and lowest hydraulic conductance values, respectively. Our results suggest that there are differences in water use regulation among genotypes, not only attributed to differences in stomatal regulation or intrinsic water use efficiency at the leaf level. Therefore, because no differences in canopy-to-root ratio were achieved, it is hypothesized that xylem vessel anatomical differences may be driving the reported differences among rootstocks performance. Results demonstrate that each Vitis rootstock differs in its ecophysiological responses under water stress.