WAC 2022 banner
IVES 9 IVES Conference Series 9 WAC 9 WAC 2022 9 3 - WAC - Oral 9 Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies including High Performance Liquid Chromatography (HPLC) can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques including UV-VIS and Near and Mid Infrared spectroscopy are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Terlodego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150 fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and Coefficient of Variation (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10% we acknowledge that both the grape extraction method repeatability and HPLC reference method repeatability likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins. Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True negative results for the Prediction Data for all varieties. We conclude that the A-TEEM method requires a minimum of sample preparation and rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.

DOI:

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam, Gilmore, Qiang, Sui

Presenting author

Adam, Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Contact the author

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins- Tannins-Grape Variety

Tags

IVES Conference Series | WAC 2022

Citation

Related articles…

Bioclimatic shifts and land use options for Viticulture in Portugal

Land use, plays a relevant role in the climatic system. It endows means for agriculture practices thus contributing to the food supply. Since climate and land are closely intertwined through multiple interface processes, climate change may lead to significant impacts in land use. In this study, 1-km observational gridded datasets are used to assess changes in the Köppen–Geiger and Worldwide Bioclimatic (WBCS)

The combined effects of climate, soils, and deficit irrigation on yield and quality of Touriga Nacional under high atmospheric demand in the Douro Region

Global warming is one of the biggest environmental, social and economic threats in several viticultural regions. In the Douro Valley, changes are expected in the coming years, namely an increase in temperature and a decrease in precipitation. These changes are likely to have consequences for the production and quality of wine.
The aim of this study was to explore the effects of different soil characteristics combined with several deficit irrigation strategies, managed throughout ETc references and predawn leaf water potentials thresholds, on physiology, yield, and qualitative attributes on the Touriga Nacional variety under years of mild to severe water and heat stress.
The studies were conducted over seven years (2015 to 2021) in two plots of a commercial vineyard located at Quinta do Ataíde (Symington Family Estates) planted in 2011 and 2014 at 170 meters elevation, growing under three water regimes: non-irrigated (NI) and two deficit irrigation strategies (30% and 60% ETc) assessed weekly by Ψpd. The site has an annual rainfall below 500 mm, with high atmospheric demand. Climate data was collected from a weather station, located on site. Berry ripening was followed weekly for fruit analysis. At harvest, yield, vigour and pruning weight per vine were determined from 90 vines by treatment. Each season at veraison the NDVI Index was accessed by a drone. The soils physic-chemistry in the experimental blocs were analysed and grouped by SWHC. Delta C-13 analyses were also performed per treatment in two years.Irrigation had a positive effect on yield per vine, mostly due to an increase in berry and cluster weight, and fertility index through the years. A significant increase in sugar content, colour and phenols was observed with deficit irrigation in some years, but vine vigour related to soil characteristics had by far the greatest impact on quality.

Projected changes in vine phenology of two varieties with different thermal requirements cultivated in La Mancha DO (Spain) under climate change scenarios

The aim of this work was to analyze the phenology variability of Tempranillo and Chardonnay cultivars, related to the climatic characteristics in La Mancha Designation of Origin, and their potential changes under climate change scenarios. Phenological dates referred to budbreak, flowering, veraison and harvest were analyzed for the period 2000-2019. The weather conditions at daily time scale, recorded during the same period, were also evaluated. The thermal requirements to reach each of these phenological stages were calculated and expressed as the GDD accumulated from DOY=60. Changes in phenology were projected by 2050 and 2070 taking into account those values and the projected temperatures and precipitation, simulated under two Representative Concentration Pathway (RCP) scenarios –RCP4.5 and RCP8.5– using an ensemble of models. The average phenological dates during the period under study were, April 16th ± 6.6 days and April 5th ± 6.0 days for budbreak, May 31st ± 6.0 days and May 27th ± 5.3 days for flowering, July 26th ± 5.6 days and July 25th ± 5.8 days for veraison, and Ago 23rd ± 10.8 days and Ago 17th ± 9.0 days for harvest, respectively, for Tempranillo and Chardonnay. The projected changes in temperature imply an average change in the maximum growing season (April-August) temperatures of 1.2 and 1.9°C by 2050, and 1.6 and 2.6°C by 2070, under the RCP4.5 and RCP8.5 scenarios, respectively. A reduction in precipitation is predicted, which vary between 15% for 2050 under RCP4.5 scenario and up to 30% by 2070 under RCP8.5. The advance of the phenological dates for 2050, could be of 6, 7, 7, and 8 days for Tempranillo and 4, 6, 6 and 9 days for Chardonnay, respectively for budbreak, flowering, veraison and harvest under the RCP4.5 scenario. Under the RCP8.5 emission scenario, the advance could be up to 30% higher.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Influence of weather and climatic conditions on the viticultural production in Croatia

The research includes an analysis of the impact of weather conditions on phenological development of the vine and grape quality, through monitoring of four experimental cultivars (Chardonnay, Graševina, Merlot and Plavac mali) over two production years. In each experimental vineyard, which were evenly distributed throughout the regions of Slavonia and The Croatian Danube, Croatian Uplands,