Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, together with identification of grape varieties, is important for effective quality control of harvesting and initial processing in winemaking. Current reference technologies, including High Performance Liquid Chromatography (HPLC), can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques, including UV-VIS and Near and Mid Infrared spectroscopy, are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n = Calibration / p = Prediction samples): Cabernet Sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petite Sirah (52/10), Pinot Noir (54/8), Syrah (20/2), Teroldego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150-fold in 50% EtOH at pH 2 were analyzed in parallel using the A-TEEM method, with Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of Prediction (RMSEP), the Relative Error of Prediction (REP%) and the Coefficient of Determination (R²P) of the Prediction data (n=62). The regression results yielded an average REP% of 6.0±2.4% and an average R²P of 0.941±0.024. While we consider REP% values below 10% to be acceptable, we acknowledge that the repeatability of both the grape extraction method and the HPLC reference method likely contributed the major sources of variation; for example, the A-TEEM sample REP% was 1.31 for Polymeric Tannins.
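The three prediction statistics named above can be illustrated with a minimal sketch. The abstract does not define them, so the formulas here are assumptions based on common chemometric usage: RMSEP as the root mean squared residual, REP% as RMSEP relative to the mean reference value, and R²P as one minus the ratio of residual to total sum of squares. The function name and the numbers are hypothetical and are not taken from the study.

```python
import math


def prediction_metrics(y_ref, y_pred):
    """Return (RMSEP, REP%, R2P) for reference vs. predicted values.

    Assumed definitions (not stated in the abstract):
      RMSEP = sqrt(sum((ref - pred)^2) / n)
      REP%  = 100 * RMSEP / mean(ref)
      R2P   = 1 - SS_residual / SS_total
    """
    n = len(y_ref)
    if n == 0 or n != len(y_pred):
        raise ValueError("y_ref and y_pred must be non-empty and of equal length")
    mean_ref = sum(y_ref) / n
    ss_res = sum((r - p) ** 2 for r, p in zip(y_ref, y_pred))
    ss_tot = sum((r - mean_ref) ** 2 for r in y_ref)
    rmsep = math.sqrt(ss_res / n)
    rep_pct = 100.0 * rmsep / mean_ref
    r2p = 1.0 - ss_res / ss_tot
    return rmsep, rep_pct, r2p


# Hypothetical reference (e.g. HPLC) and predicted values, for illustration only.
y_ref = [10.0, 20.0, 30.0, 40.0]
y_pred = [11.0, 19.0, 31.0, 39.0]
rmsep, rep_pct, r2p = prediction_metrics(y_ref, y_pred)
print(f"RMSEP={rmsep:.3f}  REP%={rep_pct:.2f}  R2P={r2p:.3f}")
```

Under these definitions REP% is unitless, which is what allows a single threshold such as the 10% mentioned above to be applied to compounds measured on different concentration scales.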
Varietal classification was performed using XGB discrimination analysis of the Multiblock data and evaluated on the Prediction data. The classification yielded 100% True Positive and True Negative results on the Prediction data for all varieties. We conclude that the A-TEEM method requires minimal sample preparation, offers rapid acquisition times (<1 min), and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.
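As a rough illustration of how per-variety True Positive and True Negative results might be tallied on a prediction set, the sketch below computes one-vs-rest rates from lists of true and predicted labels. The one-vs-rest framing, the function name, and the labels are assumptions made for illustration; the abstract does not describe how the authors computed these figures.

```python
def one_vs_rest_rates(y_true, y_pred):
    """Return {label: (true_positive_rate, true_negative_rate)} per class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be of equal length")
    pairs = list(zip(y_true, y_pred))
    rates = {}
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        tn = sum(1 for t, p in pairs if t != label and p != label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        # Rates are undefined (NaN) when a class has no positives or no negatives.
        tpr = tp / (tp + fn) if (tp + fn) else float("nan")
        tnr = tn / (tn + fp) if (tn + fp) else float("nan")
        rates[label] = (tpr, tnr)
    return rates


# Hypothetical labels with one deliberate misclassification (a Merlot called Syrah).
y_true = ["Merlot", "Merlot", "Syrah", "Malbec"]
y_pred = ["Merlot", "Syrah", "Syrah", "Malbec"]
rates = one_vs_rest_rates(y_true, y_pred)
for variety, (tpr, tnr) in rates.items():
    print(f"{variety}: TPR={tpr:.2f}  TNR={tnr:.2f}")
```

A result of 100% True Positive and True Negative for every variety, as reported above, corresponds to every entry of this dictionary being (1.0, 1.0), that is, no sample of any variety assigned to another.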

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam Gilmore, Qiang Sui

Presenting author

Adam Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins – Tannins – Grape Variety

Tags

IVES Conference Series | WAC 2022
