Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties, are both important for effective quality control of harvesting and initial processing for winemaking. Current reference technologies, including High Performance Liquid Chromatography (HPLC), can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques, including UV-VIS and Near and Mid Infrared spectroscopy, are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petite sirah (52/10), Pinot noir (54/8), Syrah (20/2), Teroldego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150-fold in 50% EtOH at pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and the Coefficient of Determination (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10%, we acknowledge that the repeatability of both the grape extraction method and the HPLC reference method likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins.
Varietal classification was analyzed using XGB discriminant analysis of the Multiblock data and evaluated based on the Prediction data. The classification yielded 100% True Positive and True Negative results on the Prediction data for all varieties. We conclude that the A-TEEM method requires minimal sample preparation, offers rapid acquisition times (<1 min), and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.
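The evaluation quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The function names are hypothetical, the fusion step is a plain concatenation with no block scaling, REP% is assumed to be RMSEP divided by the mean reference value, R2P is assumed to be the usual coefficient of determination on the Prediction set, and the input arrays are synthetic placeholders.

```python
import numpy as np


def fuse_blocks(absorbance, eem):
    """Concatenate absorbance spectra with unfolded EEMs, one row per sample.

    absorbance: shape (n_samples, n_wavelengths)
    eem:        shape (n_samples, n_excitation, n_emission)
    Assumption: simple concatenation, no block weighting or scaling.
    """
    absorbance = np.asarray(absorbance, dtype=float)
    eem = np.asarray(eem, dtype=float)
    unfolded = eem.reshape(eem.shape[0], -1)  # flatten each matrix to a row
    return np.hstack([absorbance, unfolded])


def prediction_metrics(y_ref, y_pred):
    """Return RMSEP, REP% and R2P for reference vs. predicted values."""
    y_ref = np.asarray(y_ref, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    residuals = y_pred - y_ref
    rmsep = float(np.sqrt(np.mean(residuals ** 2)))
    # Assumed definition: RMSEP relative to the mean reference value, in percent.
    rep_percent = 100.0 * rmsep / float(np.mean(y_ref))
    ss_res = float(np.sum(residuals ** 2))
    ss_tot = float(np.sum((y_ref - np.mean(y_ref)) ** 2))
    r2p = 1.0 - ss_res / ss_tot
    return {"RMSEP": rmsep, "REP%": rep_percent, "R2P": r2p}


def one_vs_rest_rates(true_labels, predicted_labels):
    """Per-variety (true positive rate, true negative rate), one-vs-rest.

    Needs at least two distinct varieties in true_labels.
    """
    true = np.asarray(true_labels)
    pred = np.asarray(predicted_labels)
    rates = {}
    for label in np.unique(true):
        is_label = true == label
        tpr = float(np.mean(pred[is_label] == label))
        tnr = float(np.mean(pred[~is_label] != label))
        rates[str(label)] = (tpr, tnr)
    return rates


# Synthetic placeholders, not data from the study.
fused = fuse_blocks(np.zeros((2, 3)), np.zeros((2, 4, 5)))
print(fused.shape)
print(prediction_metrics([10.0, 20.0, 30.0, 40.0], [11.0, 19.0, 31.0, 39.0]))
print(one_vs_rest_rates(["Merlot", "Syrah"], ["Merlot", "Syrah"]))
```

In practice the fused matrix would be passed to a gradient-boosting regressor or classifier, and the metrics would be computed on the held-out Prediction samples only.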

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam Gilmore, Qiang Sui

Presenting author

Adam Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins – Tannins – Grape Variety

Tags

IVES Conference Series | WAC 2022

Related articles…

Rapid damage assessment and grapevine recovery after fire

There is increasing scientific consensus that climate change is the underlying cause of the prolonged dry and hot conditions that have increased the risk of extreme fire weather in many countries around the world. In December 2019, a bushfire event occurred in the Adelaide Hills, South Australia, where 25,000 hectares were burnt and vineyards and surrounding areas suffered various degrees of scorching and infrastructure damage. The ability to coordinate and plan recovery after a fire event relies on robust and timely data. The current practice for measuring the scale and distribution of fire damage is to walk or drive the vineyard and score individual vines based on visual observation. The process is time consuming and subjective, or semi-quantitative at best. After the December 2019 fires, it took many months to access properties and estimate the area of vineyard damaged. This study compares the rapid assessment and mapping of fire damage using high-resolution satellite imagery with more traditional ground-based measures. Satellite imagery tracking vineyard recovery in the season following the bushfire is being correlated to field assessments of vineyard productivity such as canopy health and development, fertility and carbohydrate storage. Canopy health in the seasons following the fires correlated to the severity of the initial fire damage. Severely damaged vines had reduced canopy growth, were infertile or had very low fertility, and had lower carbohydrate levels in buds and canes during dormancy, which reduced productivity in the seasons following the bushfire event. In contrast, vines that received minor damage were able to recover within 1-2 years. Tools that rapidly and affordably capture the extent and severity of damage over large vineyard areas will allow producers, government and industry bodies to manage decisions in relation to fire recovery planning, coordination and delivery, improving the efficiency and effectiveness of their response.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
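The first evaluation scheme described above, a repeated random hold-out of observed values, can be sketched in Python as follows. This is an illustration under stated assumptions, not the study's code. A column-mean imputer stands in for the four methods compared, the error is normalized by the standard deviation of the held-out true values (the abstract does not state the normalization), and the function names and the toy matrix are hypothetical.

```python
import numpy as np


def column_mean_impute(data):
    """Stand-in imputer: fill each missing entry with its column mean."""
    col_means = np.nanmean(data, axis=0)
    filled = data.copy()
    rows, cols = np.where(np.isnan(filled))
    filled[rows, cols] = col_means[cols]
    return filled


def monte_carlo_nrmse(data, impute, holdout_fraction=0.1, repeats=100, seed=0):
    """Hide a random fraction of observed values, impute, and score.

    Returns the mean and standard deviation of the normalized RMSE
    across repeats. Assumed normalization: RMSE / std(held-out truth).
    """
    rng = np.random.default_rng(seed)
    observed = np.argwhere(~np.isnan(data))  # (row, col) of known values
    n_holdout = max(2, int(round(holdout_fraction * len(observed))))
    scores = []
    for _ in range(repeats):
        picked = observed[rng.choice(len(observed), size=n_holdout, replace=False)]
        rows, cols = picked[:, 0], picked[:, 1]
        truth = data[rows, cols]
        masked = data.copy()
        masked[rows, cols] = np.nan  # hide the held-out values
        imputed = impute(masked)
        rmse = np.sqrt(np.mean((imputed[rows, cols] - truth) ** 2))
        scores.append(rmse / np.std(truth))
    return float(np.mean(scores)), float(np.std(scores))


# Toy matrix: rows are variety-years, columns are four phenological stages
# expressed as day of year. Values are synthetic, not the Conegliano data.
rng = np.random.default_rng(1)
toy = np.array([100.0, 150.0, 220.0, 260.0]) + rng.normal(0.0, 5.0, size=(40, 4))
toy[rng.random(toy.shape) < 0.15] = np.nan  # pre-existing gaps
print(monte_carlo_nrmse(toy, column_mean_impute, repeats=50))
```

Any imputer with the same signature (a matrix with NaNs in, a filled matrix out) can be passed in place of `column_mean_impute`, which keeps the scoring loop identical across methods.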

Elucidating vineyard site contributions to key sensory molecules: Identification of correlations between elemental composition and volatile aroma profile of site-specific Pinot noir wines

The reproducibility of the elemental profile in wines produced across multiple vintages has been previously reported using grapes from a single scion clone of Vitis vinifera L. cv. Pinot noir. The grapevines were grown on fourteen different vineyard sites, from Oregon to southern California in the U.S.A., which span distances from approximately hundreds of meters to 1450 km, while elevations range from near sea level to nearly 500 m. In addition, sensorial (i.e. aroma, taste, and mouthfeel) and chemical (i.e. polyphenolic and volatile) differences across the different vineyard sites have also been observed among these wines at two aging time points. While strong evidence exists to support that grapes grown in different regions can produce wines with unique chemical and sensorial profiles, even when a single clone is used, the understanding of growing site characteristics that result in this reproducible differentiation continues to emerge. One hypothesis is that the elemental profile that a vineyard site imparts to the grape berries and the resulting wine is an important contributor to this differentiation in the chemistry and sensory properties of wines. For example, various classes of enzymes that catalyze the formation of key aroma compounds or their precursors require specific metals. In this work, we begin to report correlations between elemental and volatile aroma profiles of site-specific Pinot noir wines, made under standardized winemaking conditions, that have been previously shown to be distinguished separately by these chemical analyses.

Aromatic maturity is a cornerstone of terroir expression in red wine

Harvesting grapes at adequate maturity is key to the production of high-quality red wines. Enologists and wine makers define several types of maturity, including technical maturity, phenolic maturity and aromatic maturity. Technical maturity and phenolic maturity are relatively well documented in the scientific literature, while articles on aromatic maturity are scarcer. This is surprising, because aromatic maturity is, without a doubt, the most important of the three in determining wine quality and typicity (including terroir expression). Optimal terroir expression can be obtained when the different types of maturity are reached at the same time, or within a short time frame. This is more likely to occur when the ripening takes place under mild temperatures, neither too cool, nor too hot. Aromatic expression in wine can be driven, from low to high maturity, by green, herbal, fresh fruit, ripe fruit, jammy fruit, candied fruit or cooked fruit aromas. Green and cooked fruit aromas are not desirable in red wines, while the levels of other aromatic compounds contribute to the typicity of the wine in relation to its origin. Wines produced in cool climates, or on cool soils in temperate climates, are likely to express herbal or fresh fruit aromas; while wines produced under warm climates, or on warm soils in temperate climates, may express ripe fruit, jammy fruit or candied fruit aromas. Growers can optimize terroir expression through their choice of grapevine variety. Early ripening varieties perform better in cool climates and late ripening varieties in warm climates. Additionally, maturity can be advanced or delayed by different canopy management practices or training systems.

What are the optimal ranges and thresholds for berry solar radiation for flavonoid biosynthesis?

In wine grape production, canopy management practices are applied to control the source-sink balance and improve the cluster microclimate to enhance berry composition. The aim of this study was to identify the optimal ranges of berry solar radiation exposure (exposure) for upregulation of flavonoid biosynthesis and thresholds for their degradation, and to evaluate how canopy management practices such as leaf removal, shoot thinning, and a combination of both affect the grapevine (Vitis vinifera L. cv. Cabernet Sauvignon) yield components, berry composition, and flavonoid profile in the context of climate change. The first experiment assessed changes in the grape flavonoid content driven by four degrees of exposure. In the second experiment, individual grape berries subjected to different exposures were collected from two cultivars (Cabernet Sauvignon and Petit Verdot). The third experiment compared three canopy management treatments, (i) LR (removal of 5 to 6 basal leaves), (ii) ST (thinned to 24 shoots per vine), and (iii) LRST (a combination of LR and ST), with an untreated control (UNT). Berry composition, flavonoid content and profiles, and 3-isobutyl 2-methoxypyrazine were monitored during berry ripening. Although increasing canopy porosity through canopy management practices can be helpful for other purposes, this may not be the case for flavonoid compounds once a certain proportion of kaempferol is reached. Our results revealed different sensitivities to degradation within the flavonoid groups, flavonols being the only monitored group that was upregulated by solar radiation. Among the canopy management practices, the main effects were due to ST. Under the environmental conditions of this trial, ST and LRST hastened fruit maturity; however, a clear improvement of the flavonoid compounds (i.e., greater anthocyanin) was not observed at harvest. Methoxypyrazine berry content decreased with the canopy management practices studied.
Although some berry traits were improved (i.e. a 2.5° Brix increase in berry total soluble solids) by canopy management practices (ST), this came with a four-fold increase in labor operations cost, a two-fold decrease in yield and a 10-fold increase in anthocyanin production cost per hectare, which should be assessed together as the climate continues to warm.