WAC 2022 banner
IVES 9 IVES Conference Series 9 WAC 9 WAC 2022 9 3 - WAC - Oral 9 Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies including High Performance Liquid Chromatography (HPLC) can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques including UV-VIS and Near and Mid Infrared spectroscopy are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Terlodego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150 fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and Coefficient of Variation (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10% we acknowledge that both the grape extraction method repeatability and HPLC reference method repeatability likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins. Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True negative results for the Prediction Data for all varieties. We conclude that the A-TEEM method requires a minimum of sample preparation and rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.

DOI:

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam, Gilmore, Qiang, Sui

Presenting author

Adam, Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Contact the author

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins- Tannins-Grape Variety

Tags

IVES Conference Series | WAC 2022

Citation

Related articles…

The combined effects of climate, soils, and deficit irrigation on yield and quality of Touriga Nacional under high atmospheric demand in the Douro Region

Global warming is one of the biggest environmental, social and economic threats in several viticultural regions. In the Douro Valley, changes are expected in the coming years, namely an increase in temperature and a decrease in precipitation. These changes are likely to have consequences for the production and quality of wine.
The aim of this study was to explore the effects of different soil characteristics combined with several deficit irrigation strategies, managed throughout ETc references and predawn leaf water potentials thresholds, on physiology, yield, and qualitative attributes on the Touriga Nacional variety under years of mild to severe water and heat stress.
The studies were conducted over seven years (2015 to 2021) in two plots of a commercial vineyard located at Quinta do Ataíde (Symington Family Estates) planted in 2011 and 2014 at 170 meters elevation, growing under three water regimes: non-irrigated (NI) and two deficit irrigation strategies (30% and 60% ETc) assessed weekly by Ψpd. The site has an annual rainfall below 500 mm, with high atmospheric demand. Climate data was collected from a weather station, located on site. Berry ripening was followed weekly for fruit analysis. At harvest, yield, vigour and pruning weight per vine were determined from 90 vines by treatment. Each season at veraison the NDVI Index was accessed by a drone. The soils physic-chemistry in the experimental blocs were analysed and grouped by SWHC. Delta C-13 analyses were also performed per treatment in two years.Irrigation had a positive effect on yield per vine, mostly due to an increase in berry and cluster weight, and fertility index through the years. A significant increase in sugar content, colour and phenols was observed with deficit irrigation in some years, but vine vigour related to soil characteristics had by far the greatest impact on quality.

Upscaling the integrated terroir zoning through digital soil mapping: a case study in the Designation of Origin Campo de Borja

homogeneous zones by intersecting several partial zonings of major factors that influence vineyard growth. Each of them follows specific process from their corresponding disciplines. Soil zoning specifically refers to a Soil Resource Inventory map that has traditionally been generated by conventional soil mapping methods. These methods have shortcomings in reaching fine cartographic and categorical details and involve significant expenses, which undermines their applicability. A new framework named Digital Soil Mapping has introduced quantitative models by statistical techniques to establish soil-landscape relationships and is able to provide intensive scale cartography.

In the present study, a microzoning at 1:10.000 scale is generated from an initial zoning, where the conventional soil map with polytaxic map units is replaced by a new one from digital techniques that disaggregates them. The comparison between the zonings considers a quantitative evaluation of capability for each Homogeneous Terroir Unit by means of the Viticultural Quality Index and its categorization based on its distribution by map. The spatial intersection of both maps gives rise to a confusion matrix in which the flows of class variations after the substitution are assessed.

The results show a five-fold increase in the number of Homogeneous Terroir Units identified and a larger differentiation among them, evidenced by a wider range in the capability index distribution. Both elements are accompanied by an increase in the detection of areas of higher potential within previously undervalued uniform zones.These features are a direct effect of the improvements brought by Digital Soil Mapping techniques and would verify the advantages of their implementation in the Integrated Terroir zoning. Eventually, such new highly detailed terroir units would benefit precision viticulture and sustainable management practices.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Local ancient grapevine cultivars to face future viticulture

Among the different strategies to cope with the negative impacts of climate change on viticulture, the exploitation of genetic diversity is one of the most promising to adapt to new conditions and maintain wine production and quality. One of the biggest concerns in the context of climate change is to improve water use efficiency (WUE). In this way, the use of genotypes that present a better response to drought and high WUE is a key issue. In this work, physiological performance analysis was conducted to compare the water deficit stress (WDS) responses of local and widespread grapevines cultivars. Leaf gas exchange, water use efficiency (WUE) at different levels (leaf and long-term WUE (∆13C)), leaf osmotic adjustment and other water relations parameters were determined in plants under well-watered and WDS conditions alongside assessment of the levels of foliar hormones concentrations. Results denote that local cultivars displayed better physiological performance under WDS as compared to the widely-distributed ones. he results corroborate the hypothesis that better stomatal control allows increasing leaf WUE under drought as occurred in the local Callet cv.; but the minority local cultivar Escursac cv. showed high WUE under both treatments. In this case, high WUE can be related to maintaining higher photosynthetic activity under drought. The different mechanisms underlying the better performance under WDS and high WUE of minority local cultivars are discussed.

Differential responses of red and white grape cultivars trained to a single trellis system – the VSP

Commercial grape production relies on training grapevine cultivars onto a variety of trellis systems. Training allows for well-lit leaves and clusters, maximizing fruit quality in addition to facilitating cultivation, harvesting, and diseases control. Although grapevines can be trained onto an infinite variety of trellis systems, most red and white cultivars are trained to the standard VSP (Vertical Shoot Positioning) system. However, red and white cultivars respond differently to VSP in fruit composition and growth characteristics, which are yet to be fully understood. Therefore, the objective of this study was to examine the influence of the VSP trellis system on fruit composition of three red, Cabernet Sauvignon, Merlot and Syrah, and three white, Chardonnay, Riesling, and Gewurztraminer cultivars grown under uniform growing conditions in the same vineyard. All cultivars were monitored for maturity and harvested at their physiologically maximum possible sugar concentration to compare various fruit quality attributes such as Brix, pH, TA, malic and tartaric acids, glucose and fructose, potassium, YAN, and phenolic compounds including total anthocyanins, anthocyanin profile, and tannins. A distinct pattern in fruit composition was observed in each cultivar. In regards to growth characteristics, Syrah grew vigorously with the highest cluster weight. Although all cultivars developed pyriform seeds, the seed size and weight varied among all cultivars. Also varied were mesocarp cell viability, brush morphology, and cane structure. This knowledge of the canopy architectural characteristics assessed by the widely employed fruit compositional attributes and growth characteristics will aid the growers in better management of the vines in varied situations.