WAC 2022 banner
IVES 9 IVES Conference Series 9 WAC 9 WAC 2022 9 3 - WAC - Oral 9 Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies including High Performance Liquid Chromatography (HPLC) can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques including UV-VIS and Near and Mid Infrared spectroscopy are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Terlodego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150 fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and Coefficient of Variation (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10% we acknowledge that both the grape extraction method repeatability and HPLC reference method repeatability likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins. Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True negative results for the Prediction Data for all varieties. We conclude that the A-TEEM method requires a minimum of sample preparation and rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.

DOI:

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam, Gilmore, Qiang, Sui

Presenting author

Adam, Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Contact the author

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins- Tannins-Grape Variety

Tags

IVES Conference Series | WAC 2022

Citation

Related articles…

Organic recycled mulches in sustainable viticulture: assessment of spontaneous plants communities and weed coverage

In recent years, developing more efficient and sustainable viticulture management has been essential due to the impact of climate change in semiarid regions. For this reason, the use of recycled organic mulching (ROM) in the vineyard has become an interesting strategy to cope with water stress, isolated soil from extreme temperatures and improving soil humidity, control the presence of weeds and therefore reduce the inputs of herbicides and improve soil fertility. This work aimed to analyse the effect of three different organic mulches [straw (S), grape pruning debris (GPD) and spent mushroom compost (SMC)] and two traditional soil management techniques [herbicide (H) and interrow (IN)] on weed coverage and the spontaneous plant communities’ presence. Data sampling was collected throughout the vine vegetative cycle of 2021 in La Rioja, Spain. The different soil management techniques had a clear effect on weed coverage and his development during the vine vegetative cycle. SMC and H were the treatments with the highest and the lowest coverage percentage, respectively. IN had a delayed weed emergence at the beginning of the vine vegetative cycle, but finally it reached maximum values nearby SMC. GPD and S had similar effects on weed emergence, reaching 25-30% of the maximum coverage values. A total of 29 herbaceous species were identified during the vegetative cycle, some of them very isolated and occasional. Principal component analysis (PCAs) showed a good association between spontaneous species and treatments, furthermore, specific species-treatment associations were found. Moreover, three clear groups of herbaceous communities were identified by cluster analysis. This study provides interesting information about the effect of different alternative soil management on herbaceous plant coverage and weed species communities which could contribute to making more sustainable viticulture.

Delaying irrigation initiation linearly reduces yield with little impact on maturity in Pinot noir

When to initiate irrigation is a critical annual management decision that has cascading effects on grapevine productivity and wine quality in the context of climate change. A multi-site trial was begun in 2021 to optimize irrigation initiation timing using midday stem water potential (ψstem) thresholds characterized as departures from non-stressed baseline ψstemvalues (Δψstem). Plant material, vine and row spacing, and trellising systems were concomitant among sites, while vine age, soil type, and pruning systems varied. Five target Δψstem thresholds were arranged in an RCBD and replicated eight times at each site: 0.2, 0.4, 0.6, 0.8, and 1.0 MPa (T1, T2, T3, T4, and T5, respectively). When thresholds were reached, plots were irrigated weekly at 70% ETc. Yield components and berry composition were quantified at harvest. To better generalize inferences across sites, data were analyzed by ANOVA using a mixed model including site as a random factor. Across sites, irrigation was initiated at Δψstem = 0.24, 0.50, 0.65, 0.93, and 0.98 MPa for T1, T2, T3, T4, and T5, respectively. Consistent significant negative linear trends were found for several key yield and berry composition variables. Yield decreased by 12.9, 15.9, 19.5, and 27.4% for T2, T3, T4, and T5, respectively, compared to T1 (p < 0.0001) across sites that were driven by similarly linear reductions in berry weight (p < 0.0001). Comparatively, berry composition varied little among treatments. Juice total soluble solids decreased linearly from T1 to T5 – though only ranged 0.9 Brix (p = 0.012). Because producers are paid by the ton, and contracts simply stipulate a target maturity level, first-year results suggest that there is no economic incentive to induce moderate water deficits before irrigation initiation, regardless of vineyard site. Subsequent years will further elucidate the carryover effects of delaying irrigation initiation on productivity over the long term.

Anthocyanin profile is differentially affected by high temperature, elevated CO2 and water deficit in Tempranillo (Vitis vinifera L.) clones

Anthocyanin potential of grape berries is an important quality factor in wine production. Anthocyanin concentration and profile differ among varieties but it also depends on the environmental conditions, which are expected to be greatly modified by climate change in the future. These modifications may significantly modify the biochemical composition of berries at harvest, and thus wine typicity. Among the diverse approaches proposed to reduce the potential negative effects that climate change may have on grape quality, genetic diversity among clones can represent a source of potential candidates to select better adapted plant material for future climatic conditions. The effects of individual and combined factors associated to climate change (increase of temperature, rise of air CO2 concentration and water deficit) on the anthocyanin profile of different clones of Tempranillo that differ in the length of their reproductive cycle were studied. The aim was to highlight those clones more adapted to maintain specific Tempranillo typicity in the future. Fruit-bearing cuttings were grown in controlled conditions under two temperatures (ambient temperature versus ambient temperature + 4ºC), two CO2 levels (400 ppm versus 700 ppm) and two water regimes (well-watered versus water deficit), both in combination or independently, in order to simulate future climate change scenarios. Elevated temperature increased anthocyanin acylation, whereas elevated CO2 and water deficit favoured the accumulation of malvidin derivatives, as well as the acylation and tri-hydroxylation level of anthocyanins. Although the changes in anthocyanin profile observed followed a common pattern among clones, such impact of environmental conditions was especially noticeable in one of the most widely distributed Tempranillo clones, the accession RJ43.

Grapevine xylem embolism resistance spectrum reveals which varieties have a lower mortality risk in a future dry climate

Wine growing regions have recently faced intense and frequent droughts that have led to substantial economical losses, and the maintenance of grapevine productivity under warmer and drier climate will rely notably on planting drought-resistant cultivars. Given that plant growth and yield depend on water transport efficiency and maintenance of photosynthesis, thus on the preservation of the vascular system integrity during drought, a better understanding of drought-related hydraulic traits that have a significant impact on physiological processes is urgently needed. We have worked towards this end by assessing vulnerability to xylem embolism in 30 grapevine commercial varieties encompassing red and white Vitis vinifera varieties, hybrid varieties characterized by a polygenic resistance for powdery and downy mildew, and commonly used rootstocks. These analyses further allowed a global assessment of wine regions with respect to their varietal diversity and resulting vulnerability to stem embolism. Hybrid cultivars displayed the highest vulnerability to embolism, while rootstocks showed the greatest resistance. Significant variability also arose among Vitis vinifera varieties, with Ψ12 and Ψ50 values ranging from -0.4 to -2.7 MPa and from -1.8 to -3.4 MPa, respectively. Cabernet franc, Chardonnay and Ugni blanc featured among the most vulnerable varieties while Pinot noir, Merlot and Cabernet Sauvignon ranked among the most resistant. In consequence, wine regions bearing a significant proportion of vulnerable varieties, such as Poitou-Charentes, France and Marlborough, New Zealand, turned out to be at greater risk under drought. These results highlight that grapevine varieties may not respond equally to warmer and drier conditions, outlining the importance to consider hydraulic traits associated with plant drought tolerance into breeding programmes and modeling simulations of grapevine yield maintenance under severe drought. They finally represent a step forward to advise the wine industry about which varieties and regions would have the lowest risk of drought-induced mortality under climate change.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.