WAC 2022 banner
IVES 9 IVES Conference Series 9 WAC 9 WAC 2022 9 3 - WAC - Oral 9 Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Accurate Quantification of Quality Compounds and Varietal Classification from Grape Extracts using the Absorbance-Transmittance Fluorescence Excitation Emission Matrix (A-TEEM) Method and Machine Learning

Abstract

Rapid and accurate quantification of grape berry phenolics, anthocyanins and tannins, and identification of grape varieties are both important for effective quality control of harvesting and initial processing for wine making. Current reference technologies including High Performance Liquid Chromatography (HPLC) can be rate limiting and too complex and expensive for effective field operations. Secondary calibrated techniques including UV-VIS and Near and Mid Infrared spectroscopy are insensitive to specific quality compounds and unable to make accurate varietal assignments. In this paper we analyze robotically prepared grape extracts from several key varieties (n=Calibration/p=Prediction samples) including Cabernet sauvignon (64/10), Grenache (16/4), Malbec (14/4), Merlot (56/10), Petit syrah (52/10), Pinot noir (54/8), Syrah (20/2), Terlodego (14/2) and Zinfandel (62/12). Key phenolic and anthocyanin parameters measured by HPLC included Catechin, Epicatechin, Quercetin Glycosides, Malvidin 3-glucoside, Total Anthocyanins and Polymeric Tannins. Separate samples diluted 150 fold in 50% EtOH pH 2 were analyzed in parallel using the A-TEEM method following Multiblock Data Fusion of the absorbance and unfolded EEM data. A-TEEM chemical data were calibrated (n=390) using Extreme Gradient Boost (XGB) Regression and evaluated based on the Root Mean Square Error of the Prediction (RMSEP), the Relative Error of Prediction (REP%) and Coefficient of Variation (R2P) of the Prediction data (n=62). The regression results yielded an average REP% value of 6.0±2.4% and R2P of 0.941±0.024. While we consider the REP% values to be in the acceptable range at <10% we acknowledge that both the grape extraction method repeatability and HPLC reference method repeatability likely contributed the major sources of variation; e.g., A-TEEM sample REP%=1.31 for Polymeric Tannins. Varietal classification was analyzed using XGB discrimination analysis of the Multiblock data and evaluated based on the Prediction data. The classification results yielded 100% True Positive and True negative results for the Prediction Data for all varieties. We conclude that the A-TEEM method requires a minimum of sample preparation and rapid acquisition times (<1 min) and can serve as an accurate secondary method for both grape composition and varietal identification. Importantly, the application of the regression and classification models can be effectively automated for operators.

DOI:

Publication date: June 13, 2022

Issue: WAC 2022

Type: Article

Authors

Adam, Gilmore, Qiang, Sui

Presenting author

Adam, Gilmore – HORIBA Instruments Inc.

E & J Gallo Wines

Contact the author

Keywords

Extreme Gradient Boost – Phenolics – Anthocyanins- Tannins-Grape Variety

Tags

IVES Conference Series | WAC 2022

Citation

Related articles…

Anthocyanin profile is differentially affected by high temperature, elevated CO2 and water deficit in Tempranillo (Vitis vinifera L.) clones

Anthocyanin potential of grape berries is an important quality factor in wine production. Anthocyanin concentration and profile differ among varieties but it also depends on the environmental conditions, which are expected to be greatly modified by climate change in the future. These modifications may significantly modify the biochemical composition of berries at harvest, and thus wine typicity. Among the diverse approaches proposed to reduce the potential negative effects that climate change may have on grape quality, genetic diversity among clones can represent a source of potential candidates to select better adapted plant material for future climatic conditions. The effects of individual and combined factors associated to climate change (increase of temperature, rise of air CO2 concentration and water deficit) on the anthocyanin profile of different clones of Tempranillo that differ in the length of their reproductive cycle were studied. The aim was to highlight those clones more adapted to maintain specific Tempranillo typicity in the future. Fruit-bearing cuttings were grown in controlled conditions under two temperatures (ambient temperature versus ambient temperature + 4ºC), two CO2 levels (400 ppm versus 700 ppm) and two water regimes (well-watered versus water deficit), both in combination or independently, in order to simulate future climate change scenarios. Elevated temperature increased anthocyanin acylation, whereas elevated CO2 and water deficit favoured the accumulation of malvidin derivatives, as well as the acylation and tri-hydroxylation level of anthocyanins. Although the changes in anthocyanin profile observed followed a common pattern among clones, such impact of environmental conditions was especially noticeable in one of the most widely distributed Tempranillo clones, the accession RJ43.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Genotypic variability in root architectural traits and putative implications for water uptake in grafted grapevine

Root system architecture (RSA) is important for soil exploration and edaphic resources acquisition by the plant, and thus contributes largely to its productivity and adaptation to environmental stresses, particularly soil water deficit. In grafted grapevine, while the degree of drought tolerance induced by the rootstock has been well documented in the vineyard, information about the underlying physiological processes, particularly at the root level, is scarce, due to the inherent difficulties in observing large root systems in situ. The objectives of this study were to determine genetic differences in the root architectural traits and their relationships to water uptake in two Vitis rootstocks genotypes (RGM, 140Ru) differing in their adaptation to drought. Young rootstocks grafted upon the Riesling variety were transplanted into cylindrical tubes and in 2D rhizotrons under two conditions, well watered and moderate water stress. Root traits were analyzed by digital imaging and the amount of transpired water was measured gravimetrically twice a week. Root phenotyping after 30 days reveal substantial variation in RSA traits between genotypes despite similar total root mass; the drought-tolerant 140Ru showed higher root length density in the deep layer, while the drought-sensitive RGM was characterised by shallow-angled root system development with more basal roots and a larger proportion of fine roots in the upper half of the tube. Water deficit affected canopy size and shoot mass to a greater extent than root development and architectural-related traits for both 140Ru and RGM, suggesting vertical distribution of roots was controlled by genotype rather than plasticity to soil water regime. The deeper root system of 140Ru as compared to RGM correlated with greater daily water uptake and sustained stomata opening under water-limited conditions but had little effect on above-ground growth. Our results highlight that grapevine rootstocks have constitutively distinct RSA phenotypes and that, in the context of climate change, those that develop an extensive root network at depth may provide a desirable advantage to the plant in coping with reduced water resources.

Bioclimatic shifts and land use options for Viticulture in Portugal

Land use, plays a relevant role in the climatic system. It endows means for agriculture practices thus contributing to the food supply. Since climate and land are closely intertwined through multiple interface processes, climate change may lead to significant impacts in land use. In this study, 1-km observational gridded datasets are used to assess changes in the Köppen–Geiger and Worldwide Bioclimatic (WBCS)