Macrowine 2021
IVES Conference Series

Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Abstract

AIM: This study aimed to use simultaneous measurements of absorbance, transmittance, and fluorescence excitation-emission matrix (A-TEEM) combined with chemometrics as a rapid method to authenticate wines from three vintages within a single geographical indication (GI) according to their subregional variations.

METHODS: The A-TEEM technique (Gilmore, Akaji, & Csatorday, 2017) was applied to analyse experimental Shiraz wines (n = 186) from six subregions of the Barossa Valley, South Australia, from the 2018, 2019 and 2020 vintages. Absorbance spectra and EEM fingerprints of the wines were recorded, and the data were fused for multivariate statistical modelling with extreme gradient boosting discriminant analysis (XGBDA), as reported by Ranaweera, Gilmore, Capone, Bastian, and Jeffery (2021), to classify the wines according to subregion. The cross-validated (k = 10, Venetian blinds) confusion matrix score probabilities for each class were used to assess the accuracy of the classification models. A similar procedure was carried out to discriminate subregions within a single vintage year. Basic chemical parameters (alcohol % v/v, pH, titratable acidity, and volatile acidity) were modelled by partial least squares regression (PLSR) using A-TEEM data and reference chemical data.
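To illustrate the cross-validation scheme named above, the following minimal sketch shows how a Venetian blinds split can assign samples to k folds by interleaving: sample i goes to fold i mod k. The function names are hypothetical, and the abstract does not state which software or blind thickness was used, so this is one common reading of the scheme rather than the authors' implementation.

```python
def venetian_blinds_folds(n_samples, k):
    """Return k lists of test indices; sample i is assigned to fold i % k."""
    if not 1 < k <= n_samples:
        raise ValueError("k must satisfy 1 < k <= n_samples")
    return [list(range(fold, n_samples, k)) for fold in range(k)]


def train_test_splits(n_samples, k):
    """Yield (train_indices, test_indices) for each of the k folds."""
    for test in venetian_blinds_folds(n_samples, k):
        held_out = set(test)
        train = [i for i in range(n_samples) if i not in held_out]
        yield train, test


# Sizes taken from the abstract: 186 wines, k = 10.
folds = venetian_blinds_folds(186, 10)
fold_sizes = [len(fold) for fold in folds]
```

Because the assignment depends only on sample order, related samples stored next to each other (for example, replicates of one wine) can end up in different folds; whether that matters here depends on how the data were ordered, which the abstract does not say.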

RESULTS: The models achieved an unprecedented 100% correct classification of wines according to subregion across the three vintages and 98% accuracy for subregion within a single vintage year. Other confusion matrix performance parameters, including sensitivity, specificity, precision, and F1 score, also reached the highest possible value (1.0) for each of the subregions. PLSR modelling revealed that A-TEEM data can also be used for rapid assessment of basic wine chemical parameters. Notably, the results confirmed a distinct resolution among subregions despite their relatively close proximity within a single GI, indicating the effect of terroir on intraregional variation.
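For readers less familiar with the performance parameters listed above, the sketch below shows how sensitivity, specificity, precision, and F1 score can be derived per class from a multiclass confusion matrix using a one-versus-rest reading. The function names and the example matrix are hypothetical; they are not the study's data.

```python
def per_class_metrics(cm):
    """Per-class metrics from cm, where cm[i][j] counts true class i predicted as j."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    metrics = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                         # true class c, predicted otherwise
        fp = sum(cm[r][c] for r in range(n)) - tp    # other classes predicted as c
        tn = total - tp - fn - fp
        sensitivity = tp / (tp + fn) if tp + fn else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        precision = tp / (tp + fp) if tp + fp else 0.0
        denom = precision + sensitivity
        f1 = 2 * precision * sensitivity / denom if denom else 0.0
        metrics.append({"sensitivity": sensitivity, "specificity": specificity,
                        "precision": precision, "f1": f1})
    return metrics


def accuracy(cm):
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in cm)
    return sum(cm[i][i] for i in range(len(cm))) / total


# Hypothetical three-class matrix with two misclassified samples.
example_cm = [[10, 0, 0],
              [1, 8, 1],
              [0, 0, 10]]
example_metrics = per_class_metrics(example_cm)
```

When every sample falls on the diagonal, all four parameters equal 1.0 for every class, which is the situation the abstract describes.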

CONCLUSIONS: The sensitivity of A-TEEM, allied with multivariate statistical analysis of fluorescence data, facilitated the accurate classification of Shiraz wines according to the subregion of origin and production year. As a robust analytical method, A-TEEM can help identify the drivers of regional expression of wine and can potentially be developed for use within the supply chain to guarantee the provenance indicated on the label and to provide an assurance of quality. Overall, A-TEEM with XGBDA modelling continues to prove an accurate wine authentication tool that could be applied even at a subregional level.

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Ruchira Ranaweera, Department of Wine Science, The University of Adelaide, South Australia, Australia

Adam GILMORE, Horiba Instruments Inc., Piscataway, New Jersey, USA

Dimitra CAPONE, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

Susan BASTIAN, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

David JEFFERY, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

Keywords

geographical indication, authenticity, subregion, excitation-emission matrix, chemometrics, terroir

Related articles…

Understanding graft union formation by using metabolomic and transcriptomic approaches during the first days after grafting in grapevine

Since the arrival of phylloxera (Daktulosphaira vitifoliae) in Europe at the end of the 19th century, grafting has become essential to cultivate Vitis vinifera. Today, grafting not only provides resistance to this aphid, but is also used to adapt cultivars to the type of soil, environment, or grape production requirements by using a panel of rootstocks. In the context of vineyard decline, the importance of producing quality grafted grapevines to improve vineyard longevity is often mentioned but, to our knowledge, no study has been able to demonstrate that grafting has a role in this context. However, some scion/rootstock combinations are considered incompatible due to poor graft union formation and subsequently high plant mortality soon after grafting. In a context of climate change, where the creation of new cultivars and rootstocks is at the centre of research, the ability of new cultivars to be grafted is therefore essential. The early identification of graft incompatibility could allow non-viable plants to be identified before planting and would have a beneficial impact on research and development in the nursery sector. For this reason, our studies have focused on the identification of metabolic and transcriptomic markers of poor grafting success during the first days/week after grafting; we have identified correlations between some specialized metabolites, especially stilbenes, and grafting success, as well as an accumulation of some amino acids in the incompatible combination. The study of the metabolome and the transcriptome allowed us to understand and characterise the processes involved during graft union formation.

‘Cabernet Sauvignon’ (Vitis vinifera L.) berry skin flavonol and anthocyanin composition is affected by trellis systems and applied water amounts

Trellis systems are selected in wine grape vineyards mainly to maximize vineyard yield and maintain berry quality. This study was conducted in 2020 and 2021 to evaluate six commonly utilized trellis systems, including a vertical shoot positioning (VSP), two relaxed VSPs (VSP60 and VSP80), a single high wire (SH), a high quadrilateral (HQ), and a Guyot (GY), combined with three irrigation regimes based on different crop evapotranspiration (ETc) replacements: 25%, 50%, and 100% ETc. The results indicated that SH yielded the most fruit and accumulated the most total soluble solids (TSS) at harvest in 2020; however, it showed the lowest TSS in the second season. In 2020, SH and HQ showed higher concentrations of most of the anthocyanin derivatives compared to the VSPs. Similar differences were observed in 2021. SH and HQ also accumulated more flavonols in both years compared to other trellis systems. Overall, this study provides information on the efficacy of trellis systems on grapevine yield and berry flavonoid accumulation in a currently warming climate.

Spatiotemporal patterns of chemical attributes in Vitis vinifera L. cv. Cabernet Sauvignon vineyards in Central California

Spatial variability of vine productivity in winegrapes is important to characterise as both yield and quality are relevant for the production of different wine styles and products. The objectives were to understand how patterns of variability of Cabernet Sauvignon fruit composition changed over time and space, how these patterns could be characterised with indirect measurements, and how spatial patterns of the variation in fruit compositional attributes can aid in improving management. Prior to the 2017 vintage, 125 data vines were distributed across each of four vineyards in the Lodi American Viticultural Area (AVA) of California. Each data vine was sampled at commercial harvest in 2017, 2018, and 2019. Yield components and fruit composition were measured at harvest for each data vine, and maps of yield and fruit composition were produced for eight ‘objective measures of fruit quality’: total anthocyanins, polymeric tannins, quercetin glycosides, malic acid, yeast assimilable nitrogen, β-damascenone, C6 alcohols and aldehydes, and 3-isobutyl-2-methoxypyrazine. Patterns of variation in anthocyanins and phenolic compounds were found to be most stable over time. Given this relative stability, management decisions focused on fruit quality could be based on zonal descriptions of anthocyanins or phenolics to increase profitability in some vineyards. In each vineyard, dormant season pruning weights and soil cores were collected at each location, elevation and soil apparent electrical conductivity surveys were completed, and remotely sensed imagery was captured by fixed wing aircraft and two satellite platforms at major phenological stages. The data collected were used to develop relationships among biophysical data, soil, imagery, and fruit composition. 
The standardised and aggregated samples from four vineyards over three seasons were included in the estimation of ‘common variograms’ to assess how this technique could aid growers in producing geostatistically rigorous maps of fruit composition variability without cumbersome, single season sampling efforts.
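As a rough illustration of the pooling idea in the paragraph above, the sketch below standardises each vineyard-season dataset separately and then accumulates squared differences from all within-dataset pairs into shared distance bins, using the classical estimator of semivariance (half the mean squared difference between paired values). The function names, the binning, and the exact pooling rule are assumptions; the abstract does not describe how the 'common variograms' were computed.

```python
import math
import statistics


def standardise(values):
    """Scale values to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    if sd == 0:
        raise ValueError("cannot standardise values with zero spread")
    return [(v - mean) / sd for v in values]


def pooled_semivariogram(datasets, bin_width, n_bins):
    """Empirical semivariance per distance bin, pooled across datasets.

    datasets: list of (coords, values) pairs, one per vineyard and season.
    Pairs are formed only within a dataset, never across datasets.
    Bins with no pairs are reported as None.
    """
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for coords, values in datasets:
        z = standardise(values)
        for i in range(len(z)):
            for j in range(i + 1, len(z)):
                lag = math.dist(coords[i], coords[j])
                b = int(lag // bin_width)
                if b < n_bins:
                    sums[b] += (z[i] - z[j]) ** 2
                    counts[b] += 1
    return [s / (2 * c) if c else None for s, c in zip(sums, counts)]


# Hypothetical transect of three vines spaced one unit apart.
coords = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
values = [1.0, 2.0, 3.0]
gamma = pooled_semivariogram([(coords, values)], bin_width=1.0, n_bins=3)
```

Standardising before pooling puts attributes measured in different units or seasons on a comparable scale, which is what allows a single variogram to be estimated from several sparse datasets.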

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
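The first evaluation scheme described above (repeated random hold-out of known values, scored by normalized root mean squared error) can be sketched as follows. The abstract does not say how the error was normalised, so dividing by the standard deviation of the held-out true values is an assumption, as are the function names. The mean imputer is only a placeholder and is not one of the four methods compared in the abstract.

```python
import math
import random
import statistics


def nrmse(truth, imputed):
    """RMSE divided by the standard deviation of the true values (assumed convention)."""
    squared_errors = [(t - p) ** 2 for t, p in zip(truth, imputed)]
    spread = statistics.pstdev(truth)
    if spread == 0:
        raise ValueError("true values have zero spread; NRMSE is undefined")
    return math.sqrt(statistics.fmean(squared_errors)) / spread


def mean_imputer(observed, n_missing):
    """Placeholder imputer: fill every missing value with the observed mean."""
    return [statistics.fmean(observed)] * n_missing


def monte_carlo_holdout(values, imputer, fraction=0.10, repeats=1000, seed=0):
    """Average NRMSE over repeated random hold-outs of known values."""
    rng = random.Random(seed)
    n_held = max(2, round(fraction * len(values)))
    scores = []
    for _ in range(repeats):
        held = sorted(rng.sample(range(len(values)), n_held))
        held_set = set(held)
        observed = [v for i, v in enumerate(values) if i not in held_set]
        truth = [values[i] for i in held]
        scores.append(nrmse(truth, imputer(observed, n_held)))
    return statistics.fmean(scores)


# Hypothetical day-of-year observations for one phenological stage.
days = [100 + (i * 7) % 23 for i in range(40)]
score = monte_carlo_holdout(days, mean_imputer)
```

Any imputation method with the same `(observed, n_missing)` interface could be passed in place of `mean_imputer`, which makes the hold-out loop reusable for comparing methods on equal terms.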

Drought effect on aromatic and phenolic potential of seven recovered grapevine varieties in Castilla-La Mancha region (Spain)

The effects of climate change are seriously affecting the quality of wine grapes. High temperatures and drought cause imbalances in the chemical composition of grapes. The result is overripe grapes with low acidity and high sugar content, which produce wines with excessive alcohol content, lacking in freshness and not very aromatic. As a consequence, the search for varieties capable of producing quality grapes in adverse climate conditions is a good alternative to preserve the sustainability of vineyards. In this work, quality parameters of seven Vitis vinifera L. cultivars (five whites and two reds) recently recovered from extinction and grown under two different hydric regimes (rainfed and irrigated) were analyzed during the 2020 vintage. At harvest time, weight of 100 berries, must physicochemical parameters (°Brix, total acidity, malic acid, pH), and carbon and oxygen isotope ratios (δ13C, δ18O) were determined. Subsequently, varietal aroma potential index (IPAv) and total polyphenol index (TPI) were analyzed. Quality parameters, IPAv and TPI, showed significant differences between varieties and water regimes. Both red varieties, Moribel and Tinto Fragoso, stood out for their high aromatic and phenolic potential, which was higher under the rainfed regime. Regarding the white varieties, Montonera del Casar and Jarrosuelto stood out in terms of varietal aroma potential. Montonera del Casar showed high acidity in its musts, and Jarrosuelto showed the highest berry weights.