Macrowine 2021
IVES 9 IVES Conference Series 9 Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Abstract

AIM: This study aimed to use simultaneous measurements of absorbance, transmittance, and fluorescence excitation-emission matrix (A-TEEM) combined with chemometrics as a rapid method to authenticate wines from three vintages within a single geographical indication (GI) according to their subregional variations.

METHODS: The A-TEEM technique (Gilmore, Akaji, & Csatorday, 2017) has been applied to analyse experimental Shiraz wines (n = 186) from six subregions of Barossa Valley, South Australia, from 2018, 2019 and 2020 vintages. Absorbance spectra and EEM fingerprints of the wines were recorded and the data were fused for multivariate statistical modelling with extreme gradient boost discriminant analysis (XGBDA) as reported by Ranaweera, Gilmore, Capone, Bastian, and Jeffery (2021) to classify wine according to their subregions. The cross-validated (k =10, Venetian blinds) confusion matrix score probabilities of classes were used to assess the accuracy of the classification models. A similar procedure was also carried out to discriminate subregions for a single vintage year. Basic chemical parameters (alcohol %v/v, pH, titratable acidity, and volatile acidity) were modelled with the partial least squares regression (PLSR) using A-TEEM data and reference chemical data.

RESULTS: Results have shown an unprecedented 100% correct classification of wines according to subregion across the three vintages and 98% accuracy for subregion in a single vintage year. Other model performance parameters of confusion matrix, including sensitivity, specificity, precision, and F1 score, were also showing the highest values (1.0) for each of the subregions. PLSR modelling revealed that A-TEEM data can also be used for a rapid assessment of basic wine chemical parameters. Notably, the results confirmed a distinct resolution among subregions despite their relatively close proximity within a single GI, indicating the effect of terroir on intraregional variation.

CONCLUSIONS

The sensitivity of A-TEEM allied with multivariate statistical analysis of fluorescence data facilitated the accurate classification of Shiraz wines according to the subregion of origin and production year. As a robust analytical method, A-TEEM can help identify the drivers of regional expression of wine and can potentially be developed for use within the supply chain to guarantee the provenance indicated on the label and to provide an assurance of quality. Overall, A-TEEM with XGBDA modelling continues to be shown as an accurate wine authentication tool that could even be applied at a subregional level.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Ruchira Ranaweera

Department of Wine Science, The University of Adelaide, South Australia, Australia,Adam GILMORE, Horiba Instruments Inc., Piscataway, New Jersey, USA Dimitra CAPONE, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide Susan BASTIAN, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide David JEFFERY, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

Contact the author

Keywords

geographical indication, authenticity, subregion, excitation-emission matrix, chemometrics, terroir

Citation

Related articles…

Vineyards and clay minerals: multi-technique analytical approach and correlations with soil properties

Purpose of this research is to quantitatively assess the mineral component of vineyard soils, with particular attention to the mineralogical analysis of clays, which represent an element of high importance in the vineyard culture as well as in general agriculture. An X-ray diffraction (XRD) / thermogravimetric (TG) multi-technique analytical approach was developed, tested on soil samples taken from vineyards around the world. This codified analytical procedure was necessary to obtain precise qualitative and quantitative mineralogical data, globally comparable to distinguish the geopedological identity of the vineyards. Soil samples from vineyards of various locations were analysed, in very different geological conditions. The bulk-rock quantitative phase analysis (QPA) was obtained by the Rietveld method while the detailed composition of the clay-sized fraction was determined by modelling of the oriented X-ray diffraction patterns. The research provided a precise classification of the mineral component of soils, distinguishing the mineral phases of the clays and the so-called mixed-layer clay minerals. We found that the content in mixed layers can be directly correlated with the water retention and the cation exchange capacity ​​of the soil, while the presence of other clayey minerals and phyllosilicates in this research did not affect this CEC parameter, which codes the fertility level of the soils. The study demonstrates that terroir, in particular soils formed in complex or very different geological conditions, can only be effectively interpreted by properly analysing its mineral phases, in particular the mixed-layer clay component. These are characteristic abiotic ecological indicators, which may have specific eco-physiological influences on the plant.

Co-design and evaluation of spatially explicit strategies of adaptation to climate change in a Mediterranean watershed

Climate change challenges differently wine growing systems, depending on their biophysical, sociological and economic features. Therefore, there is a need to locally design and evaluate adaptation strategies combining several technical options, and considering the local opportunities and constraints (e.g. water access, wine typicity). The case study took place in a typical and heterogeneous Mediterranean vineyard of 1,500 ha in the South of France. We developed a participatory modeling approach to (1) conceptualize local climate change issues and design spatially explicit adaptation strategies with stakeholders, (2) numerically evaluate their effects on phenology, yield and irrigation needs under the high-emissions climate change scenario RCP 8.5, and (3) collectively discuss simulation results. We organized five sets of workshops, with in-between modeling phases. A process-based model was developed that allowed to evaluate the effects of six technical options (late varieties, irrigation, water saving by reducing canopy size, adjusting cover cropping, reducing density, and shading) with various distributions in the watershed, as well as vineyard relocation. Overall, we co-designed three adaptation strategies. Delay harvest strategy with late varieties showed little effects on decreasing air temperature during ripening. Water constraint limitation strategy would compensate for production losses if disruptive adaptations (e.g. reduced density) were adopted, and more land got access to irrigation. Relocation strategy would foster high premium wine production in the constrained mountainous areas where grapevine is less impacted by climate change. This research shows that a spatial distribution of technical changes gives room for adaptation to climate change, and that the collaboration with local stakeholders is a key to the identification of relevant adaptation. Further research should explore the potential of adaptation strategies based on soil quality improvement and on water stress tolerant varieties.

The plantation frame as a measure of adaptation to climate change

The mechanization of vineyard work originally led to a reduction in planting densities due to the lack of machinery adapted to the vineyard. The current availability of specific machinery makes it possible to establish higher planting densities. In this work, three planting densities (1.40×0.80 m, 1.80×1 m and 2.20×1.20 m, corresponding to 8928, 5555 and 3787 plants/ha respectively) were studied with four varieties autochthonous of Galicia (northwestern Spain): Albariño and Treixadura (white), Sousón and Mencía (red). The vines were trained in a vertical shoot positioning system using a single Royat cordon, and pruned to spurs with two buds each. Agronomic data (yield, pruning wood weight, Ravaz index) and oenological data in must were collected. The higher planting density (1.40×0.80 m) had no significant effect on grape yield per vine in white varieties, although production per hectare was much higher due to the greater number of plants. In red varieties, this planting density resulted in a significantly lower production per vine, compensated by the greater number of plants. In addition, it significantly reduced the Brix degree in the must of the Albariño, Treixadura and Sousón varieties, and increased the total acidity in the latter two and Mencía. It also caused an increase in extractable and total anthocyanins and IPT in red grapes. The effects of high planting density on grapes are of great interest for the adaptation of varieties in the context of climate change. In the future, it could be advisable to modify the limits imposed by the appellations of origin on the planting density of these varieties in order to obtain more balanced wines.

Sustaining wine identity through intra-varietal diversification

With contemporary climate change, cultivated Vitis vinifera L. is at risk as climate is a critical component in defining ecologically fitted plant materiel. While winegrowers can draw on the rich diversity among grapevine varieties to limit expected impacts (Morales-Castilla et al., 2020), replacing a signature variety that has created a sense of local distinctiveness may lead to several challenges. In order to sustain wine identity in uncertain climate outcomes, the study of intra-varietal diversity is important to reflect the adaptive and evolutionary potential of current cultivated varieties. The aim of this ongoing study is to understand to what extent can intra-varietal diversity be a climate change adaptation solution. With a focus on early (Sauvignon blanc, Riesling, Grolleau, Pinot noir) to moderate late (Chenin, Petit Verdot, Cabernet franc) ripening varieties, data was collected for flowering and veraison for the various studied accessions (from conservatory plots) and clones. For these phenological growing stages, heat requirements were established using nearby weather stations (adapted from the GFV model, Parker et al., 2013) and model performances were verified. Climate change projections were then integrated to predict the future behaviour of the intra-varietal diversity. Study findings highlight the strong phenotypic diversity of studied varieties and the importance of diversification to enhance climate change resilience. While model performances may require improvements, this study is the first step towards quantifying heat requirements of different clones and how they can provide adaptation solutions for winegrowers to sustain local wine identity in a global changing climate. As genetic diversity is an ongoing process through point mutations and epigenetic adaptations, perspective work is to explore clonal data from a wide variety of geographic locations.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.