Macrowine 2021
IVES 9 IVES Conference Series 9 Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Abstract

AIM: This study aimed to use simultaneous measurements of absorbance, transmittance, and fluorescence excitation-emission matrix (A-TEEM) combined with chemometrics as a rapid method to authenticate wines from three vintages within a single geographical indication (GI) according to their subregional variations.

METHODS: The A-TEEM technique (Gilmore, Akaji, & Csatorday, 2017) has been applied to analyse experimental Shiraz wines (n = 186) from six subregions of Barossa Valley, South Australia, from 2018, 2019 and 2020 vintages. Absorbance spectra and EEM fingerprints of the wines were recorded and the data were fused for multivariate statistical modelling with extreme gradient boost discriminant analysis (XGBDA) as reported by Ranaweera, Gilmore, Capone, Bastian, and Jeffery (2021) to classify wine according to their subregions. The cross-validated (k =10, Venetian blinds) confusion matrix score probabilities of classes were used to assess the accuracy of the classification models. A similar procedure was also carried out to discriminate subregions for a single vintage year. Basic chemical parameters (alcohol %v/v, pH, titratable acidity, and volatile acidity) were modelled with the partial least squares regression (PLSR) using A-TEEM data and reference chemical data.

RESULTS: Results have shown an unprecedented 100% correct classification of wines according to subregion across the three vintages and 98% accuracy for subregion in a single vintage year. Other model performance parameters of confusion matrix, including sensitivity, specificity, precision, and F1 score, were also showing the highest values (1.0) for each of the subregions. PLSR modelling revealed that A-TEEM data can also be used for a rapid assessment of basic wine chemical parameters. Notably, the results confirmed a distinct resolution among subregions despite their relatively close proximity within a single GI, indicating the effect of terroir on intraregional variation.

CONCLUSIONS

The sensitivity of A-TEEM allied with multivariate statistical analysis of fluorescence data facilitated the accurate classification of Shiraz wines according to the subregion of origin and production year. As a robust analytical method, A-TEEM can help identify the drivers of regional expression of wine and can potentially be developed for use within the supply chain to guarantee the provenance indicated on the label and to provide an assurance of quality. Overall, A-TEEM with XGBDA modelling continues to be shown as an accurate wine authentication tool that could even be applied at a subregional level.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Ruchira Ranaweera

Department of Wine Science, The University of Adelaide, South Australia, Australia,Adam GILMORE, Horiba Instruments Inc., Piscataway, New Jersey, USA Dimitra CAPONE, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide Susan BASTIAN, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide David JEFFERY, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

Contact the author

Keywords

geographical indication, authenticity, subregion, excitation-emission matrix, chemometrics, terroir

Citation

Related articles…

Effect of vigour and number of clusters on eonological parameters and metabolic profile of Cabernet Sauvignon red wines

Vegetative growth and yield are reported to affect grape and wine quality. They can be controlled through different techniques linked to vine management. The objective of this research was to determine the effect of vine vigour and number of clusters per vine on physicochemical composition and phenolic profile of red wines. The experiment was carried out during two vegetative cycles, with cv. Cabernet Sauvignon grafted onto Paulsen 1103. Three vine vigour were defined, according to shoot weight at previous harvests, being low, medium and high. Five treatments of number of clusters were used for each vigour, with 15, 22, 29, 36, and 45 clusters per vine. Grapes from all treatments were harvested in the same day from Brix and total acidity criteria. Thirty days after bottling, classical analyzes and phenolic compounds were performed. As results, different responses were obtained from each vintage. In 2020, a dry season from veraison to harvest, grapes and wines obtained from low vigour treatment and 45 clusters per vine was the highest in sugar and alcohol content respectively, while grapes and wines from high vigour and 15 clusters presented the lowest sugar and alcohol content. Total anthocyanins were higher in treatment with low vigour and 15 clusters, while the lowest amounts were found in low vigour with 45 clusters, as well as medium and high vigour with 36 clusters per vine. Total tannins were higher in high vigour with 22 clusters and medium vigour with 29 clusters, while were lower in low vigour with 36 clusters. In 2021, a wet season at harvest, responses were different, and great variations were observed between treatments. As conclusions, yield and vine vigour had strong influence on grape and wine quality, promoting different enological potentials on which can be indicated/used for aging strategies of red and even rosé wines.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Influence of a spontaneous cover crop on the vineyard and soil erosion under Mediterranean climate

Sixty five % of the agricultural area of the Basque Country located in the DO Ca Rioja corresponds to vineyards. More than 40% of it has an average slope greater than 10%, which makes it sensitive to erosive processes. Furthermore, it is foreseeable that extreme weather events (storms, hail, extreme heat and cold, etc.) will be favored due to climate change. Cover cropping can mitigate this risk, and therefore the objective of this work is to evaluate the impact that a vegetable cover has on the agronomic behavior of the vineyard, the quality of the grape and soil erosion. For this, a trial has been carried out with a Graciano variety vineyard with a slope between 10% -20% during the years 2020 and 2021. Conventional tillage management in the area has been compared (4-6 passes per year of tillage machinery) versus spontaneous vegetation cover management in the vineyard. This implies not tilling and allowing the grass of the land to colonize the range between the lines of vines, controlling their height through 1-3 mowing passes per year, always trying to affect the surface of the land as little as possible. The vegetative growth, yield and quality of the grape and wine was measured. Furthermore, erosion has been measured using Gerlasch boxes. The yield was lower in the second year of the trial in the cover crop treatment, but erosion was significantly reduced.

Effect of one-year cover crop and arbuscular mycorrhiza inocululation in the microbial soil community of a vineyard

The microbial composition of the soil is an important factor to consider in viticulture, since its influence on the “terroir” and on the organoleptic properties of the wine have been demonstrated. Different agronomic techniques have the potential to modify the composition and functionality of the soil microbial community. Maintaining green covers is known to increase soil microbial diversity. The direct application of inoculum of beneficial microorganisms to the soil has also been used to increase their abundance. However, the environmental conditions of each site seem to have a determining weight in the result of these practices. In this study, we compared the effect on the microbial community of a cover crop with legumes in autumn and the inoculation of grapevines with commercial inoculum bases on Rhizophagus irregularis and Funeliformis mosseae in the previous spring. The study has been carried out in a vineyard in Binissalem, Mallorca, Spain. After applying the treatments, we will analyze the soil microbial communities using the data obtained from Illumina amplification of soil DNA from the 16S and ITS regions to analyze bacteria and fungi community, respectively. In addition, we will record the physicochemical characteristics of the soil at each sampling point. The result showed that agronomic management, in the short term, has less influence than soil characteristics on the composition of the soil microbiome. With these results, we can conclude that in a vineyard, agricultural techniques should focus on improving the characteristics of the soil to improve the biodiversity of the soil microbiota.

Updating the Winkler index: An analysis of Cabernet sauvignon in Napa Valley’s varied and changing climate

This study aims to create an updated, agile viticultural climate index (similar to the Winkler Index) by performing in-depth analyses of current and historical data from industry partners in several major winegrowing regions. The Winkler Index was developed in the early twentieth century based on analysis of various grape-growing regions in California. The index uses heat accumulation (i.e. Growing Degree Days) throughout the growing season to determine which grape varieties are best suited to each region. As viticultural regions are increasingly subject to the complexity and uncertainty of a changing climate, a more rigorous, agile model is needed to aid grape growers in determining which cultivars to plant where. For the first phase of this study, 21 industry partners throughout Napa Valley shared historical phenology, harvest, viticultural practice, and weather data related to their Cabernet sauvignon vineyard blocks. To complement this data, berry samples were collected throughout the 2021 growing season from 50 vineyard blocks located throughout 16 American Viticultural Areas that were then analyzed for basic berry chemistry and phenolics. These blocks have been mapped using a Geographic Information System (GIS), enabling analysis of altitude, vineyard row orientation, slope, and remotely sensed climate data. Sampling sites were also chosen based on their proximity to a weather station. By analyzing historical data from industry partners and data specifically collected for this study, it is possible to identify key parameters for further analysis. Initial results indicate extreme variability at a high spatial resolution not currently accounted for in modern viticultural climate indices and suggest that viticultural practices play a major role. Using the structure of data collection and analyses developed for the first phase, this project will soon be expanded to other wine regions globally, while continuing data collection in Napa Valley.