Macrowine 2021
IVES Conference Series

Fluorescence spectroscopy with xgboost discriminant analysis for intraregional wine authentication

Abstract

AIM: This study aimed to use simultaneous measurements of absorbance, transmittance, and fluorescence excitation-emission matrix (A-TEEM) combined with chemometrics as a rapid method to authenticate wines from three vintages within a single geographical indication (GI) according to their subregional variations.

METHODS: The A-TEEM technique (Gilmore, Akaji, & Csatorday, 2017) was applied to analyse experimental Shiraz wines (n = 186) from six subregions of the Barossa Valley, South Australia, from the 2018, 2019 and 2020 vintages. Absorbance spectra and EEM fingerprints of the wines were recorded, and the data were fused for multivariate statistical modelling with extreme gradient boosting discriminant analysis (XGBDA), as reported by Ranaweera, Gilmore, Capone, Bastian, and Jeffery (2021), to classify the wines according to subregion. The cross-validated (k = 10, Venetian blinds) confusion matrix score probabilities of the classes were used to assess the accuracy of the classification models. A similar procedure was also carried out to discriminate subregions within a single vintage year. Basic chemical parameters (alcohol % v/v, pH, titratable acidity, and volatile acidity) were modelled by partial least squares regression (PLSR) of the A-TEEM data against reference chemical data.
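The Venetian blinds cross-validation mentioned above can be illustrated with a minimal sketch. It assumes the common interleaved definition, in which every k-th sample is held out in the same split; the abstract does not state which software or exact settings were used, so the function name `venetian_blind_folds` and the choice of plain index lists are illustrative only.

```python
def venetian_blind_folds(n_samples, k=10):
    """Return k (train_idx, test_idx) pairs.

    Assumption: sample i is held out in split i % k (interleaved
    "Venetian blinds" pattern), and used for training in all other splits.
    """
    folds = []
    for fold in range(k):
        test_idx = [i for i in range(n_samples) if i % k == fold]
        train_idx = [i for i in range(n_samples) if i % k != fold]
        folds.append((train_idx, test_idx))
    return folds


# 186 wines and k = 10 are taken from the abstract; the sample order is hypothetical.
folds = venetian_blind_folds(186, k=10)
for fold_number, (train_idx, test_idx) in enumerate(folds):
    print(fold_number, len(train_idx), len(test_idx))
```

Because the held-out samples are interleaved, the result depends on how the samples are ordered. If replicates of the same wine sit next to each other, they can end up in different splits, which may make the cross-validated accuracy look better than it is.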

RESULTS: The models achieved an unprecedented 100% correct classification of wines according to subregion across the three vintages, and 98% accuracy for subregion within a single vintage year. Other performance parameters derived from the confusion matrix, including sensitivity, specificity, precision, and F1 score, also reached the highest possible value (1.0) for each of the subregions. PLSR modelling revealed that A-TEEM data can also be used for rapid assessment of basic wine chemical parameters. Notably, the results confirmed a distinct resolution among subregions despite their close proximity within a single GI, indicating the effect of terroir on intraregional variation.
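For readers unfamiliar with the performance parameters named above, the following sketch shows how sensitivity, specificity, precision, and F1 score are conventionally derived per class from a confusion matrix. The function name and the two small matrices are hypothetical and are not data from the study.

```python
def per_class_metrics(cm):
    """Compute one-vs-rest metrics for each class.

    cm[i][j] is the number of samples of true class i predicted as class j.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    metrics = []
    for c in range(n):
        tp = cm[c][c]                               # true positives
        fn = sum(cm[c]) - tp                        # class c predicted as another class
        fp = sum(cm[r][c] for r in range(n)) - tp   # other classes predicted as c
        tn = total - tp - fn - fp                   # everything else
        sensitivity = tp / (tp + fn) if tp + fn else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        precision = tp / (tp + fp) if tp + fp else 0.0
        denom = precision + sensitivity
        f1 = 2 * precision * sensitivity / denom if denom else 0.0
        metrics.append({
            "sensitivity": sensitivity,
            "specificity": specificity,
            "precision": precision,
            "f1": f1,
        })
    return metrics


# Hypothetical matrices: a perfect classifier and one with a few errors.
perfect = per_class_metrics([[5, 0], [0, 7]])
imperfect = per_class_metrics([[8, 2], [1, 9]])
print(perfect[0])
print(imperfect[0])
```

A purely diagonal confusion matrix, as in the first example, is the only case in which all four parameters equal 1.0 for every class.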

CONCLUSIONS: The sensitivity of A-TEEM, allied with multivariate statistical analysis of fluorescence data, enabled the accurate classification of Shiraz wines according to subregion of origin and production year. As a robust analytical method, A-TEEM can help identify the drivers of regional expression in wine, and could potentially be developed for use within the supply chain to guarantee the provenance indicated on the label and to provide an assurance of quality. Overall, A-TEEM with XGBDA modelling continues to prove an accurate wine authentication tool, one that can be applied even at the subregional level.

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Ruchira Ranaweera, Department of Wine Science, The University of Adelaide, South Australia, Australia
Adam GILMORE, Horiba Instruments Inc., Piscataway, New Jersey, USA
Dimitra CAPONE, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide
Susan BASTIAN, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide
David JEFFERY, The Australian Research Council Training Centre for Innovative Wine Production, The University of Adelaide

Keywords

geographical indication, authenticity, subregion, excitation-emission matrix, chemometrics, terroir

Related articles…

Sustainable fertilisation of the vineyard in Galicia (Spain)

Excessive fertilization of the vineyard leads to low-quality grapes, increased costs and a negative impact on the environment. To establish an integrated management system aimed at sustainable fertilization of vineyards, nutritional reference levels were established. For this purpose, 30 representative vineyards of the Albariño variety were studied, in which soil and petiole analyses were carried out for two years and grape yield and quality at harvest were measured. In both years of study, soil pH, calcium, sodium and cation exchange capacity were positively correlated with calcium content and negatively correlated with manganese in grapes. Irrigated vineyards had higher levels of aluminium in soil and lower levels of calcium in petioles. Climatic conditions differed markedly between the two years of the study: 2019 was colder than usual, whereas 2020 brought marked water stress and high summer temperatures. This resulted in medium-high acidity in grapes in 2019 and low acidity in 2020, with sugar levels being similar in both years. A very marked decrease in must amino nitrogen was observed in 2020, with ammonia nitrogen remaining stable. The correlation of acidity and sugar values in grapes with soil and petiole analysis data made it possible to establish reference levels for the nutritional diagnosis of the Albariño variety in this region. Based on these results, an easy-to-use ICT application is currently being created for grapegrowers, aimed at improving the sustainability of the vineyard through rational fertilization. This study has now been extended to other Galician vine varieties.

A spatial explicit inventory of EU wine protected designation of origin to support decision making in a changing climate

Winemaking areas recognized as protected designations of origin (PDOs) shape important economic, environmental and cultural values that are tied to closely defined geographic locations. To preserve the wine products and wine-growing practices adopted in different PDOs, these areas are strictly regulated by legal specifications. However, quality viticulture is increasingly under pressure from climate change, which is altering the local conditions of many winegrowing areas. Therefore, maintaining traditional wine products will require the adoption of tailored adaptation strategies, including possible changes in the legal regulation of protected wines. To this end, it is necessary to have comprehensive knowledge of PDOs, including their extent, products and allowed practices. While there have been efforts to build databases that summarize the characteristics of individual wine PDO areas and to quantify the related effects of climate change, much information is still included only in the official documentation of the EU geographical indication register and has never been collected in a comprehensive manner. With this study we aim to fill this gap by building a spatial inventory of European wine PDOs that supports decision making in viticulture in the context of climate change. To map and characterize European wine PDOs, we analysed their legal documents and extracted relevant information useful for climate change adaptation. The output consists of a comprehensive geographical dataset that identifies the boundaries of all 1200 European wine PDOs at unprecedented spatial resolution and includes a set of legally binding regulations, such as authorized vine varieties, maximum yields and planting density. The inventory will allow researchers to analyse the impacts of climate change on European wine PDOs and support decision makers in developing tailored adaptation strategies. This includes, among others, the evaluation of new vineyard site selection, the expansion of cultivated varieties or the authorization of irrigation in vineyards.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at budbreak, flowering, veraison, and ripening since 1964. Despite the efforts made to maintain the collection and acquire data over such an extensive set of varieties, the dataset has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
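The first evaluation scheme described above (repeated random hold-out of 10% of the observed values, scored by normalized root mean squared error) can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the imputer is a simple row-mean placeholder rather than any of the four methods compared, the error is normalized by the standard deviation of all observed values (the abstract does not say which normalization was used), and the toy table of day-of-year values is invented.

```python
import math
import random


def mean_impute(table):
    """Placeholder imputer (assumption): fill gaps with the row (variety) mean."""
    all_obs = [v for row in table for v in row if v is not None]
    global_mean = sum(all_obs) / len(all_obs)
    filled = []
    for row in table:
        obs = [v for v in row if v is not None]
        fill = sum(obs) / len(obs) if obs else global_mean
        filled.append([fill if v is None else v for v in row])
    return filled


def nrmse(truth, pred, scale):
    """Root mean squared error divided by a fixed scale (here a standard deviation)."""
    mse = sum((t - p) ** 2 for t, p in zip(truth, pred)) / len(truth)
    return math.sqrt(mse) / scale


def monte_carlo_nrmse(table, impute, frac=0.10, repeats=1000, seed=0):
    """Average NRMSE over repeated random hold-outs of observed cells."""
    rng = random.Random(seed)
    cells = [(r, c) for r, row in enumerate(table)
             for c, v in enumerate(row) if v is not None]
    values = [table[r][c] for r, c in cells]
    mean = sum(values) / len(values)
    scale = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    n_hold = max(1, round(frac * len(cells)))
    scores = []
    for _ in range(repeats):
        held = rng.sample(cells, n_hold)
        masked = [list(row) for row in table]
        for r, c in held:
            masked[r][c] = None            # hide the true value
        filled = impute(masked)
        truth = [table[r][c] for r, c in held]
        pred = [filled[r][c] for r, c in held]
        scores.append(nrmse(truth, pred, scale))
    return sum(scores) / len(scores)


# Invented toy data: 4 varieties x 10 years of day-of-year values, two gaps.
table = [[100.0 + 3 * v + (7 * y) % 5 for y in range(10)] for v in range(4)]
table[1][3] = None
table[2][8] = None
score = monte_carlo_nrmse(table, mean_impute, repeats=200)
print(round(score, 3))
```

Any imputation method with the same table-in, table-out interface could be passed in place of `mean_impute`, which makes the scheme convenient for comparing methods on identical hold-out sets when the seed is fixed.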

Extreme canopy management for vineyard adaptation to climate change: is it a good idea?

Climate change constitutes an enormous challenge for humankind and for all human activities, and viticulture is no exception. Long-term strategic changes are probably what is needed most, but growers also need to deal with short-term changes: progressively warmer summers, earlier harvest dates and higher pH in musts and wines. Over the last 10-15 years, a substantial body of research has been developed worldwide to evaluate to what extent extreme canopy management operations, aimed at reducing leaf area and thus limiting the source-to-sink ratio, could be useful to delay ripening. Although extreme canopy management can result in significant delays in harvest dates, longer-term studies, as well as detailed analysis of the implications for carbohydrate reserves, bud fertility and future yield, are desirable before these practices can be recommended.

Long-term drought resilience of traditional red grapevine varieties from a semi-arid region

In recent decades, the scarcity of water resources for agriculture in certain areas has been aggravated by climate change, which has caused an increase in temperatures, changes in rainfall patterns, and an increase in the frequency of extreme phenomena such as droughts and heat waves. Although the vine is considered a drought-tolerant species, it has substantial water requirements to complete its cycle, which coincides with the hottest and driest months. Achieving sustainable viticulture in this scenario requires high levels of efficiency in the use of water, a scarce resource whose use is expected to be severely restricted in the near future. In this regard, the use of drought-tolerant varieties that are able to maintain grape yield and quality could be an effective strategy to face this change. During three consecutive seasons (2018-2020), the behaviour under rainfed conditions of 13 traditional red grapevine varieties from the central region of Spain was studied. These varieties were cultivated in a collection at the Centro de Investigación de la Vid y el Vino de Castilla-La Mancha (IVICAM-IRIAF), located in Tomelloso (Castilla-La Mancha, Spain). Yield components (yield, mean bunch and berry weight, pruning weight), physicochemical parameters of the musts (°Brix, total acidity, pH) and some physiological parameters related to water stress during the ripening period (δ13C, δ18O) were analysed. The application of different statistical techniques to the results showed significant differences between varieties in their response to stressful conditions. A few varieties stood out for their high ability to adapt to drought, being able to maintain high yields due to their efficiency in the use of water. In addition, it was possible to quantify to what extent climate can be a determinant of the δ18O of musts under severe water stress conditions.