Macrowine 2021
IVES Conference Series

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be achieved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years' storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods, and the cophenetic coefficient was used to assess which clustering method was the most confident.
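The role of the cophenetic coefficient can be illustrated with a minimal sketch. It assumes average linkage and Euclidean distance (the abstract does not state which linkage or distance was used), and it runs on made-up coordinates standing in for MFA factor scores. All function and variable names are hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import dist, sqrt


def average_linkage_cophenetic(points):
    """Run average-linkage AHC and return {(i, j): cophenetic distance}.

    The cophenetic distance of two samples is the height (average
    inter-cluster distance) at which they first end up in the same cluster.
    """
    clusters = {i: [i] for i in range(len(points))}  # cluster id -> members
    coph = {}
    next_id = len(points)
    while len(clusters) > 1:
        # Find the two clusters with the smallest average pairwise distance.
        best = None
        for a, b in combinations(clusters, 2):
            total = sum(dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
            height = total / (len(clusters[a]) * len(clusters[b]))
            if best is None or height < best[0]:
                best = (height, a, b)
        height, a, b = best
        # Every cross pair of samples is joined at this merge height.
        for i in clusters[a]:
            for j in clusters[b]:
                coph[(min(i, j), max(i, j))] = height
        clusters[next_id] = clusters.pop(a) + clusters.pop(b)
        next_id += 1
    return coph


def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x)
                      * sum((b - my) ** 2 for b in y))


def cophenetic_correlation(points):
    """Correlate original pairwise distances with cophenetic distances."""
    coph = average_linkage_cophenetic(points)
    pairs = sorted(coph)
    original = [dist(points[i], points[j]) for i, j in pairs]
    from_tree = [coph[p] for p in pairs]
    return pearson(original, from_tree)


# Hypothetical two-dimensional scores for six samples (not study data).
toy_scores = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
print(cophenetic_correlation(toy_scores))
```

A value close to 1 means the dendrogram preserves the original pairwise distances well. A low value suggests the hierarchy distorts them, which is one way a clustering can be judged less confident.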

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.
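To make the fuzzy approach concrete, the following is a minimal sketch of the standard fuzzy c-means update rules (the algorithm the abstract calls Fuzzy-k means, after Bezdek 1981). It is not the authors' implementation: the fuzzifier m = 2, the fixed iteration count, the random initialisation and the toy data are all assumptions, and the function names are hypothetical. The partition coefficient shown at the end is one common way to summarise how crisp the memberships are. The abstract does not say it was used in the study.

```python
import random
from math import dist


def fuzzy_c_means(points, k, m=2.0, iters=100, seed=0):
    """Return (centers, memberships) after a fixed number of iterations.

    memberships[i][c] is the degree (0 to 1) to which sample i belongs
    to cluster c; each row sums to 1.
    """
    rng = random.Random(seed)
    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in points:
        row = [rng.random() + 1e-9 for _ in range(k)]
        total = sum(row)
        u.append([v / total for v in row])
    dim = len(points[0])
    centers = []
    for _ in range(iters):
        # Centers are means weighted by membership raised to the power m.
        centers = []
        for c in range(k):
            w = [u[i][c] ** m for i in range(len(points))]
            centers.append([sum(wi * p[d] for wi, p in zip(w, points)) / sum(w)
                            for d in range(dim)])
        # Memberships are proportional to distance ** (-2 / (m - 1)).
        for i, p in enumerate(points):
            inv = [max(dist(p, c), 1e-12) ** (-2.0 / (m - 1.0))
                   for c in centers]
            total = sum(inv)
            u[i] = [v / total for v in inv]
    return centers, u


def partition_coefficient(u):
    """Mean of squared memberships: 1/k for fully fuzzy, 1 for fully crisp."""
    return sum(v * v for row in u for v in row) / len(u)


# Hypothetical two-dimensional scores for six samples (not study data).
toy_scores = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers, memberships = fuzzy_c_means(toy_scores, k=2)
print(partition_coefficient(memberships))
```

Because every sample receives a graded membership in every cluster, small shifts in the data move the memberships continuously. This is consistent with the sensitivity to small differences noted above.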

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa


Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

