Macrowine 2021

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Such patterns can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years' storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods, and the cophenetic coefficient was used to identify the most confident clustering method.
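
The fusion and hierarchical clustering steps can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are random placeholders, the block sizes are invented, and the helper names `mfa_weight` and `cophenetic_by_linkage` are hypothetical. It reproduces only the block-weighting idea behind MFA (each standardized block is divided by its first singular value so that no block dominates the fused table), then compares AHC linkage methods by their cophenetic correlation coefficient using SciPy.

```python
import numpy as np
from scipy.cluster.hierarchy import cophenet, linkage
from scipy.spatial.distance import pdist


def mfa_weight(blocks):
    """Standardize each block, scale it by 1 / first singular value, and concatenate.

    Assumes no column is constant (a zero standard deviation would divide by zero).
    """
    weighted = []
    for block in blocks:
        X = np.asarray(block, dtype=float)
        Z = (X - X.mean(axis=0)) / X.std(axis=0)      # column-wise standardization
        s1 = np.linalg.svd(Z, compute_uv=False)[0]     # largest singular value
        weighted.append(Z / s1)                        # block now has leading singular value 1
    return np.hstack(weighted)


def cophenetic_by_linkage(X, methods=("single", "complete", "average", "ward")):
    """Return the cophenetic correlation coefficient for each linkage method."""
    d = pdist(X)  # condensed Euclidean distances between samples
    return {m: cophenet(linkage(d, method=m), d)[0] for m in methods}


# Placeholder data: 12 hypothetical wines described by three blocks
# (e.g. sensory, NMR, HRMS). Real measurements would replace these.
rng = np.random.default_rng(0)
n_wines = 12
blocks = [rng.normal(size=(n_wines, p)) for p in (5, 40, 60)]

fused = mfa_weight(blocks)
scores = cophenetic_by_linkage(fused)
best = max(scores, key=scores.get)
print(f"fused table: {fused.shape}, best linkage by cophenetic coefficient: {best}")
```

A coefficient closer to 1 indicates that the dendrogram preserves the original pairwise distances more faithfully, which is one way to rank clustering conditions against each other.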

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.
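
To make the fuzzy approach concrete, the sketch below implements the standard fuzzy c-means updates attributed to Bezdek (1981) with NumPy only. It is an illustration under stated assumptions rather than the study's implementation: the function name `fuzzy_c_means`, the fuzzifier `m = 2`, the random initialization, and the two synthetic groups are all chosen here for demonstration. The partition coefficient at the end is one common summary of how crisp the memberships are; the abstract does not state that the authors used it.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Fuzzy c-means: returns (centers, U) where U[k, i] is the membership of sample k in cluster i."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)                  # memberships of each sample sum to 1
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]   # membership-weighted means
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)                    # avoid division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)   # standard membership update
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


# Synthetic example: two well-separated groups of 10 samples each.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.5, size=(10, 3)),
               rng.normal(10.0, 0.5, size=(10, 3))])

centers, U = fuzzy_c_means(X, n_clusters=2)
labels = U.argmax(axis=1)                              # hard labels, for inspection only
partition_coefficient = float((U ** 2).sum() / X.shape[0])
print(labels, round(partition_coefficient, 3))
```

Unlike a hard partition, the membership matrix shows how strongly each sample belongs to each cluster, so samples that are highly similar to one another appear with memberships spread across clusters instead of being forced into one.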

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Related articles…

VINIoT – Precision viticulture service

The VINIoT project aims to create a new technological vineyard monitoring service that will allow companies in the wine sector in the SUDOE space to monitor plantations remotely and in real time at various levels of precision. The system, to be designed around spectral images and an IoT architecture, allows parameters of interest to viticulture to be assessed and data to be collected at a precise scale (grape, plant, plot or vineyard). In France, three subjects were specifically developed: evaluation of maturity, evaluation of water stress, and detection of flavescence dorée. For the evaluation of maturity, it was decided to work first at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The hyperspectral images and the reference analyses used to measure maturity were acquired in the laboratory after harvesting the berries in a maturity-monitoring context. This work focuses on a case study predicting the sugar content of three grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022) to improve prediction model performance, was applied to spectra after acquisition of the hyperspectral images. Regarding the evaluation of water stress, to obtain significant variability in water status, work began with potted plants under 2 different water regimes. The facilities allowed irrigation and micro-climatic conditions to be supervised. Regression models on agronomic variables (stomatal conductance, water potential, …) are being studied. To detect flavescence dorée, the experimental plan consisted of work at the leaf scale, first in the laboratory and then in the field. To detect the disease from hyperspectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy demonstrated the potential of hyperspectral images for discriminating between healthy leaves and leaves infected by flavescence dorée (Mas Garcia et al., 2021).

Protected Designation of Origin (D.P.O.) Valdepeñas: classification and map of soils

The objective of the work described here is to produce a map of the different types of vineyard soils to guide farmers in choosing the most productive vine rootstocks and varieties. 90 vineyard soil profiles were analysed across the entire territory of the Valdepeñas Designation of Origin. The sampling was carried out in 2018 (June to October) using a sampling grid, followed by photointerpretation and control in the field. The studied soils can be grouped into 9 different soil types (according to the FAO 2006 classification): Leptosols, Regosols, Fluvisols, Gleysols, Cambisols, Calcisols, Luvisols and Anthrosols. A map showing the distribution of the different soil types was made with the ArcGIS program. Regarding the choice of rootstock, Calcisols are soils with a high active limestone content, so the rootstocks used in these soils must be resistant to this parameter; Luvisols are deep soils with high clay content, so they will support vigorous rootstocks. Because the cartographic units are composed of two or more subgroups, which are associated in variable proportions, 9 different soil associations have been established; Unit 1: Leptosols, Cambisols and Luvisols (80%, 15% and 5% respectively); Unit 2: Cambisols with Regosols and Luvisols (40%, 30% and 30% respectively); Unit 3: Cambisols and Gleysols with Regosols (40%, 40% and 20% respectively); Unit 4: Regosols with Cambisols, Leptosols and Calcisols (40%, 30%, 15% and 15% respectively); Unit 5: Cambisols, Leptosols, Calcisols and Regosols (25% each); Unit 6: Luvisols with Cambisols and Calcisols (80%, 10% and 10% respectively); Unit 7: Luvisols and Calcisols with Cambisols (40%, 40% and 20% respectively); Unit 8: Calcisols with Cambisols and Luvisols (80%, 10% and 10% respectively); Unit 9: Anthrosols. This study provides the first map of vineyard soils of this Protected Designation of Origin in Castilla-La Mancha.

Optimizing stomatal traits for future climates

Stomatal traits determine grapevine water use, carbon supply, and water stress, which directly impact yield and berry chemistry. Breeding for stomatal traits has strong potential to improve grapevine performance under future, drier conditions, but the trait values that breeders should target are unknown. We used a functional-structural plant model developed for grapevine (HydroShoot) to determine how stomatal traits impact canopy gas exchange, water potential, and temperature under historical and future conditions in high-quality and hot-climate California wine regions (Napa and the Central Valley). Historical climate (1990-2010) was collected from weather stations and future climate (2079-99) was projected from 4 representative climate models for California, assuming medium and high emissions (RCP 4.5 and 8.5). Five trait parameterizations, representing mean and extreme values for the maximum stomatal conductance (gmax) and leaf water potential threshold for stomatal closure (Ψsc), were defined from meta-analyses. Compared to mean trait values, the water-spending extremes (highest gmax or most negative Ψsc) had negligible benefits for carbon gain and canopy cooling, but exacerbated vine water use and stress, for both sites and climate scenarios. These traits increased cumulative transpiration by 8 – 17%, changed cumulative carbon gain by -4 – 3%, and reduced minimum water potentials by 10 – 18%. Conversely, the water-saving extremes (lowest gmax or least negative Ψsc) strongly reduced water use and stress, but potentially compromised the carbon supply for ripening. Under RCP 8.5 conditions, these traits reduced transpiration by 22 – 35% and carbon gain by 9 – 16% and increased minimum water potentials by 20 – 28%, compared to mean values. 
Overall, selecting for more water-saving stomatal traits could improve water-use efficiency and avoid the detrimental effects of highly negative canopy water potentials on yield and quality, but more work is needed to evaluate whether these benefits outweigh the consequences of minor declines in carbon gain for fruit production.

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed using a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequence to suppress the water and ethanol signals. The pH was not modified, to avoid any chemical alteration of the matrix. Input variables for the untargeted analysis were generated by bucketing the spectra. The resulting dataset was preprocessed prior to performing unsupervised PCA with the MetaboAnalyst web-based tool suite. Compounds for the targeted analysis were identified by comparison with spectra of pure compounds, using the SMA plug-in of the MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through PCA, revealed a clear differentiation among the wines. The spectral regions contributing most to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis led to the identification of 68 compounds whose concentrations were significantly different among the wines. The results were related to the physical-chemical analysis of the soils and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. 
The 1H-NMR metabolomic analysis proved to be an excellent tool for discriminating between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Grapevine varietal diversity as mitigation tool for climate change: Agronomic and oenologic potential of 14 foreign varieties grown in Languedoc region (France)

Climate change effects in Languedoc include an expected rise in temperatures, increased evapotranspiration, and more severe and frequent climatic hazards, such as frost, drought periods and heat waves. For winegrowers these phenomena impact both yield and quality, resulting in more frequent unbalanced wines. Research on identified mitigation tools for vineyard management is necessary to improve the resilience of grapevine agrosystems. Varietal assortment is one of them. This study focuses on the agronomic and oenologic potential of 14 foreign varieties grown in the Languedoc region of France. Fourteen grapevine varieties were monitored during 2021 from June until harvest on eight different sites, some of them on more than one site, adding up to 21 different modalities: 7 white varieties Alvarinho B, Assyrtiko B (2), Malvasia Istriana B, Parellada B, Verdejo B, Verdelho B, Xarello B, and 7 black varieties Saperavi N (2), Touriga nacional N, Baga N, Aleatico N, Montepulciano N (2), Primitivo N (3), Calabrese N (3). Varieties were compared through the following parameters. Phenology was assessed using the information collected in the Database Network of French Vine Conservatories (INRAE-SupAgro-IFV, 2005-2015). The number of inflorescences on shoots from secondary buds, bourillons and suckers was observed to assess post-bud break frost tolerance potential. Grapevine water status was studied through stem water potential measurement, observation of foliage symptoms of drought, and 𝛿13C on must. Frequencies and intensities of downy mildew, powdery mildew, and black rot attacks were estimated before harvest on leaves and clusters, and botrytis at harvest, to assess disease susceptibilities. Berry composition was monitored from the end of veraison until harvest. Yield and mean bunch weight were also calculated. Varieties were then ranked on a 1-4 scale for each parameter and compared through PCA. 
Forty-two stations of the Mediterranean basin were compared by PCA with the Multicriteria Climatic Classification indicators in order to test the information collected during the 2021 campaign against the hypothesis that plants coming from dry and hot regions are genetically adapted to such climatic conditions.

Water deficit differentially impacts the performances and the accumulation of grape metabolites of new varieties tolerant to fungi

The use of resistant varieties is a long-term but promising solution to reduce chemical input in viticulture. Several important breeding programs in Europe and abroad are now releasing a range of new hybrids that perform well with regard to fungal susceptibility and produce good quality wines. Unfortunately, breeders pay insufficient attention to the adaptation of these varieties to climatic changes, notably to the increased climatic demand and water deficit (WD). Thus, prior to the adoption of such varieties by the wine industry in Mediterranean regions, there is a need to consider their suitability to WD. This study aimed to characterize the different drought strategies adopted by 6 new resistant varieties selected by INRAE in comparison to Syrah. To allow the assessment of long-term impacts of WD, field-grown vines were exposed to contrasting WD from 2018 to 2021 under a semi-arid Mediterranean climate. A gradient of WD was applied in the field and controlled through plant measurements at the single plant level. Grape development was non-destructively monitored to determine the arrest of berry phloem unloading. The impacts of WD on berry composition, including water, primary metabolites (sugars, organic acids), secondary metabolites (anthocyanins, thiol precursors) and main cation contents, were assessed at this specific stage. Results showed different varietal responses during the year and inter-annual acclimation in terms of plant water use efficiency, biomass accumulation, as well as yield components and berry composition. WD differentially reduced the accumulation of primary metabolites at plant and berry levels, but had little effect on their concentrations in the fruits at the ripe stage. Moreover, WD differentially impacted the accumulation of secondary metabolites and major cations between the varieties. 
In the talk, we will present the main results regarding the impacts of WD on fruit metabolites and broaden the discussion of the practical assessment of grapevine acclimation to WD.

Assessing the climate change vulnerability of European winegrowing regions by combining exposure, sensitivity and adaptive capacity indicators

Winegrowing regions recognized as protected designations of origin (PDOs) are closely tied to well defined geographic locations with a specific set of pedoclimatic attributes and strictly regulated by legal specifications. However, climate change is increasingly threatening these regions by changing local conditions and altering winegrowing processes. The vulnerability to these changes is largely heterogenous across different winegrowing regions because it is determined by individual characteristics of each region, including the capacity to adapt to new climatic conditions and the sensitivity to climate change, which depend not only on natural, but also socioeconomic and legal factors. Accurate vulnerability assessments therefore need to combine information about adaptive capacity and climate change sensitivity with projected exposure to new climatic conditions. However, most existing studies focus on specific impacts neglecting important interactions between the different factors that determine climate change vulnerability. Here, we present the first comprehensive vulnerability assessment of European wine PDOs that spatially combines multiple indicators of adaptive capacity and climate change sensitivity with high-resolution climate projections. We found that the climate change vulnerability of PDO areas largely depends on the complex interactions between physical and socioeconomic factors. Homogenous topographic conditions and a narrow varietal spectrum increase climate change vulnerability, while the skills and education of farmers, together with a good economic situation, decrease their vulnerability. Assessments of climate change consequences therefore need to consider multiple variables as well as their interrelations to provide a comprehensive understanding of the expected impacts of climate change on European PDOs. 
Our results provide the first vulnerability assessment for European winegrowing regions at high spatiotemporal resolution that includes multiple factors related to climate exposure, sensitivity, and adaptive capacity on the level of single winegrowing regions. They will therefore help to identify hot spots of climate change vulnerability among European PDOs and efficiently direct adaptation strategies.

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne”, representing 37 organizations from the different wine regions, have been working increasingly closely over the last 2 decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid-1980s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. The INRAe Vassal collection plays a key role here as it holds the largest diversity available, along with a rich bibliography and herbariums, offering us the opportunity to document and double check the identity of a cultivar, consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region, through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, the IFV selection center takes over to initiate the process of selection and propagation. A specific focus on regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these have been neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered as positive qualities, not only in mitigation of the effects of climate change, but also as an opportunity for restoring diversity…

De novo Vitis champinii whole genome assembly allows rootstock-specific identification of potential candidate genes for drought and salt tolerance

Vitis champinii cultivars Ramsey and Dog-ridge are main choices for rootstocks to adapt viticulture in semi-arid and arid regions thanks to their distinctive tolerance to drought and salinity. However, genetic studies on non-vinifera rootstocks have heavily relied on the grapevine (Vitis vinifera) reference genome, which has hindered the assessment of the genetic variation between rootstock species and grapevines. In the present study, this limitation is addressed by introducing a de novo phased genome assembly and annotation of Vitis champinii. This new Vitis champinii genome was employed as reference for mapping RNA-seq reads from the same species under drought and salt stresses, and for comparison the same reads were also mapped to the Vitis vinifera PN40024.V4 reference genome. A significant increase in alignment rate was gained when mapping Vitis champinii RNA-seq reads to its own genome, compared to the Vitis vinifera PN40024.V4 reference genome, thus revealing the expression levels of genes specific to Vitis champinii. Moreover, differences in coding sequences were observed in orthologous genes between Vitis champinii and Vitis vinifera, which challenges previous differential expression analyses performed between contrasting Vitis genotypes on the same gene from the Vitis vinifera genome. Genes with possible implications in drought and salt tolerance have been identified across the genome of Vitis champinii, and the same genomic data can potentially guide the discovery of candidate genes specific to Vitis champinii for other traits of interest, making it a valuable resource for rootstock breeding designs, especially in view of increased drought and salinity due to climate change.

Downscaling of remote sensing time series: thermal zone classification approach in Gironde region

In viticulture, the challenges of local climate modelling are multiple: taking into account the local environment, fine temporal and spatial scales, reliable time series of climate data, ease of implementation and reproducibility of the method. At the local scale, recent studies have demonstrated the contribution of spatialization methods for ground-based climate observation data considering topographic factors such as altitude, slope, aspect, and geographic coordinates (Le Roux et al, 2017; De Rességuier et al, 2020). However, these studies have raised questions about the reproducibility and sustainability of this type of climate study. In this context, we evaluated the potential of MODIS thermal satellite images validated with ground-based climate data (Morin et al, 2020). Previous studies have been encouraging, but questions remain to be explored at the regional scale, particularly given the massive use of bioclimatic indices to classify the climate of wine regions. Since the results at the local scale were encouraging, the approach was tested in the current study at the regional scale. Several objectives were set: 1) to evaluate the downscaling method for land surface temperature time series, 2) to identify regional thermal structure variations. We used weekly minimum and maximum surface temperature time series acquired by MODIS satellites at a spatial resolution of 1000 m and downscaled to 500 m using topographical variables. Two types of analyses were performed: