Macrowine 2021

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be approached by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years of storage) from young (35 years) vines.

METHODS: Sensory (sorting; Mafata et al. 2020) and chemical (NMR: nuclear magnetic resonance, HRMS: high-resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal clustering conditions were found for both methods, and the cophenetic coefficient was used to assess which clustering method gave the most confident result.
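The fusion and hierarchical clustering steps described in the methods can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic stand-ins for the sensory, NMR, HRMS and UV-Vis tables, the block sizes are arbitrary, and `mfa_scores` is a hypothetical helper that approximates MFA by scaling each standardized block by its first singular value before a global PCA. The cophenetic correlation is then compared across several linkage methods.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet, fcluster
from scipy.spatial.distance import pdist


def mfa_scores(blocks, n_components=2):
    """MFA-style fusion (hypothetical helper): standardize each block,
    divide it by its first singular value so that no block dominates,
    then run a global PCA on the concatenated table via SVD."""
    weighted = []
    for block in blocks:
        z = (block - block.mean(axis=0)) / block.std(axis=0, ddof=1)
        first_sv = np.linalg.svd(z, compute_uv=False)[0]
        weighted.append(z / first_sv)
    fused = np.hstack(weighted)
    u, s, _ = np.linalg.svd(fused, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


rng = np.random.default_rng(0)
n_wines = 12  # assumed sample count, for illustration only
# Synthetic blocks with arbitrary numbers of variables per technique.
blocks = [rng.normal(size=(n_wines, p)) for p in (8, 50, 200, 30)]
scores = mfa_scores(blocks)

# Compare linkage methods by how well each dendrogram preserves the
# original pairwise distances (cophenetic correlation coefficient).
distances = pdist(scores)
results = {}
for method in ("single", "complete", "average", "ward"):
    tree = linkage(scores, method=method)
    coph_corr, _ = cophenet(tree, distances)
    results[method] = coph_corr

best = max(results, key=results.get)
# Cut the best tree into at most two clusters.
labels = fcluster(linkage(scores, method=best), t=2, criterion="maxclust")
print(best, round(results[best], 3), labels)
```

A cophenetic correlation close to 1 indicates that the dendrogram represents the pairwise distances faithfully; low values across all linkage methods would be consistent with samples that are too similar to separate.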

RESULTS: Because large data sets were fused, the models were very complex. Clustering patterns were not consistent when the clustering conditions were varied, signalling high similarity between samples. The samples could not be confidently distinguished from one another even under the best optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient to solve the classification problem in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled with data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored further, keeping in mind that it is more sensitive to small differences in the data than classical statistics.
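To make the fuzzy approach concrete, the following is a minimal sketch of fuzzy c-means in the style of Bezdek (1981), written in plain NumPy. It is an illustration under stated assumptions rather than the procedure used in the study: the fuzzifier `m = 2`, the synthetic two-dimensional data and the use of the partition coefficient as a confidence measure are all choices made here for the example.

```python
import numpy as np


def fuzzy_c_means(x, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Alternate between centre and membership updates until the
    memberships stop changing; returns (centers, memberships)."""
    rng = np.random.default_rng(seed)
    u = rng.random((x.shape[0], n_clusters))
    u /= u.sum(axis=1, keepdims=True)  # each sample's memberships sum to 1
    for _ in range(n_iter):
        w = u ** m
        centers = (w.T @ x) / w.sum(axis=0)[:, None]
        dist = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        new_u = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(new_u - u).max() < tol
        u = new_u
        if converged:
            break
    return centers, u


def partition_coefficient(u):
    """Equals 1/c for a completely fuzzy partition and 1.0 for a crisp one."""
    return float((u ** 2).sum() / u.shape[0])


rng = np.random.default_rng(1)
# Two well-separated groups versus one homogeneous cloud (synthetic data).
separated = np.vstack([rng.normal(0, 0.3, (10, 2)), rng.normal(5, 0.3, (10, 2))])
overlapping = rng.normal(0, 1.0, (20, 2))

_, u_sep = fuzzy_c_means(separated, n_clusters=2)
_, u_ovl = fuzzy_c_means(overlapping, n_clusters=2)
pc_sep = partition_coefficient(u_sep)
pc_ovl = partition_coefficient(u_ovl)
print(f"separated: {pc_sep:.2f}, overlapping: {pc_ovl:.2f}")
```

Because every sample receives a graded membership in every cluster, highly similar samples show up as memberships far from 0 or 1 instead of being forced into a hard assignment, which is one way to read how confident a clustering is.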

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Related articles…

Climate projections over France wine-growing region and its potential impact on phenology

Climate change represents a major challenge for the French wine industry. Climatic conditions in French vineyards have already changed and will continue to evolve. One of the notable effects on grapevine is the advancing growing season. The aim of this study is to characterise the evolution of agroclimatic indicators (Huglin index, number of hot days, mean temperature, cumulative rainfall and number of rainy days during the growing season) at the scale of French wine-growing regions between 1980 and 2019 using gridded data (8 km resolution, SAFRAN) and for the middle of the 21st century (2046-2065) with 21 GCMs statistically debiased and downscaled at 8 km. A set of three phenological models was used to simulate the budburst (BRIN, Smoothed-Utah), flowering, veraison and theoretical maturity (GFV and GSR) stages for two grape varieties (Chardonnay and Cabernet-Sauvignon) over the whole period studied. All the French wine-growing regions show an increase in both temperatures during the growing season and Huglin index. This increase is accompanied by an advance in the simulated flowering (+3 to +9 days), veraison (+6 to +13 days) and theoretical maturity (+6 to +16 days) stages, which is more noticeable in the north-eastern part of France. The climate projections unanimously show, for all the GCMs considered, a clear increase in the Huglin index (+662 to 771 °C.days compared to the 1980-1999 period) and in the number of hot days (+5.6 to 22.6 days) in all the wine regions studied. Regarding rainfall, the expected evolution remains very uncertain due to the heterogeneity of the climates simulated by the 21 models. Only 4 regions out of 21 have a significant decrease in the number of rainy days during the growing season. The two budburst models diverge strongly in the evolution of this stage, with an average difference of 18 days between the two models across all wine-growing regions. Theoretical maturity is the most impacted stage, with a potential advance of between 23 and 40 days depending on the wine-growing region.

Adaptability of grapevines to climate change: characterization of phenology and sugar accumulation of 50 varieties, under hot climate conditions

Climate is the major factor influencing the dynamics of the vegetative cycle and can determine the timing of phenological periods. Knowledge of the phenology of varieties, their chronological duration, and thermal requirements allows not only for better management of interventions in the vineyard, but also for predicting the varieties’ behaviour in a scenario of climate change, giving the wine producer the possibility of selecting the grape varieties that are best adapted to the climatic conditions of a certain terroir. In 2014, Symington Family Estates, Vinhos, established two grape variety libraries in two different places with distinctive climate conditions (Douro Superior, and Cima Corgo), with the commitment of contributing to a deeper agronomic and oenological understanding of some grape varieties in hot climate conditions. These research vineyards include local varieties that are important in regional and national viticulture, others that have been forgotten over time, and five international reference cultivars. From 2017 to 2021, phenological observations were made three times a week, following a defined protocol, to determine the average dates of budbreak, flowering and veraison. With the climate data of each location, the thermal requirements of each variety and the chronological duration of each phase were calculated. During maturation, berry samples were gathered weekly to study the dynamics of sugar accumulation, among other parameters. The data were analysed by applying phenological and sugar accumulation models available in the literature. The results show significant differences between the varieties over several parameters, from the chronological duration and thermal requirements to complete the various stages of development, to the differences between the two locations, confirming the influence of the climate on phenology and the stages of maturation in these specific conditions.

Grapevine yield-gap: identification of environmental limitations by soil and climate zoning in Languedoc-Roussillon region (south of France)

Grapevine yield has been historically overlooked, on the assumption of a strong trade-off between grape yield and wine quality. At present, menaced by climate change, many vineyards in Southern France are far from the quality label threshold, making grapevine yield-gaps a major subject of concern. Although yield-gaps are well studied in arable crops, we know very little about grapevine yield-gaps. In the present study, we analysed the environmental component of grapevine yield-gaps linked to climate and soil resources in the Languedoc-Roussillon. We used SAFRAN data and IGP Pays d’Oc wine yields from 2010 to 2018. We selected climate and soil indicators that proved to have a significant effect on average wine yield-gaps at the municipality scale. The most significant factors of grapevine yield were the Soil Available Water Capacity, followed by the Huglin Index and the Climatic Dryness Index. The Days of Frost, the Soil pH and the Very Hot Days were also significant. Then, we clustered geographical zones presenting similar indicators, facilitating the identification of resource yield-gaps. We discussed the number of zones with the experts of the IGP Pays d’Oc label, obtaining 7 zones with similar limitations for grapevine yield. Finally, we analysed the main resources causing yield-gaps and the grapevine varieties planted in each zone. Mapping grapevine resource yield-gaps is the first stage for understanding grapevine yield-gaps at the regional scale.

From a local to an international scale: sensory benchmarking of PDO wines. Quincy and Reuilly PDO wines (Sauvignon blanc) as a case study (France)

In a collective marketing strategy, the Protected Designation of Origin (PDO) can be used as a quality indicator. To highlight terroir specificities, it is useful to know how the wines are positioned on the local, national or international market from a sensory point of view. This is especially true for a comparison of varietal wines (e.g. Sauvignon blanc). We focus on the case of two neighbouring Loire Valley PDO (France): Quincy and Reuilly. Three distinct tastings were organized: firstly, at the local level, comparing the 2 PDO (11 and 9 wines, 17 professional assessors); secondly, at a regional level, adding 3 nearby PDO: Menetou-Salon, Sancerre and Pouilly-Fumé (3 wines per PDO, 16 assessors); and thirdly, at an international level, comparing these 5 PDO with Sauvignon blanc wines from South Africa, New Zealand and Chile (1 to 3 wines per PDO, 19 assessors). All the wines were from the 2019 vintage and were considered to have undergone a traditional winemaking process without contact with oak. A sensory descriptive analysis was performed using an aroma wheel that combines a Check-All-That-Apply methodology, often used in sensory benchmarking, with a hierarchical structuring of the attributes. The aim is to facilitate data acquisition in a professional context without common training, to consider the hierarchical relationships among the attributes during the data analysis and to be able to characterize wines with a large range of sensory variability. We used univariate, multivariate and clustering analyses. Similarities and differences between Quincy and Reuilly PDO wines and other Sauvignon blanc wines were identified. Specific attributes can distinguish the two PDO, and different proximities exist with other local PDO, while clear differences were observed compared to international wines. Our study proposes and discusses a method for wine sensory benchmarking that highlights sensory specificities linked to origin.

Grape must quality and mesoclimatic variability in Fruška Gora wine-growing region, Serbia

The Fruška Gora mountain is a traditional wine-growing region in Serbia situated in the Pannonian Basin. Due to this position, the vicinity of the Danube River and its concave configuration, it is suitable for grape production. This paper provides analyses of spatial variations in meteorological parameters and grape juice quality within the Fruška Gora wine region over three consecutive vintages (2018-2020). The examined period can be defined as warm with cool nights during September (AVG 18.9°C; GDD 1918°C; CI 12°C) and with the presence of mesoclimatic variability. The eastern part of the study area was somewhat drier and hotter compared to other parts of the region. The analyses of grape must samples (190 in total) of five cultivars (Cabernet-Sauvignon, Merlot, Chardonnay, Sauvignon blanc and Grašac (Welschriesling)) commonly grown across the region (19 sites) were performed using Fourier Transform Infrared Technology (FTIR). Among all cultivars, Sauvignon blanc was harvested first in the East area (DOY=246±5, GDD at harvest=1552±74, 22.2±0.7 °Brix), while the latest harvest was recorded for Cabernet-Sauvignon in the West (DOY=283±5, GDD at harvest=1936±187, 23.4±1.0 °Brix). Both the red and white cultivars had higher acidity and YAN in the grape must if the vines were grown in the North and East compared to the South and West areas. According to PCA analysis, Grašac showed the lowest variation in grape must chemical composition. Thus, the results confirm that Grašac is the most stable cultivar in Fruška Gora. All monitored cultivars reached technological fruit ripeness by the end of the growing season. However, it was difficult to reach full ripeness of red cultivars, mostly because of the uncoupling of technological and phenolic ripeness. Thus, Cabernet-Sauvignon had higher variations in GDD sums at harvest compared to other cultivars, which probably increased variations in grape must quality.

The potential of multispectral/hyperspectral technologies for early detection of “flavescence dorée” in a Portuguese vineyard

“Flavescence dorée” (FD) is a grapevine quarantine disease associated with phytoplasmas and transmitted to healthy plants by insect vectors, mainly Scaphoideus titanus. Infected plants usually develop symptoms of stunted growth, unripe cane wood, leaf rolling, leaf yellowing or reddening, and shrivelled berries. Since plants can remain symptomless for up to four years, they may act as reservoirs of FD, contributing to the spread of the disease. So far, conventional management strategies rely mainly on insecticide treatments, uprooting of infected plants and use of phytoplasma-free propagation material. However, these strategies are costly and could have undesirable environmental impacts. Thus, the development of sustainable and noninvasive approaches for early detection of FD and its management is of great importance to reduce disease spread and select the best cultural practices and treatments. The present study aimed to evaluate whether multispectral/hyperspectral technologies can be used to detect FD before the appearance of the first symptoms and whether infected grapevines display a spectral imaging fingerprint. To that end, physiological parameters (leaf area, chlorophyll content and photosynthetic rate) were collected concomitantly with measurements of plant reflectance (using both a portable apparatus and a remote sensing drone). Measurements were performed on two leaves of 8 healthy and 8 FD-infected grapevines at four timepoints: (i) before the development of disease symptoms (21st June); and, after the appearance of symptoms, (ii) at veraison (2nd August); (iii) at post-veraison (11th September); and (iv) at harvest (25th September). At all timepoints, FD-infected plants revealed a significant decrease in the studied physiological parameters, with a positive correlation with drone imaging data and portable apparatus analyses. Moreover, spectra from both drone imaging and the portable apparatus showed clear differences between healthy and FD-infected grapevines, validating multispectral/hyperspectral technology as a potential tool for the early detection of FD or other grapevine-associated diseases.

Different soil types and relief influence the quality of Merlot grapes in a relatively small area in the Vipava Valley (Slovenia) in relation to the vine water status

Besides location and microclimatic conditions, soil plays an important role in the quality of grapes and wine. Soil properties influence…

The impact of leaf canopy management on eco-physiology, wood chemical properties and microbial communities in root, trunk and cordon of Riesling grapevines (Vitis vinifera L.)

In the last decades, climate change has already required adaptation of vineyard management. Increases in temperature and unexpected weather events cause changes in all phenological stages, requiring new management tools. For example, defoliation can be a useful tool to reduce the sugar content in the berries, creating differences in the wine profiles. In a ten-year field experiment using Riesling (Vitis vinifera L., planted in 1986, Geisenheim, Germany), various mechanical defoliation strategies and different intensities were trialed until 2016, before the vineyard was uprooted. Wood was sampled from the plant compartments root, trunk, cordon and shoot for analyses of physicochemical properties (e.g. lignin and element content, pH, diameter), nonstructural carbohydrates and the microbial communities. The aim of the study was to investigate the influence of reduced canopy leaf area on the sink-source allocation into different compartments and potential changes of the fungal and prokaryotic wood-inhabiting community using a metabarcoding approach. Severe summer pruning (SSP) of the canopy and mechanical defoliation (MDC) above the bunch zone decreased the leaf area by 50% compared to control (C). SSP reduced the photosynthetic capacity, which resulted in an altered source-sink allocation and carbohydrate storage. With lower leaf area, fewer carbohydrates are allocated. This resulted, for example, in a decreased trunk diameter. Further, it affected the composition of the grapevine wood microbiota. SSP and MDC management significantly changed the prokaryotic community composition in the wood of the root samples, but had no effect in other compartments. In general, this study found strong compartment effects and weaker management effects on the microbial community composition and associated physicochemical properties. The highest microbial diversities were identified in the wood of the trunk, and several species were recorded for the first time in grapevine.

Terroir traceability in grapes, musts and wine: results of research on Gewürztraminer and Sauvignon Blanc grape varieties in northern Italy

In the study of terroir, a separate analysis of its many component factors can be of great help in accurately identifying a vineyard’s natural elements that impact wine quality and typicity. This research used a dedicated pluri-disciplinary approach to investigate the ecological characteristics, including geology and geographical features, of 14 vineyards that produce Gewürztraminer and Sauvignon Blanc cultivars in the alpine Alto Adige DOC wine region. Both the geopedological method using Vineyards Geological Identity (VGI) and the new Solar Radiation Identity (SRI) topoclimatic classification method were used to provide analytical measurements and qualitative/quantitative characterisations. In addition, wide-ranging targeted and untargeted oenological and chemical analyses were carried out on grapes, musts and wines to correlate the soils’ geomineral and physical conditions with the biochemical properties of their fruits and wines. The research identified strong correlations between vineyard geo-identity and wine biofingerprint, confirming a mineral traceability of the strontium-rubidium ratio and some minerals distinctive to the local geology, such as K, Ca, Ag, Ba and Mn. The study also discovered that particular geomineral and physical soil conditions of the studied vineyards are related to the differing amounts of amino acids, primary varietal aromas and polyphenols found in grapes, musts and wines. The research confirmed that winemaking technologies support oenological quality, although in some cases human practices can overpower certain characteristic elements in wine, erasing the typical imprint left by the vineyards’ natural terroir, which becomes less traceable. Terroir abiotic ecological factors and vineyard identity can be classified in detail using the new VGI and SRI analysis methods to discover interrelationships between geo-pedological and topoclimatic conditions that impact wine quality. These methods are also helpful in identifying which ecological elements are exclusive to a particular vineyard or wine sub-region.

Metabolomic discrimination of grapevine water status for Chardonnay and Pinot noir

The impact of water status in viticulture has been widely explored, as it strongly affects grapevine physiology and grape chemical composition. It is considered a key component of vitivinicultural terroir. Most of the studies concerning grapevine water status have focused on either physiological traits, berry compounds, or traits involved in wine quality. Here, the response of grapevine to water availability during the ripening period is assessed through non-targeted metabolomics analysis of grape berries by ultra-high resolution mass spectrometry. The grapevine water status was assessed during 2 consecutive years (2019 & 2020) through carbon isotope discrimination on juices from berries collected at maturity (approx. 21.5 °Brix) for 2 Vitis vinifera cultivars, Pinot noir (PN) and Chardonnay (CH). A total of 220 grape juices were collected from 5 countries worldwide (Italy, Argentina, France, Germany, Portugal). Measured δ13C (‰) varied from -28.73 to -22.6 for PN, and from -28.79 to -21.67 for CH. These results also clearly revealed higher water stress for the 2020 vintage. The same grape juices were analysed by Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FT-ICR-MS) and Liquid Chromatography coupled to Mass Spectrometry (LC-qTOF-MS), leading to the detection of up to 4500 CHONS-containing elemental compositions, and thus likely tens of thousands of individual compounds, which include fatty acids, organic acids, peptides and phenolics, also with high levels of glycosylation. Multivariate statistical analysis revealed that up to 160 elemental compositions, covering the whole range of detected masses (100–1000 m/z), were significantly correlated to the observed gradients of water status. Examples of chemical markers that are representative of these complex fingerprints include various derivatives of the known abscisic acid (ABA), such as phaseic acid or abscisic acid glucose ester, which are significantly correlated with higher water stress, regardless of the variety. Cultivar-specific behaviours could also be identified from these fingerprints. Our results provide an unprecedented representation of the metabolic diversity involved in water status regulation at the grape level, which could contribute to a better knowledge of the grapevine mitigation strategy in a climate change context.