Macrowine 2021

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Coupling data fusion with machine learning techniques can help resolve such patterns, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years of storage) from young (35 years) vines.

METHODS: Sensory (sorting; Mafata et al. 2020) and chemical (NMR: nuclear magnetic resonance, HRMS: high-resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal clustering conditions were found for both methods, and the cophenetic coefficient was used to assess which clustering method gave the most confident result.
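The fusion and hierarchical clustering steps described above can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' pipeline: the helper names `mfa_fuse` and `cophenetic_by_linkage`, the block sizes, and the list of linkage methods are all assumptions. The MFA step is reduced to its block-weighting idea (each standardized block is divided by its first singular value, which matches the usual MFA weighting up to a constant factor), and the cophenetic correlation is computed with SciPy.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, cophenet
from scipy.spatial.distance import pdist


def mfa_fuse(blocks):
    """Fuse data blocks MFA-style (hypothetical helper).

    Each block is column-standardized, then divided by its first singular
    value so that no single block dominates the fused table.
    """
    weighted = []
    for X in blocks:
        Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
        first_sv = np.linalg.svd(Z, compute_uv=False)[0]
        weighted.append(Z / first_sv)
    return np.hstack(weighted)


def cophenetic_by_linkage(fused, methods=("single", "complete", "average", "ward")):
    """Return the cophenetic correlation coefficient for each linkage method."""
    dists = pdist(fused, metric="euclidean")
    return {m: float(cophenet(linkage(dists, method=m), dists)[0]) for m in methods}


# Synthetic stand-ins for sensory, NMR and UV-Vis blocks (12 wines each).
rng = np.random.default_rng(0)
blocks = [rng.normal(size=(12, 5)), rng.normal(size=(12, 40)), rng.normal(size=(12, 20))]
fused = mfa_fuse(blocks)
scores = cophenetic_by_linkage(fused)
best_method = max(scores, key=scores.get)
print(best_method, scores)
```

A higher cophenetic coefficient means the dendrogram distances preserve the original pairwise distances more faithfully, which is one way to compare clustering conditions.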

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled with data fusion, it could lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored further, keeping in mind that it is more sensitive to small differences in the data than classical statistics.
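To illustrate what a fuzzy clustering result looks like, here is a minimal NumPy sketch of the fuzzy c-means update rules commonly attributed to Bezdek (1981). It is an assumption that this corresponds to the "Fuzzy-k means" used in the study; the function names, the fuzzifier `m = 2`, the stopping rule, and the synthetic data are all illustrative choices.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Return (memberships U, cluster centers) for fuzzy c-means.

    U[i, k] is the degree to which sample i belongs to cluster k;
    each row of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Centers are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # avoid division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return U, centers


def partition_coefficient(U):
    """Mean squared membership: 1/c for fully fuzzy, 1 for crisp clusters."""
    return float((U ** 2).sum() / U.shape[0])


# Two well-separated synthetic groups of 10 samples each.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0.0, 0.5, size=(10, 2)), rng.normal(10.0, 0.5, size=(10, 2))])
U, centers = fuzzy_c_means(X, n_clusters=2)
print(U.round(3))
print(partition_coefficient(U))
```

When samples are highly similar, as reported for this wine set, the memberships drift toward 1/c for every cluster and the partition coefficient drops, which is how low clustering confidence shows up in a fuzzy model.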

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Related articles…

Underpinning terroir with data: rethinking the zoning paradigm

Agriculture, natural resource management and the production and sale of products such as wine are increasingly data-driven activities. Thus, the use of remote and proximal crop and soil sensors to aid management decisions is becoming commonplace and ‘Agtech’ is proliferating commercially; mapping, underpinned by geographical information systems and complex methods of spatial analysis, is widely used. Likewise, the chemical and sensory analysis of wines draws on multivariate statistics; the efficient winery intake of grapes, subsequent production of wines and their delivery to markets relies on logistics; whilst the sales and marketing of wines is increasingly driven by artificial intelligence linked to the recorded purchasing behaviour of consumers. In brief, there is data everywhere!

Opinions will vary on whether these developments are a good thing. Those concerned with the ‘mystique’ of wine, or the historical aspects of terroir and its preservation, may find them confronting. In contrast, they offer an opportunity to those interested in the biophysical elements of terroir, and efforts aimed at better understanding how these impact on vineyard performance and the sensory attributes of resultant wines. At the previous Terroir Congress, we demonstrated the potential of analytical methods used at the within-vineyard scale in the development of Precision Viticulture, in contributing to a quantitative understanding of regional terroir. For this conference, we take this approach forward with examples from contrasting locations in both the northern and southern hemispheres. We show how, by focussing on the vineyards within winegrowing regions, as opposed to all of the land within those regions, we might move towards a more robust terroir zoning than one derived from a mixture of history, thematic mapping, heuristics and the whims of marketers. Aside from providing improved understanding by underpinning terroir with data, such methods should also promote improved management of the entire wine value chain.

The modification of cultural practices in grapevine cv. Syrah, does it modify the characteristics of the musts?

The work shows the results of one year of experimentation (2020) in a Syrah vineyard in La Roda (Castilla-La Mancha, Spain). The trial used a randomized block design with two factors: Irrigation (I) and Pruning (P).
Irrigation schedules were adjusted to apply amounts close to 1,500 m³/ha. Within this allocation, two irrigation treatments were proposed: I1) irrigation from the pea-sized berry stage to post-harvest (with at least 20 % of the total irrigation water applied post-harvest); I2) irrigation from the pea-sized berry stage to harvest (the usual irrigation practice in the study area). Two pruning treatments were proposed: P1) pruning at the end of January, a conventional date; and P2) pruning at the beginning of budding. In total, 4 replicates were designed, each with 4 elementary plots representing the proposed treatment combinations (I1P1; I1P2; I2P1; I2P2). This gave 16 plots, and each elementary plot consisted of 30 vines distributed in 3 rows.
The productive response was evaluated through the yield of grapes harvested at 23 °Brix. The qualitative response was measured in the musts through indices of technological maturity (acidity, pH and potassium) and phenolic maturity, and through aromatic compounds in the free and glycosylated fractions. The treatments tested had, in general, an effect on the different variables analyzed.

Second pruning as a strategy to delay maturation in cv. ‘Touriga nacional’ in the Portuguese Douro region

The advance in maturation of wine grapes is an important climate-change-related risk that could affect warm regions such as the Portuguese Douro Wine Region. Indeed, climate analysis over the past years has registered a decrease in precipitation, significantly higher average temperatures, and a more frequent occurrence of extreme weather events, including heat waves. In these conditions the period from anthesis to maturation is shortened, and the uncoupling of technical and phenolic maturity results in berries with higher sugar concentration (and lower acidity) but lower anthocyanin, tannin and total phenolic concentrations, which produce unbalanced wines.
In this work, an innovative crop forcing strategy, based on forcing vine regrowth after a second pruning of green shoots, was tested with the aim of delaying ripening until temperatures are lower, thereby preventing acidity loss and increasing the anthocyanin-to-sugar ratio. The experiments were conducted in 2019 and 2020 in a commercial vineyard of ‘Touriga Nacional’ located in the Douro Region. Crop forcing was conducted 15 (CF1) and 30 (CF2) days after fruit set. Vines pruned with conventional methods were used as control (CF0). Results confirmed that fruit ripening was shifted from the hot season (August/September) to a cooler period (October through early November). At harvest, berries from CF1 and CF2 had lower pH and higher acidity than the control, with no significant differences in colour intensity or phenolic composition. Sugar content was lower in CF2-treated vines in both seasons. However, in CF-treated vines the number and size of clusters were significantly lower (up to 88% reduction) than in control plants. A metabolomics analysis of mature berries from CF-treated and control vines is underway. Crop forcing was indeed effective in producing a more balanced berry composition but severely reduced grapevine yield,

Green berries on Gewürztraminer (Vitis vinifera L.) in South Tyrol (Italy)

The grape variety Gewürztraminer is known to be affected by two physiological disorders, namely berry shrivel and bunch stem necrosis. During the 2014 season we noticed a new type of ripening disorder on the variety: not all berries followed the normal maturation stages, and single berries remained at a soft but green stage until harvest. The broad distribution of these so-called “green berries” symptoms across different production sites in our region caused substantial damage because of the difficulty of eliminating single berries from each bunch before harvest. Therefore, the Research Centre Laimburg began to investigate the reasons for and origins of this new symptom. This work presents the results of first attempts to find causes for the symptom, as well as the resulting approach to mitigating it. Applications of magnesium leaf fertilizer showed promising first results against this putative disorder. To study the cause of the green berries, 30 vineyards that were symptomatic in 2014 were selected for monitoring during the 2016 season. To evaluate the foliar nutrient treatment, two vineyards were selected for application of magnesium sulfate and magnesium chloride. Leaf and berry nutrient analyses were performed, and the main quality parameters were measured during ripening. As soon as “green berries” symptoms appeared, incidence and severity were evaluated. Most of the symptomatic vineyards in the 2016 monitoring showed light to clear magnesium deficiency symptoms on their foliage. “Green berries” symptoms were found in the leaf fertilizer treatment vineyards only during the 2020 and 2021 seasons. In both seasons, the magnesium treatments significantly reduced the incidence and severity of the symptom. It seems that the appearance of the “green berries” symptom on Gewürztraminer is correlated with a disturbed uptake of magnesium by the vines.

Impact of climate change on the viticultural climate of the Protected Designation of Origin “Jumilla” (SE Spain)

The Protected Designation of Origin “Jumilla” (PDO Jumilla) is located in the Spanish provinces of Albacete and Murcia, in the south-eastern part of the Iberian Peninsula, where most models predict a severe impact of climate change in the next decades. PDO Jumilla covers an area of 247,054 hectares, of which more than 22,000 hectares

δ13C: a still underused indicator in precision viticulture

The first demonstration of the interest of the carbon isotope composition of sugars in grapevine, as an integrated indicator of vineyard water status, dates back to 2000 (Gaudillère et al., 1999; Van Leeuwen et al., 2001). Thanks to the isotopic discrimination of carbon that takes place during plant photosynthesis under water stress conditions, it is possible to accurately estimate photosynthetic activity. Since then, δ13C has been applied successfully to zonation, terroir studies and vine physiology research, but it is still not widely used by viticulturists. This is surprising, considering that the impact of global warming on viticulture and the need to improve water management would justify widespread use of δ13C.
The lack of private laboratories offering the analysis, the cost of the technology, and the long analytical turnaround times have hindered its development. Some laboratories have tried to overcome the analytical difficulties of isotopic analysis by using Fourier-transform infrared spectroscopy (FTIR) as a fast and cheap alternative to the official OIV method (IRMS). These claimed FTIR models have never been published or peer reviewed and cannot be considered robust. In this work, thanks to the recent acquisition of IRMS technology, new and robust applications of δ13C for viticulture are proposed. These include the use of the analysis to separate parcels at harvest, the possibility of increasing the precision of water stress mapping, and the potential cost reduction compared with Scholander pressure bomb analysis.

Grapevine yield estimation in a context of climate change: the GraY model

Grapevine yield is a key indicator to assess the impacts of climate change and the relevance of adaptation strategies in a vineyard landscape. At this scale, a yield model should use a number of parameters and input data in keeping with the information available, and should be able to reproduce vineyard management decisions (e.g. soil and canopy management, irrigation). In this study, we used data from six experimental sites in Southern France (cv. Syrah) to calibrate a model of grapevine yield limited by water constraint (GraY). Each yield component (bud fertility, number of berries per bunch, berry weight) was calculated as a function of the soil water availability simulated by the WaLIS water balance model at critical phenological phases. The model was then evaluated in 10 grapegrowers’ plots, covering a diversity of biophysical and technical contexts (soil type, canopy size, irrigation, cover crop). We identified three critical periods for yield formation: after flowering in the previous year for the number of bunches and berries, and around pre-veraison and post-veraison of the same year for mean berry weight. Yields were simulated with a model efficiency (EF) of 0.62 (NRMSE = 0.28). Bud fertility and number of berries per bunch were more accurately simulated (EF = 0.90 and 0.77, NRMSE = 0.06 and 0.10, respectively) than berry weight (EF = -0.31, NRMSE = 0.17). Model efficiency on the on-farm plots reached 0.71 (NRMSE = 0.37), simulating yields from 1 to 8 kg/plant. The GraY model is an original model estimating grapevine yield evolution on the basis of water availability under future climatic conditions. It allows the evaluation of various adaptation levers such as planting density, cover crop management, fruit/leaf ratio, shading and irrigation, in various production contexts.
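For readers unfamiliar with the two scores quoted above, the following sketch shows one common way to compute them. It assumes that EF is the Nash-Sutcliffe style model efficiency and that NRMSE is the root mean square error divided by the mean of the observations; the abstract does not state which normalization was used, so both definitions and the function names are assumptions.

```python
import math


def model_efficiency(observed, simulated):
    """EF = 1 - SS_residual / SS_total (assumed Nash-Sutcliffe form).

    EF = 1 is a perfect fit, EF = 0 is no better than predicting the
    observed mean, and EF < 0 is worse than predicting the mean.
    """
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def nrmse(observed, simulated):
    """RMSE normalized by the observed mean (assumed normalization)."""
    mean_obs = sum(observed) / len(observed)
    mse = sum((o - s) ** 2 for o, s in zip(observed, simulated)) / len(observed)
    return math.sqrt(mse) / mean_obs


# Made-up yields in kg/plant, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
simulated = [2.0, 3.0, 4.0, 5.0]
print(model_efficiency(observed, simulated))
print(nrmse(observed, simulated))
```

Under these definitions, a negative EF such as the one reported for berry weight means the simulation explains the observations less well than their own mean would, even if the NRMSE looks moderate.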

A multidisciplinary approach to evaluate the effects of the training system on the performance of “Aglianico del Vulture” vineyards

Vineyards are complex agro-ecosystems with high spatial and temporal variability. An efficient training system may counteract the adverse effects of this variability. Moreover, considering the climate change issues, choosing an efficient training system that enhances water use and protects the vines from radiative thermal stress has become a priority for the farmers. A multidisciplinary approach that assesses the soil-crop-yield-wine relationships of vineyards in a distributed and holistic way could bring added knowledge on the behavior of the different training systems. This ongoing research aimed to implement a multidisciplinary approach to study the behavior of “Aglianico del Vulture” grapevines trained with two different systems: a spurred cordon (SC) and an “Alberello in parete” (AL), grown in a high-quality wine production area of Basilicata region (Italy). The approach merged several methods and scales of soil, ecophysiology, must/wine quality, and spectral data collection to assess the influence of the training system. Homogeneous zones (HZs) in both training systems were defined through a procedure based on geomorphological classification, unmanned aerial vehicles (UAV) images analysis, and a traditional soil survey supported by geophysical scanning. During the 2021 season, TDR probes monitored soil water content, while grapevine health status was assessed using eco-physiological measurements (LWP, chlorophyll content, PSII photosynthetic efficiency, LAI, and point-based field spectroscopy). These grapevine in-vivo measurements validated the spectral vegetation indexes (NDVI, RENDVI, CVI, and TVI) derived from the UAV multispectral imagery, which monitored the grapevine status in a distributed and non-invasive way. Grape yield, quality of berries, must and wine were measured to assess the effects of the training systems. 
The first experimental year results showed the variability of the vineyards and revealed relationships among soil parameters, crop characteristics, and vegetation indices of the SC and AL training systems. This multidisciplinary study could bring new insights into the vineyard training system’s effects on grape yield and wine quality.

Soil, vine, climate change – what is observed – what is expected

Evaluating the current and future impact of climate change on viticulture requires an integrated view of a complex interacting system within the soil-plant-atmosphere continuum, which is itself under continuous change. Aside from the globally observed increase in temperature in basically all viticulture regions for at least four decades, we observe several clear trends at the regional level in the ratio of precipitation to potential evapotranspiration. Additionally, the recently published 6th assessment report of the IPCC (The Physical Science Basis) shows case-dependent further expected shifts in climate patterns, which will have substantial impacts on the way we conduct viticulture in the decades to come.
Looking beyond climate developments, we observe rising temperatures in the upper soil layers, which will have an impact on the distribution of microbial populations, the decay rate of organic matter and the storage capacity for carbon, thus affecting the emission of greenhouse gases (GHGs), and on the viscosity of water in the soil-plant pathway, altering the transport of water. If the upper soil layers dry out faster because of less rainfall and/or increased evapotranspiration driven by higher temperatures, the spectral reflection properties of bare soil change and the transport of latent heat into the fruiting zone increases, putting a higher temperature load on the fruit. Interactions between micro-organisms in the rhizosphere and the grapevine root system are poorly understood, but they respond to environmental factors (such as increased soil temperatures), to the plant material (rootstock, for instance) and to the cultivation system (for example bio-organic versus conventional). This adds up to an extremely complex system to manage in terms of increased resilience, adaptation to, and even mitigation of climate change. Nevertheless, taken as a whole, effects on the individual expression of wines of a given origin seem highly likely to become more apparent.

Impact of geographical location on the phenolic profile of minority varieties grown in Spain. II: red grapevines

Because terroir and cultivar are drivers of wine quality, it is essential to investigate their effects on the polyphenolic profile before promoting the planting of a minority red variety in a specific area. This work, part of the MINORVIN project, focuses on the polyphenolic profile of 7 red grapevine varieties of Vitis vinifera L. (the minority varieties Morate, Sanguina, Santafé, Terriza, Tinta Jeromo and Tortozona Tinta, and Tempranillo) from six typical Spanish viticultural areas: Aragón (A1), Cataluña (A2), Castilla-La Mancha (A3), Castilla-León (A4), Madrid (A5) and Navarra (A6), in the 2020 season. Polyphenolic substances were extracted from grapes. 35 compounds were identified and quantified (mg substance/kg fresh berry) by HPLC and grouped into anthocyanins (ANT), flavanols (FLAVA), flavonols (FLAVO), hydroxycinnamic acids (AH), benzoic acids (BA) and stilbenes (ST). Antioxidant activity (AA, mmol TE/g fresh berry) was determined by the DPPH method. The results were submitted to a two-way ANOVA to investigate the influence of variety, area and their interaction on each polyphenolic family, and cluster analysis was used to construct hierarchical dendrograms, searching for natural groupings among the samples. Sanguina (A3) had the highest total polyphenols, while Tempranillo (A5) had the highest ANT. Sanguina (A2) and Sanguina (A3) reached the highest values of FLAVO, FLAVA and AA. The effects of cultivar and area were significant for all polyphenolic families analyzed. High variability due to variety (>50%) was observed in FLAVA, and the greatest variability due to growing area was detected in AA (86.41%), ANT and FLAVO (51%); the variety × area interaction was significant only for ANT, FLAVO, ST and AA. Finally, the dendrograms presented five clusters: i) Sanguina (A2); ii) Sanguina (A3); iii) Tempranillo (A5); iv) Tempranillo (A3), Terriza (A3, A5), Morate (A5, A6); v) Santafé (A1, A6), Tortozona Tinta (A1, A3, A6), Tinta Jeromo (A3, A4).