Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

Under-vine management effects on grapevine production, soil properties and plant communities in South Australia

Under-vine (UV) management has traditionally consisted of synthetic herbicide use to limit competition between weeds and grapevines. With growing global interest towards non-synthetic chemical use, this study aimed to capture the effects of alternative UV management at two commercial Shiraz vineyards in South Australia, where the sole management variables were UV management since 2016. In adjacent treatment blocks, cultivation (CU) was compared to spontaneous vegetation (SV) in McLaren Vale (MV), and herbicide was compared to SV in Eden Valley (EV). Soil water infiltration rates were slower and grapevine stem water potential was lower in CU compared to SV in MV, with the latter having a plant community dominated by soursob (Oxalis pes-caprae) during winter; while in EV, there was little separation between the treatments. Yields were affected at both sites, with SV being higher in MV and HE being higher in EV. In MV, the only effect on grape must was a lower 13C:12C isotope ratio in CU, indicating greater grapevine water stress. In the grape must at EV, SV had higher total soluble solids, total phenolics, anthocyanins, and yeast available nitrogen; and lower pH and titratable acidity. Pruning weights were not affected by the treatments in MV, while they were higher in HE at EV. Assessments revealed that the differing soil types at the two sites were likely the main determinants of the opposing production outcomes associated with UV management. In the silty loam soil of MV, the higher yields in SV were likely due to more plant-available water, as a potential result of the continuous soil bio-pores formed by winter UV vegetation. Conversely, in the loamy sand soils of EV with a lower cation exchange capacity, the lower yields and pruning weights in SV suggest the UV vegetation competed significantly with the grapevines for available water and nutrients.

The use of rootstock as a lever in the face of climate change and dieback of vineyard

As viticulture faces challenges such as climate change or vineyard dieback, the choice of the variety and rootstock becomes more and more crucial. To study rootstock levers in the Bordeaux region, a parcel of Cabernet Sauvignon (CS) was planted with four rootstocks in 2014. Twenty repetitions of each of the following four rootstocks were set up: 101-14 MGt, Nemadex AB, 420A MGt and Gravesac. The number of bunches, yields and pruning weights of the vine shoots were measured individually on 240 vines from 2017 to 2021. Since 2020, nitrogen status assessed by assimilable nitrogen level, hydric status assessed by δ13C and berry maturity were measured on 80 samples taken from 20 repetitions of the four rootstocks. A lower yield was measured for CS grafted onto Nemadex AB due to the lower number of bunches and the lower weight of berries. The differences between the other three rootstocks are small, but CS grafted onto 420A MGt was the most productive. The CS grafted onto Nemadex AB had the lowest pruning weight while 101-14 MGt had the highest. In 2020, δ13C showed a more moderate water stress with 101-14 MGt and 420A MGt than with Nemadex AB. Surprisingly, the Gravesac was under more stress than the 101-14 MGt. The nitrogen status in the berries was better for Nemadex AB but this was perhaps due to the significantly lower weight of the berries.Rootstock 101-14 MGt attained the highest accumulation of sugars in the berries while 420A MGt allows to preserve higher acidity. The parcel is still young which may explain some of the results. These measures must therefore be continued over the next several years to fully assess the effects of these rootstocks on the development of the vines and the quality of the production under new climatic conditions.

Grapevine yield estimation in a context of climate change: the GraY model

Grapevine yield is a key indicator to assess the impacts of climate change and the relevance of adaptation strategies in a vineyard landscape. At this scale, a yield model should use a number of parameters and input data in relation to the information available and be able to reproduce vineyard management decisions (e.g. soil and canopy management, irrigation). In this study, we used data from six experimental sites in Southern France (cv. Syrah) to calibrate a model of grapevine yield limited by water constraint (GraY). Each yield component (bud fertility, number of berries per bunch, berry weight) was calculated as a function of the soil water availability simulated by the WaLIS water balance model at critical phenological phases. The model was then evaluated in 10 grapegrowers’ plots, covering a diversity of biophysical and technical contexts (soil type, canopy size, irrigation, cover crop). We identified three critical periods for yield formation: after flowering on the previous year for the number of bunches and berries, around pre-veraison and post-veraison of the same year for mean berry weight. Yields were simulated with a model efficiency (EF) of 0.62 (NRMSE = 0.28). Bud fertility and number of berries per bunch were more accurately simulated (EF = 0.90 and 0.77, NRMSE = 0.06 and 0.10, respectively) than berry weight (EF = -0.31, NRMSE = 0.17). Model efficiency on the on-farm plots reached 0.71 (NRMSE = 0.37) simulating yields from 1 to 8 kg/plant. The GraY model is an original model estimating grapevine yield evolution on the basis of water availability under future climatic conditions.  It allows to evaluate the effects of various adaptation levers such as planting density, cover crop management, fruit/leaf ratio, shading and irrigation, in various production contexts.

Elevational range shifts of mountain vineyards: Recent dynamics in response to a warming climate

Increasing temperatures worldwide are expected to cause a change in spatial distribution of plant species along elevational gradients and there are already observable shifts to higher elevations as a consequence of climate change for many species. Not only naturally growing plants, but also agricultural cultivations are subject to the effects of climate change, as the type of cultivation and the economic viability depends largely on the prevailing climatic conditions. A shift to higher elevations therefore represents a viable adaptation strategy to climate change, as higher elevations are characterized by lower temperatures. This is especially important in the case of viticulture because a certain wine-style can only be achieved under very specific climatic conditions. Although there are several studies investigating climatic suitability within winegrowing regions or longitudinal shifts of winegrowing areas, little is known about how fast vineyards move to higher elevations, which may represent a viable strategy for winegrowers to maintain growing conditions and thus wine-style, despite the effects of climate change. We therefore investigated the change in the spatial distribution of vineyards along an elevational gradient over the past 20 years in the mountainous wine-growing region of Alto Adige (Italy). A dataset containing information about location and planting year of more than 26000 vineyard parcels and 30 varieties was used to perform this analysis. Preliminary results suggest that there has been a shift to higher elevations for vineyards in general (from formerly 700m to currently 850 m a.s.l., with extreme sites reaching 1200 m a.s.l.), but also that this development has not been uniform across different varieties and products (i.e. vitis vinifera vs hybrid varieties and still vssparkling wines). This is important for climate change adaptation as well as for rural development. Mountain areas, especially at mid to high elevations, are often characterized by severe land abandonment which can be avoided to some degree if economically viable and sustainable land management strategies are available.

Legacy of land-cover changes on soil erosion and microbiology in Burgundian vineyards

Soils in vineyards are recognized as complex agrosystems whose characteristics reflect complex interactions between natural factors (lithology, climate, slope, biodiversity) and human activities. To date, most of the unknown lies in an incomplete understanding of soil ecosystems, and specifically in the microbial biodiversity even though soil microbiota is involved in many key functions, such as nutrient cycling and carbon sequestration. Soil biological properties are indicative of soil quality. Therefore, understanding how soil communities are related to soil ecosystem functioning is becoming an essential issue for soil strategy conservation. Here, we propose to assess the importance of land-cover history on the present-day microbiological and physico-chemical properties. The studied area was selected in the Burgundian vineyards (Pernand-Vergelesses, Burgundy, France) where land occupation has been reconstructed over the last 40 years. Soil samples were collected in five areas reflecting various land cover history (forest, vineyards, shifting from forest to vineyards). For each area, physico-chemical parameters (pH, C, N, P, grain size) were measured and DNA was extracted to characterize the abundance and diversity of microbial communities. The obtained results show significant differences in the five areas suggesting that present-day microbial molecular biomass and bacterial taxonomic is partly inherited from past land occupation. Over longer period of time, such study of land-uses legacies may help to better assess ecosystem recovery and the impact of management practices for a better soil quality and vineyards sustainability.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

Combining effect of leaf removal and natural shading on grape ripening under two irrigation strategies in Manto negro (Vitis vinifera L.)

The increasingly frequent heat waves during grape ripening pose challenges for high quality wine grape production. Defoliation is a common practice that can improve the control of diseases in bunches, but also it increases the exposure to sunlight. Grapes exposed to solar radiation reach temperatures over the optimum for berry development and maturation. This makes the development of irrigation and canopy management techniques of great importance to maximize yield and grape quality. A field experiment was carried out during 2021 using Manto negro wine grapes to study the effect of applied irrigation and different light exposure levels on grape quality. Two irrigation treatments were imposed based on the frequency and amount of water doses in a four-block experimental vineyard at Bodega Ribas (Mallorca). Three light exposure treatments were randomly applied in each irrigation plot. The light treatments included exposed clusters from pea size, non-exposed clusters, and shaded clusters after softening. Leaf area index and canopy porosity was estimated every 2 weeks. Midday leaf water potential was measured weekly. Additionally, apparent electrical conductivity was measured between rows to estimate the soil water content variability. Light and temperature sensors were installed at the bunch level to quantify the differences in bunch temperature and light intensity among treatments. The effect of irrigation and cluster light exposure on berry weight, TSS, TA, malic acid, tartaric acid, K+, and pH were analysed at 5 moments along grape ripening. During different heat waves, the natural shading technique decreased the maximum bunch temperature around 10 °C respect to the exposed bunches in both irrigation strategies. The combination of defoliation and shading techniques after softening decreased TSS at harvest and affected most of the quality parameters during the last stages of ripening, showing an interesting technique to delay ripening in warm viticulture areas.

Analysis of Cabernet Sauvignon and Aglianico winegrape (V. vinifera L.) responses to different pedo-climatic environments in southern Italy

Water deficit is one of the most important effects of climate change able to affect agricultural sectors. In general, it determines a reduction in biomass production, and for some plants, as in the case of grapevine, it can endorse fruit quality. The monitoring and management of plant water stress in the vineyard

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Grape must quality and mesoclimatic variability in Fruška Gora wine-growing region, Serbia

The Fruška Gora mountain is a traditional wine-growing region in Serbia situated in the Pannonian Basin. Due to such a position, the vicinity of the Danube River and the presence of concave configuration, it is suitable for grape production. This paper provides analyses of spatial variations in meteorological parameters and grape juice quality within Fruška Gora wine region over three consecutive vintages (2018-2020). The examined period can be defined as warm with cool nights during September (AVG 18,9°C; GDD 1918°C; CI 12°CF) and with the presence of mesoclimatic variability. The East part of the study area was somewhat drier and hotter compared to other parts of the region. The analyses of grape must samples (190 in total) of five cultivars (Cabernet-Sauvignon, Merlot, Chardonnay, Sauvignon blanc and Grašac (Welschriesling)) commonly grown across the region (19 sites), were performed using Fourier Transform Infrared Technology (FTIR). Among all cultivars, Sauvignon blanc was harvested first in the East area (DOY=246±5, GDD at harvest=1552±74, 22.2±0.7 °Brix), while the latest harvest was recorded for Cabernet-Sauvignon in the West (DOY=283±5, GDD at harvest=1936±187, 23.4±1.0 °Brix ). Both the red and white cultivars had higher acidity and YAN in the grape must if the vines were grown in the North and East compared to South and West areas. According to PCA analysis, Grašac showed the lowest variation in grape must chemical composition. Thus, the results confirm that Grašac is the most stable cultivar in Fruška Gora. All monitored cultivars reached technological fruit ripeness by the end of the growing season. However, it was difficult to reach full ripeness of red cultivars, mostly beacuse of uncoupling of technolocical and phenolic ripeness. Thus, Cabernet-Sauvignon had higher variations in GDD sums at harvest compared to other cultivars, which probably increased variations in grape must quality.

Evaluation of climate change impacts at the Portuguese Dão terroir over the last decades: observed effects on bioclimatic indices and grapevine phenology

In the last decades the growers of the Portuguese Dão winegrowing region (center of Portugal) are experiencing changes in climate that are influencing either grape phenology berry health and ripening. Aiming to study the relationships between climate indices (CI), seasonal weather and grapevine phenology, in this work long-term climate and phenological data collected at the experimental vineyard of the Portuguese Dão research centre between 1958 and 2019 (61 years) for the red variety Touriga Nacional, was analyzed. The trends over time for the classical temperature-based indices (Growing Season Temperature – GST -, Growing Degree Days – GDD, Huglin Index – HI and Cool Night Index – CI) presented a significantly positive slope while the Dryness Index (DI) showed a negative trend over the last 61 years. Regarding grapevine phenology, an average advance of 4.5 days per decade in the harvest day was observed throughout the last 61 years. Consequently, the weather conditions during the ripening period have changed, showing an increasing trend over time in the average temperature (higher magnitude in the maximum than in the minimum temperature) and a decrease in the accumulated rainfall. A regression analysis showed that ~50% of harvest date variability over years was explained by the temperature-based indices variability. These observed effects of climate change on bioclimatic indices and corresponding anticipation of harvest date can still be considered advantageous for the Dão terroir as it allows to achieve an optimal berry ripening before the common equinox rains and, therefore, avoid the potential negative impacts of the rainfall on berry health and composition.