Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

Adaptability of grapevines to climate change: characterization of phenology and sugar accumulation of 50 varieties, under hot climate conditions

Climate is the major factor influencing the dynamics of the vegetative cycle and can determine the timing of phenological periods. Knowledge of the phenology of varieties, their chronological duration, and thermal requirements, allows not only for the better management of interventions in the vineyard, but also to predict the varieties’ behaviour in a scenario of climate change, giving the wine producer the possibility of selecting the grape varieties that are best adapted to the climatic conditions of a certain terroir. In 2014, Symington Family Estates, Vinhos, established two grape variety libraries in two different places with distinctive climate conditions (Douro Superior, and Cima Corgo), with the commitment of contributing to a deeper agronomic and oenological understanding of some grape varieties, in hot climate conditions. In these research vineyards are represented local varieties that are important in the regional and national viticulture, but also others that have over time been forgotten — as well as five international reference cultivars. From 2017 to 2021, phenological observations have been made three times a week, following a defined protocol, to determine the average dates of budbreak, flowering and veraison. With the climate data of each location, the thermal requirements of each variety and the chronological duration of each phase have been calculated. During maturation, berry samples have been gathered weekly to study the dynamics of sugar accumulation, between other parameters. The data was analysed applying phenological and sugar accumulation models available in literature. The results obtained show significant differences between the varieties over several parameters, from the chronological duration and thermal requirements to complete the various stages of development, to the differences between the two locations, confirming the influence of the climate on phenology and the stages of maturation, in these specific conditions.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.

Modeling island and coastal vineyards potential in the context of climate change

Climate change impacts regional and local climates, which in turn affects the world’s wine regions. In the short term, these modifications rises issues about maintaining quality and style of wine, and in a longer term about the suitability of grape varieties and the sustainability of traditional wine regions. Thus, adaptation to climate change represents a major challenge for viticulture. In this context, island and coastal vineyards could become coveted areas due to their specific climatic conditions. In regions subject to warming, the proximity of the sea can moderate extremes temperatures, which could be an advantage for wine. However, coastal and island areas are particular prized spaces and subject to multiple pressures that make the establishment or extension of viticulture complex.
In this perspective, it seems relevant to assess the potentialities of coastal and island areas for viticulture. This contribution will present a spatial optimization model that tends to characterize most suitable agroclimatic patterns in historical or emerging vineyards according to different scenarios. Thanks to an in-depth bibliography a global inventory of coastal and insular vineyards on a worldwide scale has been realized. Relevant criteria have been identified to describe the specificities of these vineyards. They are used as input data in the optimization process, which will optimize some objectives and spatial aspects. According to a predefined scenario, the objectives are set in three main categories associated with climatic characteristics, vineyards characteristics and management strategies. At the end of this optimization process, a series of maps presents the different spatial configurations that maximize the scenario objectives.

Grape must quality and mesoclimatic variability in Fruška Gora wine-growing region, Serbia

The Fruška Gora mountain is a traditional wine-growing region in Serbia situated in the Pannonian Basin. Due to such a position, the vicinity of the Danube River and the presence of concave configuration, it is suitable for grape production. This paper provides analyses of spatial variations in meteorological parameters and grape juice quality within Fruška Gora wine region over three consecutive vintages (2018-2020). The examined period can be defined as warm with cool nights during September (AVG 18,9°C; GDD 1918°C; CI 12°CF) and with the presence of mesoclimatic variability. The East part of the study area was somewhat drier and hotter compared to other parts of the region. The analyses of grape must samples (190 in total) of five cultivars (Cabernet-Sauvignon, Merlot, Chardonnay, Sauvignon blanc and Grašac (Welschriesling)) commonly grown across the region (19 sites), were performed using Fourier Transform Infrared Technology (FTIR). Among all cultivars, Sauvignon blanc was harvested first in the East area (DOY=246±5, GDD at harvest=1552±74, 22.2±0.7 °Brix), while the latest harvest was recorded for Cabernet-Sauvignon in the West (DOY=283±5, GDD at harvest=1936±187, 23.4±1.0 °Brix ). Both the red and white cultivars had higher acidity and YAN in the grape must if the vines were grown in the North and East compared to South and West areas. According to PCA analysis, Grašac showed the lowest variation in grape must chemical composition. Thus, the results confirm that Grašac is the most stable cultivar in Fruška Gora. All monitored cultivars reached technological fruit ripeness by the end of the growing season. However, it was difficult to reach full ripeness of red cultivars, mostly beacuse of uncoupling of technolocical and phenolic ripeness. Thus, Cabernet-Sauvignon had higher variations in GDD sums at harvest compared to other cultivars, which probably increased variations in grape must quality.

Copper contamination in vineyard soils of Bordeaux: spatial risk assessment for the replanting of vines and crops

Copper (Cu) is widely and historically used in viticulture as a fungicide against mildew. Cu has a strong affinity for soil organic matter and accumulates in topsoil horizons. Thus, Cu may negatively affect soil organisms and plants, consequently reducing soil fertility and productivity. The Bordeaux vineyards have the largest vineyard surfaces (26%) within French controlled appellation and a great proportion of French wine production (around 5 million hl per year). Considering the local context of vineyard surfaces decreasing (vine uprooting) and possible new crop plantation, the issue of Cu potential toxicity rises. Therefore, the aims of this work are firstly to evaluate the Cu contamination in vineyard soils of Bordeaux, secondly to produce a risk assessment map for new vine or crop plantation. We used soil analyses from several local studies to build a database with 4496 soil horizon samples. The database was enhanced by means of pedotransfer functions in order to estimate the bioaccessible (EDTA-extractable) Cu in soils of samples without measurements. From this database, 1797 georeferenced samples with CuEDTA concentrations in the topsoil (0-50 cm depth) were used for kriging interpolation in order to produce the spatial distribution map of CuEDTA in vineyard soils. Then, the spatial distribution of Cu was crossed with vine uprooting surfaces and municipality boundaries. CuEDTAconcentrations ranged from 0.52 to 459 mg/kg and showed clear anomalies. Our results from spatial analysis showed that almost 50% of vineyard soil surfaces have CuEDTA concentrations higher than 30 mg/kg (moderate risk for new plantation) and 20% with concentrations higher than 50 mg/kg (high risk for new plantation). A decision-support map based on municipalities was realised to provide a simple tool to stakeholders concerned by land use management.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

Spatial determination of areas in the Western Balkans region favorable for organic production

In problematic conditions for production of grapes and wine caused by the COVID-19 pandemic and the resulting occurrence of wine surpluses, producers are increasingly turning to the innovative viticulture and winemaking of products that are more appealing to the market and the consumers. On the other hand, consumption of the food safety or organic products, and therefore of organic grapes and wine, is increasingly common in the world, in particular in Europe. The Regional Rural Development Standing Working Group (SWG RRD), as a regional intergovernmental organization gathers actors in the viticulture and winemaking sector from states and territories of the Western Balkans (South-East Europe) in the Expert Working Group for Wine, with the aim of improving viticulture and winemaking in this region through joint activities. In accordance with the aforementioned, the SWG RRD is working on advancing organic production of grapes and wine, and on recognition of specificities of the terroir of wine-growing areas in Western Balkans. In addition, as part of the project “Facilitation of Exchange and Advice on Wine Regulations in Western Balkan Countries” helmed by the German Federal Ministry of Food and Agriculture, in addition to harmonization of relevant legislation with EU regulations, efforts are being invested towards recognition of organic wines. Within activities and project implemented by this organization, expert analyses and scientific research of the terroir of Western Balkans were carried out, and some of the results are presented in this paper.

Modulation of berry composition by different vineyard management practices

High concentration of sugars in grapes and alcohol in wines is one of the consequences of climate change on viticulture production in several wine-growing regions. In order to investigate the possibilities of adaptation of vineyard management practices aimed to reduce the accumulation of sugar during the maturation phase without reducing the accumulation of anthocyanins in grapes, a study with severe shoot trimming, shoot thinning, cluster thinning and date of harvest was conducted on Merlot variety in Istria region (Croatia), under the Mediterranean climate. Four factors which may affect grape maturation and its composition at harvest were investigated in a two-years experiment; severe shoot trimming applied at veraison when >80% of berries changed colour (in comparison to untreated control), shoot thinning (0 and 30%), cluster thinning (0 and 30%), and the date of harvest (early and standard harvest dates). Shoot thinning had no significant impact on berry composition, despite the obtained reduction in yield per vine. Lower Brix in grapes were obtained with earlier harvest date and if no cluster thinning was applied, although at the same time a reduction in the concentration of anthocyanins in berries was observed in these treatments. On the other hand, if severe shoot trimming was applied when >80% of berries changed colour, a reduction of Brix was obtained without a negative impact on berry anthocyanins concentration. We conclude that in cases when undesirably high sugar concentrations at harvest are expected, severe shoot trimming at 80% veraison may effectively be used in order to obtain moderate sugar concentration in berries together with the adequate phenolic composition.

Use of a new, miniaturized, low-cost spectral sensor to estimate and map the vineyard water status from a mobile 

Optimizing the use of water and improving irrigation strategies has become increasingly important in most winegrowing countries due to the consequences of climate change, which are leading to more frequent droughts, heat waves, or alteration of precipitation patterns. Optimized irrigation scheduling can only be based on a reliable knowledge of the vineyard water status.

In this context, this work aims at the development of a novel methodology, using a contactless, miniaturized, low-cost NIR spectral tool to monitor (on-the-go) the vineyard water status variability. On-the-go spectral measurements were acquired in the vineyard using a NIR micro spectrometer, operating in the 900–1900 nm spectral range, from a ground vehicle moving at 3 km/h. Spectral measurements were collected on the northeast side of the canopy across four different dates (July 8th, 14th, 21st and August 12th) during 2021 season in a commercial vineyard (3 ha). Grapevines of Vitis vinifera L. Graciano planted on a VSP trellis were monitored at solar noon using stem water potential (Ψs) as reference indicators of plant water status. In total, 108 measurements of Ψs were taken (27 vines per date).

Calibration and prediction models were performed using Partial Least Squares (PLS) regression. The best prediction models for grapevine water status yielded a determination coefficient of cross-validation (r2cv) of 0.67 and a root mean square error of cross-validation (RMSEcv) of 0.131 MPa. This predictive model was employed to map the spatial variability of the vineyard water status and provided useful, practical information towards the implementation of appropriate irrigation strategies. The outcomes presented in this work show the great potential of this low-cost methodology to assess the vineyard stem water potential and its spatial variability in a commercial vineyard.

Impact of long term agroecological and conventional practices on subsurface soil microbiota in Macabeu and Xarel·lo vineyards

There is a growing trend on the transition from conventional to agroecological management of vineyards. However, the impact of practices, such as reduced-tillage, organic fertilization and cover crops, is not well-understood regarding the soil microbial diversity, and its relationship with the soil physicochemical properties in the subsurface depth near the rooting zone. Soil bacterial diversity is an important contributor towards plant health, productivity and response to environmental stresses. A field experiment was conducted by sampling subsurface soil bacterial community (NGS and qPCR) near to the root zone of Macabeu and Xarel·lo vineyards, located at the Penedes. 3 organic (ECO) and 3 conventional (CON) vineyards, with more than 10 years of respective management were sampled (n=5 each plot). ECO practices did not affect bacterial and fungal abundance but increased significantly the ammonium oxidizing bacteria and alpha-diversity (Inv.Simpson). Interestingly beta-diversity was significantly affected by the management strategy. ANOSIM-tests revealed a significative effect of the management (ecological vs conventional) and plot, on the soil microbial structure (ASV abundance). Main phyla depicted were Proteobacteria, Actinobacteria and Acidobacteria, whose relative abundances were not affected by the management. EdgeR assay revealed a significant increase of Cyanobacteria and decrease of Gemmatimonadetes and Firmicutes phyla in ECO. Interestingly, the grapevine variety was not correlated with the soil microbial community structure. Mantel-test revealed an important correlation (Spearman) of some physicochemical parameters with the soil microbiota structure, in order of importance: texture, EC, pH Ca/Mg, Mg/P, K+, Mg2+, Ca2+, SO42-, and OM. N-NH4 and NTK, which were higher in the ECO managed soils, did not correlated significantly with the soil microbiome population. The results revealed the importance of combining a deep physicochemical characterization of each replicate with the microbial diversity assessment to gain better insights on the relationship between soil microbiome and vineyard management.

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldomly thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Lose Rate (YLR). Additionally, we also investigate whether an advanced flowering-veraison phase may help alleviating the water stress with improved yield. A process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is being implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest wine regions with stronger flowering-veraison CWSI tend to have higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tends to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter possesses severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, which are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR. For those of France-Germany-Luxembourg, this can have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because of shifted phase towards a cooler growing season with reduced evaporative demands. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.