Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

What are the optimal ranges and thresholds for berry solar radiation for flavonoid biosynthesis?

In wine grape production, canopy management practices are applied to control the source-sink balance and improve the cluster microclimate to enhance berry composition. The aim of this study was to identify the optimal ranges of berry solar radiation exposure (exposure) for upregulation of flavonoid biosynthesis and thresholds for their degradation, to evaluate how canopy management practices such as leaf removal, shoot thinning, and a combination of both affect the grapevine (Vitis vinifera L. cv. Cabernet Sauvignon) yield components, berry composition, and flavonoid profile under context of climate change. First experiment assessed changes in the grape flavonoid content driven by four degrees of exposure. In the second experiment, individual grape berries subjected to different exposures were collected from two cultivars (Cabernet Sauvignon and Petit Verdot). The third experiment consisted of an experiment with three canopy management treatments (i) LR (removal of 5 to 6 basal leaves), (ii) ST (thinned to 24 shoots per vine), and (iii) LRST (a combination of LR and ST) and an untreated control (UNT). Berry composition, flavonoid content and profiles, and 3-isobutyl 2-methoxypyrazine were monitored during berry ripening. Although increasing canopy porosity through canopy management practices can be helpful for other purposes, this may not be the case of flavonoid compounds when a certain proportion of kaempferol was achieved. Our results revealed different sensitivities to degradation within the flavonoid groups, flavonols being the only monitored group that was upregulated by solar radiation. Within different canopy management practices, the main effects were due to the ST. Under environmental conditions given in this trial, ST and LRST hastened fruit maturity; however, a clear improvement of the flavonoid compounds (i.e., greater anthocyanin) was not observed at harvest. Methoxypyrazine berry content decreased with canopy management practices studied. Although some berry traits were improved (i.e. 2.5° Brix increase in berry total soluble solids) due to canopy management practices (ST), this resulted in a four-fold increase in labor operations cost, two-fold decrease in yield with a 10-fold increase in anthocyanin production cost per hectare that should be assessed together as the climate continues to get hot.

Postveraison shoot trimming in Tannat and Merlot: preliminary results on yield components, plant balance and berry composition

There is currently a trend towards the production of wines with low alcohol content. To achieve this, grapes with low sugar content must be used. There are techniques at the vineyard level that can delay ripening and avoid excessive sugar accumulation without, a priori, affecting the final polyphenol content. Postveraison shoot trimming (PVST) is experimentally evaluated for these purposes, but its impact under Uruguayan climatic conditions with high interannual variability is not known. The aim of this work is to assess the PVST in Tannat and Merlot cultivars and their impact on yield components, plant balance and berry primary composition. In this study, two commercial vineyards of 10 years old Tannat and Merlot (grafted on SO4) at Canelones Department were selected. During the 2020-201 growing season, grapevines were submitted to PVST when grapes reached 15º Brix. In a randomized block, trimmed (T) and control (C) plants were evaluated with three repetitions each cultivar. Evaluation of the evolution of primary berry composition during ripening, measurement of yield components and plant balance were performed. For both cultivars, PVST did not affect yield components. Merlot reached 5.4 kg per plant and Tannat 7.1 kg, with not statistical significance between treatments. However, statistical differences were observed in terms of plant balance. In Merlot Ravaz Index reached a difference of 5.3 (12.0 in T and 6.7 in C) meanwhile Tannat reached 3.5 of statistical difference (13.7 in T and 10.2 in C). The tendency to imbalance for the treated plants had an impact on the final grape composition. Merlot grapes showed statistical difference in final total acidity (0.3 g of difference between treatments) while treatments impact final sugar content on Tannat grapes (10.0 g of difference between treatments). Further studies are needed to assess the impact of different canopy management techniques in our conditions.

Phenological characterization of a wide range of Vitis Vinifera varieties

In order to study the impact of climate change on Bordeaux grape varieties and to assess the adaptation capacities of candidates to the grape varieties of this wine region to the new climatic conditions, an experimental block design composed of 52 grape varieties was set up in 2009 at the INRAE Bordeaux Aquitaine center. Among the many parameters studied, the three main phenological stages of the vine (budburst, flowering and veraison) have been closely monitored since 2012. Observations for each year, stage and variety were carried out on four independent replicates. Precocity indices have been calculated from the data obtained over the 2012-2021 period (Barbeau et al. 1998). This work allowed to group the phenological behaviour of the grapevine varieties, not only based on the timing of the subsequent developmental stages, but also on the overall precocity of the cycle and the total length of the cycle between budburst and veraison. Results regarding the variability observed among the different grape varieties for these phenological stages are presented as heat maps.

How does aromatic composition of red wines, resulting from varieties adapted to climate change, modulate fruity aroma?

One of the major issues for the wine sector is the impact of climate change linked to the increasing temperatures which affects physicochemical parameters of the grape varieties planted in Bordeaux vineyard and consequently, the quality of wine. In some varietals, the attenuation of their fresh fruity character is accompanied by the accentuation of dried-fruit notes [1]. As a new adaptive strategy on climate change, some winegrowers have initiated changes in the Bordeaux blend of vine varieties [2]. This study intends to explore the fruitiness in wines produced from grape varieties adapted to the future climate of Bordeaux. 10 commercial single–varietal wines from 2018 vintage made from the main grape varieties in the Bordeaux region (Cabernet franc, Cabernet-Sauvignon and Merlot) as well as from indigenous grape varieties from the Mediterranean basin, such as Cyprus (Yiannoudin), France (Syrah), Greece (Agiorgitiko and Xinomavro), Portugal (Touriga Nacional) and Spain (Garnacha and Tempranillo), were selected among 19 samples using sensory descriptive analyses. Both sensory and instrumental analyses were coupled, to investigate their fruity aroma expression. For sensory analysis, samples were prepared from wine, using a semi preparative HPLC method which preserves wine aroma and isolates fruity characteristics in 25 specific fractions [3,4]. Fractions of interest with intense fruity aromas were sensorially selected for each wine by a trained panel and mixed with ethanol and microfiltered water to obtain fruity aromatic reconstitutions (FAR) [5]. A free sorting task was applied to categorize FAR according to their similarities or dissimilarities, and different clusters were highlighted. Instrumental analysis of the different FAR and wines demonstrated variations in their molecular composition. Results obtained from sensory and gas chromatography analysis enrich the knowledge of the fruity expression of red wines from “new” grape varieties opening up new perspectives in wine technology, including blending, thus providing new tools for producers.

Impact of changes in pruning practices on vine growth and yield

A gradual decline in vineyards has been observed over the past twenty years worldwide. This might be explained by the climate change, practices change or the increase of dieback diseases. To increase the longevity of vines, we studied the impact of different pruning strategies in four adult and four young vineyards located in France and Spain. In France, vineyards were planted with Cabernet franc on 3309C while Spanish trials were planted with Tempranillo grafted on 110R. Vegetative expression, yield, quality of berries and wood vessels conductivity were measured. The distribution of vegetative expression, yield and berry composition between primary and secondary vegetation were quantified. Finally, tomography was used to evaluate the implication of the treatments on sap flows.
First results show that i) the respectful pruning leads to an increase of 30 to 50% more secondary shoots than the aggressive pruning in France and between 15 and 20% in Spain, ii) there is no major effect on the yield over the first two years following the implementation of the new pruning practices, although the proportion of clusters from suckers is higher on the respectful pruning method. On young vines, the development of the trunk according to a respectful pruning leads to a loss of harvest 2 years after planting. This is due to the removal, on the future trunk, of the green suckers which carrying bunches. This operation carried out in spring rather than during winter pruning, would promote a better leaf / fruit balance when the plant comes into production, and could lead to better hydraulic conduction in the vessels of the trunk. Maintaining these trials for several years will provide more robust data to assess the impact of these practices on the vines over the long term.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

VINIoT: Precision viticulture service for SMEs based on IoT sensors network

The main innovation in the VINIoT service is the joint use of two technologies that are currently used separately: vineyard monitoring using multispectral imaging and deployed terrain sensors. One part of the system is based on the development of artificial intelligence algorithms that are feed on the images of the multispectral camera and IoT sensors, high-level information on water stress, grape ripening status and the presence of diseases. In order to obtain algorithms to determine the state of ripening of the grapes and avoid losing information due to the diversity of the grape berries, it was decided to work along the first year 2020 at berry scale in the laboratory, during the second year at the cluster scale and on the last year at plot scale. Different varieties of white and red grapes were used; in the case of Galicia we worked with the white grape variety Treixadura and the red variety Mencía. During the 2020 and 2021 campaigns, multispectral images were taken in the visible and infrared range of: 1) sets of 100 grapes classifying them by means of densimetric baths, 2) individual bunches. The images taken with the laboratory analysis of the ripening stage were correlated. Technological maturity, pH, probable degree, malic acid content, tartaric acid content and parameters for assessing phenolic maturity, IPT, anthocyanin content were determined. It has been calculated for each single image the mean value of each spectral band (only taking into account the pixels of interest) and a correlation study of these values with laboratory data has been carried out. These studies are still provisional and it will be necessary to continue with them, jointly with the training of the machine learning algorithms. Processed data will allow to determine the sensitivity of the multispectral images and select bands of interest in maturation.

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Spatiotemporal patterns of chemical attributes in Vitis vinifera L. cv. Cabernet Sauvignon vineyards in Central California

Spatial variability of vine productivity in winegrapes is important to characterise as both yield and quality are relevant for the production of different wine styles and products. The objectives were to understand how patterns of variability of Cabernet Sauvignon fruit composition changed over time and space, how these patterns could be characterised with indirect measurements, and how spatial patterns of the variation in fruit compositional attributes can aid in improving management. Prior to the 2017 vintage, 125 data vines were distributed across each of four vineyards in the Lodi American Viticultural Area (AVA) of California. Each data vine was sampled at commercial harvest in 2017, 2018, and 2019. Yield components and fruit composition were measured at harvest for each data vine, and maps of yield and fruit composition were produced for eight ‘objective measures of fruit quality’: total anthocyanins, polymeric tannins, quercetin glycosides, malic acid, yeast assimilable nitrogen, β-damascenone, C6 alcohols and aldehydes, and 3-isobutyl-2-methoxypyrazine. Patterns of variation in anthocyanins and phenolic compounds were found to be most stable over time. Given this relative stability, management decisions focused on fruit quality could be based on zonal descriptions of anthocyanins or phenolics to increase profitability in some vineyards. In each vineyard, dormant season pruning weights and soil cores were collected at each location, elevation and soil apparent electrical conductivity surveys were completed, and remotely sensed imagery was captured by fixed wing aircraft and two satellite platforms at major phenological stages. The data collected were used to develop relationships among biophysical data, soil, imagery, and fruit composition. The standardised and aggregated samples from four vineyards over three seasons were included in the estimation of ‘common variograms’ to assess how this technique could aid growers in producing geostatistically rigorous maps of fruit composition variability without cumbersome, single season sampling efforts.

The plantation frame as a measure of adaptation to climate change

The mechanization of vineyard work originally led to a reduction in planting densities due to the lack of machinery adapted to the vineyard. The current availability of specific machinery makes it possible to establish higher planting densities. In this work, three planting densities (1.40×0.80 m, 1.80×1 m and 2.20×1.20 m, corresponding to 8928, 5555 and 3787 plants/ha respectively) were studied with four varieties autochthonous of Galicia (northwestern Spain): Albariño and Treixadura (white), Sousón and Mencía (red). The vines were trained in a vertical shoot positioning system using a single Royat cordon, and pruned to spurs with two buds each. Agronomic data (yield, pruning wood weight, Ravaz index) and oenological data in must were collected. The higher planting density (1.40×0.80 m) had no significant effect on grape yield per vine in white varieties, although production per hectare was much higher due to the greater number of plants. In red varieties, this planting density resulted in a significantly lower production per vine, compensated by the greater number of plants. In addition, it significantly reduced the Brix degree in the must of the Albariño, Treixadura and Sousón varieties, and increased the total acidity in the latter two and Mencía. It also caused an increase in extractable and total anthocyanins and IPT in red grapes. The effects of high planting density on grapes are of great interest for the adaptation of varieties in the context of climate change. In the future, it could be advisable to modify the limits imposed by the appellations of origin on the planting density of these varieties in order to obtain more balanced wines.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.