Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

Impact of long term agroecological and conventional practices on subsurface soil microbiota in Macabeu and Xarel·lo vineyards

There is a growing trend on the transition from conventional to agroecological management of vineyards. However, the impact of practices, such as reduced-tillage, organic fertilization and cover crops, is not well-understood regarding the soil microbial diversity, and its relationship with the soil physicochemical properties in the subsurface depth near the rooting zone. Soil bacterial diversity is an important contributor towards plant health, productivity and response to environmental stresses. A field experiment was conducted by sampling subsurface soil bacterial community (NGS and qPCR) near to the root zone of Macabeu and Xarel·lo vineyards, located at the Penedes. 3 organic (ECO) and 3 conventional (CON) vineyards, with more than 10 years of respective management were sampled (n=5 each plot). ECO practices did not affect bacterial and fungal abundance but increased significantly the ammonium oxidizing bacteria and alpha-diversity (Inv.Simpson). Interestingly beta-diversity was significantly affected by the management strategy. ANOSIM-tests revealed a significative effect of the management (ecological vs conventional) and plot, on the soil microbial structure (ASV abundance). Main phyla depicted were Proteobacteria, Actinobacteria and Acidobacteria, whose relative abundances were not affected by the management. EdgeR assay revealed a significant increase of Cyanobacteria and decrease of Gemmatimonadetes and Firmicutes phyla in ECO. Interestingly, the grapevine variety was not correlated with the soil microbial community structure. Mantel-test revealed an important correlation (Spearman) of some physicochemical parameters with the soil microbiota structure, in order of importance: texture, EC, pH Ca/Mg, Mg/P, K+, Mg2+, Ca2+, SO42-, and OM. N-NH4 and NTK, which were higher in the ECO managed soils, did not correlated significantly with the soil microbiome population. The results revealed the importance of combining a deep physicochemical characterization of each replicate with the microbial diversity assessment to gain better insights on the relationship between soil microbiome and vineyard management.

Adaptability of grapevines to climate change: characterization of phenology and sugar accumulation of 50 varieties, under hot climate conditions

Climate is the major factor influencing the dynamics of the vegetative cycle and can determine the timing of phenological periods. Knowledge of the phenology of varieties, their chronological duration, and thermal requirements, allows not only for the better management of interventions in the vineyard, but also to predict the varieties’ behaviour in a scenario of climate change, giving the wine producer the possibility of selecting the grape varieties that are best adapted to the climatic conditions of a certain terroir. In 2014, Symington Family Estates, Vinhos, established two grape variety libraries in two different places with distinctive climate conditions (Douro Superior, and Cima Corgo), with the commitment of contributing to a deeper agronomic and oenological understanding of some grape varieties, in hot climate conditions. In these research vineyards are represented local varieties that are important in the regional and national viticulture, but also others that have over time been forgotten — as well as five international reference cultivars. From 2017 to 2021, phenological observations have been made three times a week, following a defined protocol, to determine the average dates of budbreak, flowering and veraison. With the climate data of each location, the thermal requirements of each variety and the chronological duration of each phase have been calculated. During maturation, berry samples have been gathered weekly to study the dynamics of sugar accumulation, between other parameters. The data was analysed applying phenological and sugar accumulation models available in literature. The results obtained show significant differences between the varieties over several parameters, from the chronological duration and thermal requirements to complete the various stages of development, to the differences between the two locations, confirming the influence of the climate on phenology and the stages of maturation, in these specific conditions.

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Phenolic composition of Tempranillo Blanco grapes changes after foliar application of urea

Our research aimed to determine the effect and efficiency of foliar application of urea on the phenolic composition of Tempranillo Blanco grapes. The field experiment was carried out in 2019 and 2020 seasons and the plot was located in D.O.Ca Rioja (North of Spain). The vineyard was Vitis vinifera L. Tempranillo Blanco and grafted on Richter-110 rootstock. The treatments were control (C), whose plants were sprayed with water and three doses of urea: plants were sprayed with urea 3 kg N/ha (U3), 6 kg N/ha (U6) and 9 kg N/ha (U9). The applications were performed in two phenological stages, pre-veraison (Pre) and veraison (Ver). Also, each of the treatments was repeated one week later. Control and treatments were performed in triplicate and arranged in a randomised block design. Grapes were harvested at optimum ripening stage. High-performance liquid chromatography was used to analyse the phenolic composition of the grapes. Finally, the results obtained from the analytical determinations – flavonols, flavanols and non-flavonoid (hydroxybenzoic acids, hydroxycinnamic acids and stilbenes) – were studied statistically by analysis of variance. The results showed that, in 2019, U6-Pre and U9-Pre treatments increased the hydroxybenzoic acid content in grapes, and also all foliar treatments applied at Pre enhanced the stilbene concentration. Moreover, U3-Ver was the only treatment that rose flavonol and stilbene contents in the Tempranillo Blanco grapes. In 2020, all treatments applied at Pre enhanced the flavonol concentration in grapes. Furthermore, U3-Pre and U9-Pre treatments increased stilbene content in grapes. Nevertheless, the hydroxybenzoic acid content was improved by U6-Ver and U9-Ver and besides, hydroxycinnamic acid concentration in grapes was increased by all treatments applied at Ver. In conclusion, the lower and highest dose of urea (U3 and U9), applied at pre-veraison, were the best treatments to improve the Tempranillo Blanco grape phenolic composition.

VINIoT – Precision viticulture service

The project VINIoT pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system is based on spectral images and an IoT architecture that allows assessing parameters of interest viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard) will be designed. In France, three subjects were specifically developed: evaluation of maturity, of water stress, and detection of flavescence dorée. For the evaluation of maturity, it has been decided first to work at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the spectral hyperstal image as well as the reference analyzes to measure the maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022), to improve prediction model performance was applied on spectra after the acquirement of hyperspectral images. Regarding the evaluation of water stress, to work with a significant variability in terms of water status, it has been worked first with potted plants under 2 different water regimes. The facilities have allowed the supervision of irrigation and micro-climatic conditions. The regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan has consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyper-spectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy proved the potential towards the discrimination of healthy and infected leaves by flavescence dorée based on the use of hyperspectral images (Mas Garcia et al., 2021).

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

Grapevine sugar concentration model in the Douro Superior, Portugal

Increasingly warm and dry climate conditions are challenging the viticulture and winemaking sector. Digital technologies and crop modelling bear the promise to provide practical answers to those challenges. As viticultural activities strongly depend on harvest date, its early prediction is particularly important, since the success of winemaking practices largely depends upon this key event, which should be based on an accurate and advanced plan of the annual cycle. Herein, we demonstrate the creation of modelling tools to assess grape ripeness, through sugar concentration monitoring. The study area, the Portuguese Côa valley wine region, represents an important terroir in the “Douro Superior” subregion. Two varieties (cv. Touriga Nacional and Touriga Franca) grown in five locations across the Côa Region were considered. Sugar accumulation in grapes, with concentrations between 170 and 230 g l-1, was used from 2014 to 2020 as an indicator of technological maturity conditioned by meteorological factors. The climatic time series were retrieved from the EU Copernicus Service, while sugar data were collected by a non-profit organization, ADVID, and by Sogrape, a leading wine company. The software for calibrating and validating this model framework was the Phenology Modeling Platform (PMP), version 5.5, using Sigmoid and growing degree-day (GDD) models for predictions. The performance was assessed through two metrics: Roots Mean Square Error (RMSE) and efficiency coefficient (EFF), while validation was undertaken using leave-one-out cross-validation. Our findings demonstrate that sugar content is mainly dependent on temperature and air humidity. The models achieved a performance of 0.65

The concept of terroir: what place for microbiota?

Microbes play key roles on crop nutrient availability via biogeochemical cycles, rhizosphere interactions with roots as well as on plant growth and health. Recent advances in technologies, such as High Throughput Sequencing Techniques, allowed to gain deeper insight on the structure of bacterial and fungal communities associated with soil, rhizosphere and plant phyllosphere. Over the past 10 years, numerous scientific studies have been carried out on the microbial component of the vineyard. Whether the soil or grape compartments have been taken into account, many studies agree on the evidence of regional delineations of microbial communities, that may contribute to regional wine characteristics and typicity. Some authors proposed the term “microbial terroir” including “yeast terroir” for grapes to describe the connection between microbial biogeography and regional wine characteristics. Many factors are involved in terroir including climate, soil, cultivar and human practices as well as their interactions. Studies considering “microbial terroir” greatly contributed to improve our knowledge on factors that shape the vineyard microbial structure and diversity. However, the potential impact of “microbial terroir” on wine composition has yet not received strong scientific evidence and many questions remain to be addressed, related to the functional characterization of the microbial community and its impact on plant physiology and grape composition, the origins and interannual stability of vineyard microbiota, as well as their impact on wine sensorial attributes. The presentation will give an overview on the role of microbiota as a terroir component and will highlight future perspectives and challenges on this key subject for the wine industry.

VINIoT – Precision viticulture service

The project VINIoT pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system is based on spectral images and an IoT architecture that allows assessing parameters of interest viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard) will be designed. In France, three subjects were specifically developed: evaluation of maturity, of water stress, and detection of flavescence dorée. For the evaluation of maturity, it has been decided first to work at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the spectral hyperstal image as well as the reference analyzes to measure the maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022), to improve prediction model performance was applied on spectra after the acquirement of hyperspectral images. Regarding the evaluation of water stress, to work with a significant variability in terms of water status, it has been worked first with potted plants under 2 different water regimes. The facilities have allowed the supervision of irrigation and micro-climatic conditions. The regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan has consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyper-spectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy proved the potential towards the discrimination of healthy and infected leaves by flavescence dorée based on the use of hyperspectral images (Mas Garcia et al., 2021).

The combined effects of climate, soils, and deficit irrigation on yield and quality of Touriga Nacional under high atmospheric demand in the Douro Region

Global warming is one of the biggest environmental, social and economic threats in several viticultural regions. In the Douro Valley, changes are expected in the coming years, namely an increase in temperature and a decrease in precipitation. These changes are likely to have consequences for the production and quality of wine.
The aim of this study was to explore the effects of different soil characteristics combined with several deficit irrigation strategies, managed throughout ETc references and predawn leaf water potentials thresholds, on physiology, yield, and qualitative attributes on the Touriga Nacional variety under years of mild to severe water and heat stress.
The studies were conducted over seven years (2015 to 2021) in two plots of a commercial vineyard located at Quinta do Ataíde (Symington Family Estates) planted in 2011 and 2014 at 170 meters elevation, growing under three water regimes: non-irrigated (NI) and two deficit irrigation strategies (30% and 60% ETc) assessed weekly by Ψpd. The site has an annual rainfall below 500 mm, with high atmospheric demand. Climate data was collected from a weather station, located on site. Berry ripening was followed weekly for fruit analysis. At harvest, yield, vigour and pruning weight per vine were determined from 90 vines by treatment. Each season at veraison the NDVI Index was accessed by a drone. The soils physic-chemistry in the experimental blocs were analysed and grouped by SWHC. Delta C-13 analyses were also performed per treatment in two years.Irrigation had a positive effect on yield per vine, mostly due to an increase in berry and cluster weight, and fertility index through the years. A significant increase in sugar content, colour and phenols was observed with deficit irrigation in some years, but vine vigour related to soil characteristics had by far the greatest impact on quality.

Impact of geographical location on the phenolic profile of minority varieties grown in Spain. II: red grapevines

Because terroir and cultivar are drivers of wine quality, is essential to investigate theirs effects on polyphenolic profile before promoting the implantation of a red minority variety in a specific area. This work, included in MINORVIN project, focuses in the polyphenolic profile of 7 red grapevines minority varieties of Vitis vinifera L. (Morate, Sanguina, Santafe, Terriza Tinta Jeromo Tortozona Tinta) and Tempranillo) from six typical viticulture Spanish areas: Aragón (A1), Cataluña (A2), Castilla la Mancha (A3), Castilla –León (A4), Madrid (A5) and Navarra (A6) of 2020 season. Polyphenolic substances were extracted from grapes. 35 compounds were identified and quantified (mg subtance/kg fresh berry) by HPLC and grouped in anthocyanins (ANT) flavanols (FLAVA), flavonols (FLAVO), hydroxycinnamic (AH), benzoic (BA) acids and stilbenes (ST). Antioxidant activity (AA, mmol TE /g fresh berry) was determined by DPPH method. The results were submitted to a two-way ANOVA to investigate the influence of variety, area and their interaction for each polyphenolic family and cluster analysis was used to construct hierarchical dendrograms, searching the natural groupings among the samples. Sanguina (A3) had the most of total polyphenols while Tempranillo (A5) those of ANT. Sanguina (A2) and (A3) reached the highest values of FLAVO, FLAVA and AA. These two last samples had also the maximum of AA. The effect cultivar and area were significant for all polyphenolic families analyzed. A high variability due to variety (>50%) was observed in FLAVA and the maximum value of variability due to growing area was detected in AA (86.41%), ANT and FLAVO (51%); the interaction variety*zone was significant only for ANT, FLAVO, EST and AA. Finally, dendrograms presented five cluster: i) Sanguina (A2); ii) Sanguina (A3); iii) Tempranillo (A5); iv) Tempranillo (A3); Terriza (A3,A5), Morate (A5,A6); v) Santafé (A1,A6); Tortozona tinta (A1,A3,A6); Tinta Jeromo (A3,A4).