Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

Assessment of climate change impacts on water needs and growing cycle on grapevine in three DOs of NE Spain

This study assessed the suitability of grapevine growing in three DOs (Empordà, Pla de Bages and Penedès) of Catalonia (NE Spain) over the 21st century. For this purpose, an estimation of water needs and agroclimatic and phenological indicators was made. Climate change impacts were estimated at 1 km pixel resolution using temperature and precipitation projections from several general circulation models (GCM) and two climate change scenarios: RCP 4.5 (stabilization scenario) and RCP 8.5 (worst-case scenario). Potential crop evapotranspiration (following FAO procedure) and a daily water balance considering soil water holding capacity were used to estimate actual evapotranspiration of vines and, finally, water needs. Dynamics would be similar in the three DOs studied although the magnitude of impact differs. Water needs would be 2 and 3 times greater (ranging from 0 to more than 1500 m3/ha) than current water needs at both climate change scenarios. Moreover, blooming date would advance from 3 to 6 weeks, harvest date from 1 to 2.5 months, resulting in growing cycles from 10 to 80 days shorter. It should also be noted that frost risk would decrease from 6 to 76%, the number of days with temperatures above 30ºC during ripening would rise from 48 to 500% and tropical nights (minimum temperature >20ºC) at ripening would increase from 28 to 150%, depending on the scenario and the DOs. The impacts of climate change in the three DOs could result in significant limitations for grapevine cultivation and wine production if adaptive strategies are not applied. This result could serve as a basis for the design of specific and particular adaptation strategies to improve and maintain vineyards in the DOs studied and could be extrapolated to similar DOs and regions.

20-Year-Old data set: scion x rootstock x climate, relationships. Effects on phenology and sugar dynamics

Global warming is one of the biggest environmental, social, and economic threats. In the Douro Valley, change to the climate are expected in the coming years, namely an increase in average temperature and a decrease in annual precipitation. Since vine cultivation is extremely vulnerable and influenced by the climate, these changes are likely to have negative effects on the production and quality of wine.
Adaptation is a major challenge facing the viticulture sector where the choice of plant material plays an important role, particularly the rootstock as it is a driver for adaptation with a wide range of effects, the most important being phylloxera, nematode and salt, tolerance to drought and a complex set of interactions in the grafted plant.
In an experimental vineyard, established in the Douro Region in 1997, with four randomized blocs, with five varieties, Touriga Nacional, Tinta Barroca, Touriga Franca and Tinta Roriz, grafted in four rootstocks, Rupestris du Lot, R110, 196-17C, R99 and 1103P, data was collected consecutively over 20 years (2001-2020). Phenological observations were made two to three times a week, following established criteria, to determine the average dates of budbreak, flowering and veraison. During maturation, weekly berry samples were taken to study the dynamics of sugar accumulation, amongst other parameters. Climate data was collected from a weather station located near the vineyard parcel, with data classified through several climatic indices.
The results achieved show a very low coefficient of variations in the average date of the phenophases and an important contribution from the rootstock in the dynamic of the phenology, allowing a delay in the cycle of up to10-12 days for the different combinations. The Principal Component Analysis performed, evaluating trends in the physical-chemical parameters, highlighted the effect of the climate and rootstock on fruit quality by grape varieties.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Leaf vine content in nutrients and trace elements in La Mancha (Spain) soils: influence of the rootstock

The use of rootstock of American origin has been the classic method of fighting against Phylloxera for more than 100 years. For this reason, it is interesting to establish if different rootstock modifies nutrient composition as well as trace elements content that could be important for determining the traceability of the vine products. A survey of four classic rootstocks (110-Richter, SO4, FERCAL and 1103-Paulsen) and four new ones (M1, M2, M3 and M4) provided by Agromillora Iberia. S.L.U., all of them grafted with the Tempranillo variety, has been carried out during 2019. The eight rootstocks were planted in pots of 500 cc, on three soils with very different characteristics from Castilla-La Mancha (Spain). In the month of July, the leaves were collected and dried in a forced air oven for seven days at 40ºC. Then, the samples were prepared for the analysis determination, carried out by X-Ray fluorescence spectrometry. The results obtained showed that in the case of content in mineral elements in leaf, separated by soil type, we can report the importance of few elements such as Si, Fe, Pb and, especially, Sr. The rootstock does not influence the composition of the vine leaf for the studied elements that are the most important in determining the geochemical footprint of the soil. The influence of the soil can be discriminated according to some elements such as Fe, Pb, Si and, especially, Sr.

Grapevine xylem embolism resistance spectrum reveals which varieties have a lower mortality risk in a future dry climate

Wine growing regions have recently faced intense and frequent droughts that have led to substantial economical losses, and the maintenance of grapevine productivity under warmer and drier climate will rely notably on planting drought-resistant cultivars. Given that plant growth and yield depend on water transport efficiency and maintenance of photosynthesis, thus on the preservation of the vascular system integrity during drought, a better understanding of drought-related hydraulic traits that have a significant impact on physiological processes is urgently needed. We have worked towards this end by assessing vulnerability to xylem embolism in 30 grapevine commercial varieties encompassing red and white Vitis vinifera varieties, hybrid varieties characterized by a polygenic resistance for powdery and downy mildew, and commonly used rootstocks. These analyses further allowed a global assessment of wine regions with respect to their varietal diversity and resulting vulnerability to stem embolism. Hybrid cultivars displayed the highest vulnerability to embolism, while rootstocks showed the greatest resistance. Significant variability also arose among Vitis vinifera varieties, with Ψ12 and Ψ50 values ranging from -0.4 to -2.7 MPa and from -1.8 to -3.4 MPa, respectively. Cabernet franc, Chardonnay and Ugni blanc featured among the most vulnerable varieties while Pinot noir, Merlot and Cabernet Sauvignon ranked among the most resistant. In consequence, wine regions bearing a significant proportion of vulnerable varieties, such as Poitou-Charentes, France and Marlborough, New Zealand, turned out to be at greater risk under drought. These results highlight that grapevine varieties may not respond equally to warmer and drier conditions, outlining the importance to consider hydraulic traits associated with plant drought tolerance into breeding programmes and modeling simulations of grapevine yield maintenance under severe drought. They finally represent a step forward to advise the wine industry about which varieties and regions would have the lowest risk of drought-induced mortality under climate change.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

Towards a regional mapping of vine water status based on crowdsourcing observations

Monitoring vine water status is a major challenge for vineyard management because it influences both yield and harvest quality. It is also a challenge at the territorial scale for identifying periods of high water restriction or zones regularly impacted by water stress. This information is of major importance for defining collective strategies, anticipating harvest logistic or applying for irrigation authorisation. At this spatial scale, existing tools and methods for monitoring vine water status are few and often require strong assumptions (e.g. water balance model). This paper proposes to consider a collaborative collection of observations by winegrowers and wine industry stakeholders (crowdsourcing) as an interesting alternative. Indeed, it allows the collection of a large number of field observations while pooling the collection effort. However, the feasibility of such a project and its interest in monitoring vine water status at regional scale has never been tested.

The objective of this article is to explore the possibility of making a regional map of vine water status based on crowdsourcing observations. It is based on the study of the free mobile application ApeX-Vigne, which allows the collection of observations about vine shoot growth. This information is easy to collect and can be considered, under certain conditions, as a proxy for vine water status. This article presents the first results obtained from the nearly 18,000 observations collected by winegrowers and wine industry stakeholders during 2019, 2020 and 2021 seasons. It presents the vine shoot growth maps obtained at regional scale and their evolution over the three vintages studied. It also proposes an analysis of the factors that favoured the number of observations collected and those that favoured their quality. These results open up new perspectives for monitoring vine water status at a regional scale but above they provide references for other crowdsourcing projects in viticulture.

Effect of vigour and number of clusters on eonological parameters and metabolic profile of Cabernet Sauvignon red wines

Vegetative growth and yield are reported to affect grape and wine quality. They can be controlled through different techniques linked to vine management. The objective of this research was to determine the effect of vine vigour and number of clusters per vine on physicochemical composition and phenolic profile of red wines. The experiment was carried out during two vegetative cycles, with cv. Cabernet Sauvignon grafted onto Paulsen 1103. Three vine vigour were defined, according to shoot weight at previous harvests, being low, medium and high. Five treatments of number of clusters were used for each vigour, with 15, 22, 29, 36, and 45 clusters per vine. Grapes from all treatments were harvested in the same day from Brix and total acidity criteria. Thirty days after bottling, classical analyzes and phenolic compounds were performed. As results, different responses were obtained from each vintage. In 2020, a dry season from veraison to harvest, grapes and wines obtained from low vigour treatment and 45 clusters per vine was the highest in sugar and alcohol content respectively, while grapes and wines from high vigour and 15 clusters presented the lowest sugar and alcohol content. Total anthocyanins were higher in treatment with low vigour and 15 clusters, while the lowest amounts were found in low vigour with 45 clusters, as well as medium and high vigour with 36 clusters per vine. Total tannins were higher in high vigour with 22 clusters and medium vigour with 29 clusters, while were lower in low vigour with 36 clusters. In 2021, a wet season at harvest, responses were different, and great variations were observed between treatments. As conclusions, yield and vine vigour had strong influence on grape and wine quality, promoting different enological potentials on which can be indicated/used for aging strategies of red and even rosé wines.

Simulating climate change impact on viticultural systems in historical and emergent vineyards

Global climate change affects regional climates and hold implications for wine growing regions worldwide. Although winegrowers are constantly adapting to internal and external factors, it seems relevant to develop tools, which will allow them to better define actual and future agro-climatic potentials. Within this context, we develop a modelling approach, able to simulate the impact of environmental conditions and constraints on vine behaviour and to highlight potential adaptation strategies according to different climate change scenarios. Our modeling approach, named SEVE (Simulating Environmental impacts on Viticultural Ecosystems), provides a generic modeling framework for simulating grapevine growth and berry ripening under different conditions and constraints (slope, aspect, soil type, climate variability…) as well as production strategies and adaptation rules according to climate change scenarios. Each activity is represented by an autonomous agent able to react and adapt its reaction to the variability of environmental constraints. Using this model, we have recently analyzed the evolution of vineyards’ exposure to climatic risks (frost, pathogen risk, heat wave) and the adaptation strategies potentially implemented by the winegrowers. This approach, implemented for two climate change scenarios, has been initiated in France on traditional (Loire Valley) and emerging (Brittany) vineyards. The objective is to identify the time horizons of adaptations and new opportunities in these two regions. Carried out in collaboration with wine growers, this approach aims to better understand the variability of climate change impacts at local scale in the medium and long term.

The combined effects of climate, soils, and deficit irrigation on yield and quality of Touriga Nacional under high atmospheric demand in the Douro Region

Global warming is one of the biggest environmental, social and economic threats in several viticultural regions. In the Douro Valley, changes are expected in the coming years, namely an increase in temperature and a decrease in precipitation. These changes are likely to have consequences for the production and quality of wine.
The aim of this study was to explore the effects of different soil characteristics combined with several deficit irrigation strategies, managed throughout ETc references and predawn leaf water potentials thresholds, on physiology, yield, and qualitative attributes on the Touriga Nacional variety under years of mild to severe water and heat stress.
The studies were conducted over seven years (2015 to 2021) in two plots of a commercial vineyard located at Quinta do Ataíde (Symington Family Estates) planted in 2011 and 2014 at 170 meters elevation, growing under three water regimes: non-irrigated (NI) and two deficit irrigation strategies (30% and 60% ETc) assessed weekly by Ψpd. The site has an annual rainfall below 500 mm, with high atmospheric demand. Climate data was collected from a weather station, located on site. Berry ripening was followed weekly for fruit analysis. At harvest, yield, vigour and pruning weight per vine were determined from 90 vines by treatment. Each season at veraison the NDVI Index was accessed by a drone. The soils physic-chemistry in the experimental blocs were analysed and grouped by SWHC. Delta C-13 analyses were also performed per treatment in two years.Irrigation had a positive effect on yield per vine, mostly due to an increase in berry and cluster weight, and fertility index through the years. A significant increase in sugar content, colour and phenols was observed with deficit irrigation in some years, but vine vigour related to soil characteristics had by far the greatest impact on quality.

‘Cabernet Sauvignon’ (Vitis vinifera L.) berry skin flavonol and anthocyanin composition is affected by trellis systems and applied water amounts

Trellis systems are selected in wine grape vineyards to mainly maximize vineyard yield and maintain berry quality. This study was conducted in 2020 and 2021 to evaluate six commonly utilized trellis systems including a vertical shoot positioning (VSP), two relaxed VSPs (VSP60 and VSP80), a single high wire (SH), a high quadrilateral (HQ), and a guyot (GY), combined with three levels of irrigation regimes based on different crop evapotranspiration (ETc) replacements, including a 25% ETc, 50% ETc, and 100% ETc. The results indicated SH yielded the most fruits and accumulated the most total soluble solids (TSS) at harvest in 2020, however, it showed the lowest TSS in the second season. In 2020, SH and HQ showed higher concentrations in most of the anthocyanin derivatives compared to the VSPs. Similar comparisons were noticed in 2021 as well. SH and HQ also accumulated more flavonols in both years compared to other trellis systems. Overall, this study provides information on the efficacy of trellis systems on grapevine yield and berry flavonoid accumulation in a currently warming climate.