Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

A spatial explicit inventory of EU wine protected designation of origin to support decision making in a changing climate

Winemaking areas recognized as protected designations of origin (PDOs) shape important economic, environmental and cultural values that are tied to closely defined geographic locations. To preserve wine products and wine-growing practices adopted in different PDOs these areas are strictly regulated by legal specifications. However, quality viticulture is increasingly under pressure from climate change, which is altering the local conditions of many winegrowing areas. Therefore, maintaining traditional wine products will require the adoption of tailored adaptation strategies, including possible changes in the legal regulation of protected wines. To this end, it is necessary to have a comprehensive knowledge on PDOs including their extension, products and allowed practices. While there have been efforts to build databases that summarize the characteristics for individual wine PDO areas and to quantify the related effects of climate change, much information is still included only in the official documentation of the EU geographical indication register and has never been collected in a comprehensive manner. With this study we aim at filling this gap by building a spatial inventory of European wine PDOs that supports decision making in viticulture in the context of climate change. To map and characterize European wine PDOs, we analysed their legal documents and extracted relevant information useful for climate change adaptation. The output consists of a comprehensive geographical dataset that identifies the boundaries of all 1200 European wine PDOs at unprecedented spatial resolution and includes a set of legally binding regulations, such as authorized vine varieties, maximum yields and planting density. The inventory will allow researchers to analyse the impacts of climate change on European wine PDOs and support decision makers in developing tailored adaptation strategies. This includes, among others, the evaluation of new vineyard site selection, the expansion of cultivated varieties or the authorization of irrigation in vineyards.

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Inhibition of Oenococcus oeni during alcoholic fermentation by a selected Lactiplantibacillus plantarum strain

The use of selected cultures of the species Lactiplantibacillus plantarum in Oenology has grown in prominence in recent years. While initial applications of this species centred very much around malolactic fermentation (MLF), there is strong evidence to show that certain strains can be harnessed for their bio-protective effects. Unwanted spontaneous MLF during alcoholic fermentation (AF), driven by rogue Oenococcus oeni, is a winemaking deviation that is very difficult to manage when it occurs. This work set out to determine the efficacy of one particular strain of Lactiplantibacillus plantarum(Viniflora® NoVA™ Protect), against this problem in Cabernet Sauvignon must. The work was carried out at commercial scale and in a winery environment and compared the bio-protective culture with the more traditional approach of reducing must pH by the addition of tartaric acid. The combination of both was also investigated. The concentration of both Oenococcus oeni and Lactiplantibacillus plantarum was determined using qPCR. The adventitious Oenococcus oeni showed the most growth during AF in the control wine, whereas in the wines treated with Lactiplantibacillus plantarum a bacteriostatic effect against this species was observed. This effect was comparable to the wines treated with tartaric acid. This has particular commercial relevance for controlling the flora in musts with high pH, or when the addition of tartaric acid is either not permitted or is prohibitive for other reasons.

Heatwaves and grapevine yield in the Douro region, crop model simulations

Heatwaves or extreme heat events can be particularly harmful to agriculture. Grapevines grown in the Douro winemaking region are particularly exposed to this threat, due to the specificities of the already warm and dry climatic conditions. Furthermore, climate change simulations point to an increase in the frequency of occurrence of these extreme heat events, therefore posing a major challenge to winegrowers in the Mediterranean type climates. The current study focuses on the application of the STICS crop model to assess the potential impacts of heatwaves in grapevine yields over the Douro valley winemaking region. For this purpose, STICS was applied to grapevines using high-resolution weather, soil and terrain datasets over the Douro. To assess the impact of heatwaves, the weather dataset (1989-2005) was artificially modified, generating periods with anomalously high temperatures (+5 ºC), at certain onset dates and with specific durations (from 5 to 9 days). The model was run with this modified weather dataset and results were compared to the original unmodified runs. The results show that heatwaves can have a very strong impact on grapevine yields, strongly depending on the onset dates and duration of the heatwaves. The highest negative impacts may result in a decrease in the yield by up to -35% in some regions. Despite some uncertainties inherent to the current modelling assessment, the present study highlights the negative impacts of heatwaves on viticultural yields in the Douro region, which is critical information for stakeholders within the winemaking sector for planning suitable adaptation measures.

Diagnosis of soil quality and evaluation of the impact of viticultural practices on soil biodiversity in a vineyard in southwestern France

Viticulture is facing two major changes – climate change and agroecological transition. In both cases, soil quality is seen as a lever to move towards a more sustainable viticulture. However, soil biological quality is little considered in the implementation of viticultural practices. Gascogn’Innov (2017-2022) is an Operational Group funded by the European Innovation Partnership for Agriculture. As such, it brings together winegrowers from the south-west of France, scientists, advisors and technicians, around a project focused on viticultural soil biological functioning and the design of technical routes more respectful toward soil heritage. To achieve this, the project aims to acquire references on the impact of viticultural practices on soil biology from a dynamic way, and to test a methodology to integrate information provided by the soil bioindicators to manage farming systems. A set of indicators of soil biological quality are evaluated in the project: microorganisms (bacteria and fungi abundance and diversity), fauna (abundance and diversity of nematodes and earthworms), physico-chemical characteristics, soil structure assessment and degradation rate of organic matter. Based on a network of 13 plots that have been subject to an initial diagnosis in 2017, several agronomical practices to restore soil fertility are experimented to redesign the cropping system (for instance plant cover, organic matter inputs, reduction of herbicides, mineral fertilizers). System redesign was made in collaboration by winegrowers and an interdisciplinary group of experts (agronomists, biologists). Several indicators are measured on vine and soil at each vintage to assess vine health and productivity. At the end of the project (2021), a final diagnosis was carried out. Gascogn’Innov allowed to create a regional database on the quality of wine-growing soils, which permitted to evaluate the effect of practices according to soil types. Especially, decreasing the intensity of tillage and increasing the duration and diversity of grass coverage tends to increase the abundance of all the organisms studied. This project confirmed the value of soil biological quality indicators to drive the sustainability of practices, but also highlighted the key-role of expertise, in both agronomy and soil biology, to help winegrowers understand and appropriate their soil quality diagnoses.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

VINIoT – Precision viticulture service

The project VINIoT pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system is based on spectral images and an IoT architecture that allows assessing parameters of interest viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard) will be designed. In France, three subjects were specifically developed: evaluation of maturity, of water stress, and detection of flavescence dorée. For the evaluation of maturity, it has been decided first to work at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the spectral hyperstal image as well as the reference analyzes to measure the maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022), to improve prediction model performance was applied on spectra after the acquirement of hyperspectral images. Regarding the evaluation of water stress, to work with a significant variability in terms of water status, it has been worked first with potted plants under 2 different water regimes. The facilities have allowed the supervision of irrigation and micro-climatic conditions. The regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan has consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyper-spectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy proved the potential towards the discrimination of healthy and infected leaves by flavescence dorée based on the use of hyperspectral images (Mas Garcia et al., 2021).

Updating the Winkler index: An analysis of Cabernet sauvignon in Napa Valley’s varied and changing climate

This study aims to create an updated, agile viticultural climate index (similar to the Winkler Index) by performing in-depth analyses of current and historical data from industry partners in several major winegrowing regions. The Winkler Index was developed in the early twentieth century based on analysis of various grape-growing regions in California. The index uses heat accumulation (i.e. Growing Degree Days) throughout the growing season to determine which grape varieties are best suited to each region. As viticultural regions are increasingly subject to the complexity and uncertainty of a changing climate, a more rigorous, agile model is needed to aid grape growers in determining which cultivars to plant where. For the first phase of this study, 21 industry partners throughout Napa Valley shared historical phenology, harvest, viticultural practice, and weather data related to their Cabernet sauvignon vineyard blocks. To complement this data, berry samples were collected throughout the 2021 growing season from 50 vineyard blocks located throughout 16 American Viticultural Areas that were then analyzed for basic berry chemistry and phenolics. These blocks have been mapped using a Geographic Information System (GIS), enabling analysis of altitude, vineyard row orientation, slope, and remotely sensed climate data. Sampling sites were also chosen based on their proximity to a weather station. By analyzing historical data from industry partners and data specifically collected for this study, it is possible to identify key parameters for further analysis. Initial results indicate extreme variability at a high spatial resolution not currently accounted for in modern viticultural climate indices and suggest that viticultural practices play a major role. Using the structure of data collection and analyses developed for the first phase, this project will soon be expanded to other wine regions globally, while continuing data collection in Napa Valley.

A multidisciplinary approach to evaluate the effects of the training system on the performance of “Aglianico del Vulture” vineyards

Vineyards are complex agro-ecosystems with high spatial and temporal variability. An efficient training system may counteract the adverse effects of this variability. Moreover, considering the climate change issues, choosing an efficient training system that enhances water use and protects the vines from radiative thermal stress has become a priority for the farmers. A multidisciplinary approach that assesses the soil-crop-yield-wine relationships of vineyards in a distributed and holistic way could bring added knowledge on the behavior of the different training systems. This ongoing research aimed to implement a multidisciplinary approach to study the behavior of “Aglianico del Vulture” grapevines trained with two different systems: a spurred cordon (SC) and an “Alberello in parete” (AL), grown in a high-quality wine production area of Basilicata region (Italy). The approach merged several methods and scales of soil, ecophysiology, must/wine quality, and spectral data collection to assess the influence of the training system. Homogeneous zones (HZs) in both training systems were defined through a procedure based on geomorphological classification, unmanned aerial vehicles (UAV) images analysis, and a traditional soil survey supported by geophysical scanning. During the 2021 season, TDR probes monitored soil water content, while grapevine health status was assessed using eco-physiological measurements (LWP, chlorophyll content, PSII photosynthetic efficiency, LAI, and point-based field spectroscopy). These grapevine in-vivo measurements validated the spectral vegetation indexes (NDVI, RENDVI, CVI, and TVI) derived from the UAV multispectral imagery, which monitored the grapevine status in a distributed and non-invasive way. Grape yield, quality of berries, must and wine were measured to assess the effects of the training systems. The first experimental year results showed the variability of the vineyards and revealed relationships among soil parameters, crop characteristics, and vegetation indices of the SC and AL training systems. This multidisciplinary study could bring new insights into the vineyard training system’s effects on grape yield and wine quality.

Climate change impacts: a multi-stress issue

With the aim of producing premium wines, it is admitted that moderate environmental stresses may contribute to the accumulation of compounds of interest in grapes. However the ongoing climate change, with the appearance of more limiting conditions of production is a major concern for the wine industry economic. Will it be possible to maintain the vineyards in place, to preserve the current grape varieties and how should we anticipate the adaptation measures to ensure the sustainability of vineyards? In this context, the question of the responses and adaptation of grapevine to abiotic stresses becomes a major scientific issue to tackle. An abiotic stress can be defined as the effect of a specific factor of the physico-chemical environment of the plants (temperature, availability of water and minerals, light, etc.) which reduces growth, and for a crop such as the vine, the yield, the composition of the fruits and the sustainability of the plants. Water stress is in many minds, but a systemic vision is essential for at least two reasons. The first reason is that in natural environments, a single factor is rarely limiting, and plants have to deal with a combination of constraints, as for example heat and drought, both in time and at a given time. The second reason is that plants, including grapevine, have central mechanisms of stress responses, as redox regulatory pathways, that play an important role in adaptation and survival. Here we will review the most recent studies dealing with this issue to provide a better understanding of the grapevine responses to a combination of environmental constraints and of the underlying regulatory pathways, which may be very helpful to design more adapted solutions to cope with climate change.

First step in the preparation of a soil map of the Protected Designation of Origin Valdepeñas (Central, Spain)

This work is a first step to make a map of vineyard soils. The characterization of the soils of the Protected Designation of Origin (D.P.O.) Valdepeñas will allow to group the studied profiles according to their physico-chemical characteristics and the concentrations of most relevant chemical elements. 90 soil profiles were analysed throughout the territory and the soils were sampled and described according to FAO (2006) and classified according to and Soil Taxonomy (2014). All samples were air dried, sieved and some physico-chemical parameters were determined following standard protocols. Also, major and trace elements were analysed by X-ray fluorescence. The statistically study was made using the SPSS program. Trend maps were made using the ArcGIS program. The studied soils have the following average properties: pH, 8.3; electrical conductivity, 0,20 dS/m (low); clay, 18.8% (medium) and CaCO3, 17.1% (high). In the study for the major elements. The major elements of these soils are Si, followed by Ca and Al, with an average content of 203.7 g/kg, 105.5 g/kg and 74.0 g/kg respectively. On the other hand, 27 trace elements have been studied. Of all of them, it can be highlighted the average values of Ba (361.8 mg/kg), Sr (129.3 mg/kg), Rb (83.4 mg/kg), V (74.2 mg/kg) and Ce (70.6 mg/kg). Ba, V and Ce values are higher and the values of Sr and Rb are lower to those found in the literature. The discriminant analysis shows a percentage of grouping of 91%. The content of chemical elements together with the physico-chemical characteristics allows grouping the soils in 4 group according to their order in the classification to Soil Taxonomy; due to the importance of the Calcisols in Castilla-La Mancha, it has been decided to establish them as their own group even if they do not appear in Soil Taxonomy classification.