Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

Towards a regional mapping of vine water status based on crowdsourcing observations

Monitoring vine water status is a major challenge for vineyard management because it influences both yield and harvest quality. It is also a challenge at the territorial scale for identifying periods of high water restriction or zones regularly impacted by water stress. This information is of major importance for defining collective strategies, anticipating harvest logistic or applying for irrigation authorisation. At this spatial scale, existing tools and methods for monitoring vine water status are few and often require strong assumptions (e.g. water balance model). This paper proposes to consider a collaborative collection of observations by winegrowers and wine industry stakeholders (crowdsourcing) as an interesting alternative. Indeed, it allows the collection of a large number of field observations while pooling the collection effort. However, the feasibility of such a project and its interest in monitoring vine water status at regional scale has never been tested.

The objective of this article is to explore the possibility of making a regional map of vine water status based on crowdsourcing observations. It is based on the study of the free mobile application ApeX-Vigne, which allows the collection of observations about vine shoot growth. This information is easy to collect and can be considered, under certain conditions, as a proxy for vine water status. This article presents the first results obtained from the nearly 18,000 observations collected by winegrowers and wine industry stakeholders during 2019, 2020 and 2021 seasons. It presents the vine shoot growth maps obtained at regional scale and their evolution over the three vintages studied. It also proposes an analysis of the factors that favoured the number of observations collected and those that favoured their quality. These results open up new perspectives for monitoring vine water status at a regional scale but above they provide references for other crowdsourcing projects in viticulture.

Climate change impacts on Douro Region viticulture and adaptation measures

Climate has a significant impact in the success of any agricultural system, with a direct influence on the crops suitability to a given region, interfering on yield and quality and also with the economic sustainability of the productive activity. In the Douro Demarcated Region (RDD), as in most regions of the Mediterranean climate, the scarce precipitation (33% has less than 600 mm per year), and your high variability, associated with high rates of evapotranspiration during the summer, is usually one of the fundamental factors that limit the grapevine development, as well as the production and quality of the harvest. Thus, facing the scenario in temperature changes for the next decades (1.5-2.5°C) and confirming the predictions of precipitation decreases and/or great variability in the occurrence of heat waves and intense rainfall, the consequences for slope stability in mountain viticulture and sustainability of all operations involved, are risks to be taken into account. In this way, a deepest and sustained knowledge regarding the adaptation measures to adverse environmental conditions is of a crucial importance, enabling a more efficient adaptation of plant growth conditions and the optimization of production and quality of the grapevines. The development of this work, carried out in two commercial vineyards, one located in Soutelo do Douro, São João da Pesqueira, Cima Corgo sub-region, and another located in Numão, Vila Nova de Foz Côa, Douro Superior sub-region, it seeks to establish a relationship between climatic elements and physiological, productive and qualitative parameters, as well as to evaluate the effectiveness of adaptation measures, including different types of deficit irrigation (2002-2019) and the application of shading nets (2019-2020) in the physiological, viticultural and oenological behavior in the Touriga Nacional and Moscatel Galego Branco varieties, respectively. The results showed that the application of deficit irrigation allowed to significantly reduce the impact of the adverse weather conditions at key moments in the development of the grapevine, particularly in the period immediately before veráison and maturation, reducing the negative effects on the physiological processes and productivity, without compromise the must quality parameters. On the other hand, the application of shading nets significantly reduced de leaves temperature, allowing to increase the water potential, stomatal conductance and photosynthetic rate of grapes, which was reflected in the yield increase in the 2nd year of the study. For the maturation indicators, higher levels of total acidity, malic acid and assimilable nitrogen were obtained. The last measure presents a huge potential, being essential to carry out more years of trials to obtain stronger conclusions in terms of production parameters, but also in characteristics as important as the grape ripening components and the organoleptic characteristics of wines.

Simulating climate change impact on viticultural systems in historical and emergent vineyards

Global climate change affects regional climates and hold implications for wine growing regions worldwide. Although winegrowers are constantly adapting to internal and external factors, it seems relevant to develop tools, which will allow them to better define actual and future agro-climatic potentials. Within this context, we develop a modelling approach, able to simulate the impact of environmental conditions and constraints on vine behaviour and to highlight potential adaptation strategies according to different climate change scenarios. Our modeling approach, named SEVE (Simulating Environmental impacts on Viticultural Ecosystems), provides a generic modeling framework for simulating grapevine growth and berry ripening under different conditions and constraints (slope, aspect, soil type, climate variability…) as well as production strategies and adaptation rules according to climate change scenarios. Each activity is represented by an autonomous agent able to react and adapt its reaction to the variability of environmental constraints. Using this model, we have recently analyzed the evolution of vineyards’ exposure to climatic risks (frost, pathogen risk, heat wave) and the adaptation strategies potentially implemented by the winegrowers. This approach, implemented for two climate change scenarios, has been initiated in France on traditional (Loire Valley) and emerging (Brittany) vineyards. The objective is to identify the time horizons of adaptations and new opportunities in these two regions. Carried out in collaboration with wine growers, this approach aims to better understand the variability of climate change impacts at local scale in the medium and long term.

Drought effect on aromatic and phenolic potential of seven recovered grapevine varieties in Castilla-La Mancha region (Spain)

The effects of climate change are seriously affecting the quality of wine grapes. High temperatures and drought cause imbalances in the chemical composition of grapes. The result is overripe grapes with low acidity and high sugar content, which produce wines with excessive alcohol content, lacking in freshness and not very aromatic. As a consequence, the search of varieties with capacity of produce quality grapes in adverse climate conditions is a good alternative to preserve the sustainability of vineyards. In this work, quality parameters of seven Vitis vinifera L. cultivars (five whites and two reds) recently recovered from extinction and grown under two different hydric regimes (rainfed and irrigated) were analyzed during the 2020 vintage. At harvest time, weight of 100 berries, must physicochemical parameters (brix degree, total acidity, malic acid, pH), and carbon and oxygen isotope ratios (δ13C, δ18O) were determined. Subsequently, varietal aroma potential index (IPAv) and total polyphenol index (TPI) were analyzed. Quality parameters, IPAv and TPI, showed significant differences between varieties and water regimes. Both red varieties, Moribel and Tinto Fragoso, stood out for their high aromatic and phenolic potential, which was higher under rainfed regime. Regarding to white varieties, Montonera del Casar and Jarrosuelto stood out in terms of varietal aroma potential. Montonera del Casar high acidity in its musts and Jarrosuelto showed the highest berry weights.

Impact of long term agroecological and conventional practices on subsurface soil microbiota in Macabeu and Xarel·lo vineyards

There is a growing trend on the transition from conventional to agroecological management of vineyards. However, the impact of practices, such as reduced-tillage, organic fertilization and cover crops, is not well-understood regarding the soil microbial diversity, and its relationship with the soil physicochemical properties in the subsurface depth near the rooting zone. Soil bacterial diversity is an important contributor towards plant health, productivity and response to environmental stresses. A field experiment was conducted by sampling subsurface soil bacterial community (NGS and qPCR) near to the root zone of Macabeu and Xarel·lo vineyards, located at the Penedes. 3 organic (ECO) and 3 conventional (CON) vineyards, with more than 10 years of respective management were sampled (n=5 each plot). ECO practices did not affect bacterial and fungal abundance but increased significantly the ammonium oxidizing bacteria and alpha-diversity (Inv.Simpson). Interestingly beta-diversity was significantly affected by the management strategy. ANOSIM-tests revealed a significative effect of the management (ecological vs conventional) and plot, on the soil microbial structure (ASV abundance). Main phyla depicted were Proteobacteria, Actinobacteria and Acidobacteria, whose relative abundances were not affected by the management. EdgeR assay revealed a significant increase of Cyanobacteria and decrease of Gemmatimonadetes and Firmicutes phyla in ECO. Interestingly, the grapevine variety was not correlated with the soil microbial community structure. Mantel-test revealed an important correlation (Spearman) of some physicochemical parameters with the soil microbiota structure, in order of importance: texture, EC, pH Ca/Mg, Mg/P, K+, Mg2+, Ca2+, SO42-, and OM. N-NH4 and NTK, which were higher in the ECO managed soils, did not correlated significantly with the soil microbiome population. The results revealed the importance of combining a deep physicochemical characterization of each replicate with the microbial diversity assessment to gain better insights on the relationship between soil microbiome and vineyard management.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

Characterization of variety-specific changes in bulk stomatal conductance in response to changes in atmospheric demand and drought stress

In wine growing regions around the world, climate change has the potential to affect vine transpiration and overall vineyard water use due to related changes in atmospheric demand and soil water deficits. Grapevines control their transpiration in response to a changing environment by regulating conductance of water through the soil-plant-atmosphere continuum. Most vineyard water use models currently estimate vine transpiration by applying generic crop coefficients to estimates of reference evapotranspiration, but this does not account for changes in vine conductance associated with water stress, nor differences thought to exist between varieties. The response of bulk stomatal conductance to daily weather variability and seasonal drought stress was studied on Cabernet-Sauvignon, Merlot, Tempranillo, Ugni blanc, and Semillon vines in a non-irrigated vineyard in Bordeaux France. Whole vine sap flow, temperature and humidity in the vine canopy, and net radiation absorbed by the vine canopy were measured on 15-minute intervals from early July through mid-September 2020, together with periodic measurement of leaf area, canopy porosity, and predawn leaf water potential. From this data, bulk stomatal conductance was calculated on 15-minute intervals, and multiple regression analysis was performed to identify key variables and their relative effect on conductance. Attention was focused on addressing multicollinearity and time-dependency in the explanatory variables and developing regression models that were readily interpretable. Variability of vapor pressure deficit over the day, and predawn water potential over the season explained much of the variability in conductance, with relative differences in response coefficients observed across the five varieties. By characterizing this conductance response, the dynamics of vine transpiration can be better parameterized in vineyard water use modeling of current and future climate scenarios.

Climate and the evolving mix of grape varieties in Australia’s wine regions

The purpose of this study is to examine the changing mix of winegrape varieties in Australia so as to address the question: In the light of key climate indicators and predictions of further climate change, how appropriate are the grape varieties currently planted in Australia’s wine regions? To achieve this, regions are classified into zones according to each region’s climate variables, particularly average growing season temperature (GST), leaving aside within-region variations in climates. Five different climatic classifications are reported. Using projections of GSTs for the mid- and late 21st century, the extent to which each region is projected to move from its current zone classification to a warmer one is reported. Also shown is the changing proportion of each of 21 key varieties grown in a GST zone considered to be optimal for premium winegrape production. Together these indicators strengthen earlier suggestions that the mix of varieties may be currently less than ideal in many Australian wine regions, and would become even less so in coming decades if that mix was not altered in the anticipation of climate change. That is, grape varieties in many (especially the warmest) regions will have to keep changing, or wineries will have to seek fruit from higher latitudes or elevations if they wish to retain their current mix of varieties and wine styles.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

VINIoT: Precision viticulture service for SMEs based on IoT sensors network

The main innovation in the VINIoT service is the joint use of two technologies that are currently used separately: vineyard monitoring using multispectral imaging and deployed terrain sensors. One part of the system is based on the development of artificial intelligence algorithms that are feed on the images of the multispectral camera and IoT sensors, high-level information on water stress, grape ripening status and the presence of diseases. In order to obtain algorithms to determine the state of ripening of the grapes and avoid losing information due to the diversity of the grape berries, it was decided to work along the first year 2020 at berry scale in the laboratory, during the second year at the cluster scale and on the last year at plot scale. Different varieties of white and red grapes were used; in the case of Galicia we worked with the white grape variety Treixadura and the red variety Mencía. During the 2020 and 2021 campaigns, multispectral images were taken in the visible and infrared range of: 1) sets of 100 grapes classifying them by means of densimetric baths, 2) individual bunches. The images taken with the laboratory analysis of the ripening stage were correlated. Technological maturity, pH, probable degree, malic acid content, tartaric acid content and parameters for assessing phenolic maturity, IPT, anthocyanin content were determined. It has been calculated for each single image the mean value of each spectral band (only taking into account the pixels of interest) and a correlation study of these values with laboratory data has been carried out. These studies are still provisional and it will be necessary to continue with them, jointly with the training of the machine learning algorithms. Processed data will allow to determine the sensitivity of the multispectral images and select bands of interest in maturation.

Effect of fertigation strategies to adapt PGI Côtes de Gascogne production to hot vintage

The development of fertigation could be a possible solution to adapt PGI Côtes de Gascogne (south-western France) wine production to climate change. The goal would be to limit the negative effects of water stress on yield performance expectation (around 15 tons per hectare) and to make the use of fertilizers more efficient. This study aimed to compare the effects of three strategies of water and minerals supply on grapes and wines qualities. Two fertigation practices were compared to a rainfed control which is the current standard of the local grape growing production. The fertilizers (nitrogen and potassium) were (i) fully brought by irrigation pipe during the season, (ii) partially brought by irrigation pipe and partially on the soil or (iii) fully brought on the soil at the beginning of the season for the non-irrigated control (local standard). The trial was run on cv. Colombard trained on spur pruned with vertical shoot positioning system on a sandy-silty-clay soil over the 2020 vintage which was particularly hot for the region. Moderate to strong water deficit appeared during the growing period of the berries and held on after veraison. Irrigation strategies allowed for maintaining grapevine without water deficit and being significantly different from the control water status. Grapevine with fully or partial fertigation strategies produced 25% more yield mainly due to the increase of the bunch weight. Also, the fully fertigation showed the best ratio between yield and maturity and brought 30% less of fertilizers (both nitrogen and potassium) than the two other strategies. Finally, the analysis of aromatic compounds in Colombard wines, varietal thiols family, showed the same level of concentrations for the 3 treatments, confirming that the yield performance did not impact the aromatic potential in this trial.