Macrowine 2021
IVES Conference Series

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Coupling data fusion with machine learning techniques can help reveal such patterns, possibly leading to new hypotheses. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years' storage) from young (35 years) vines.

METHODS: Sensory (sorting; Mafata et al. 2020) and chemical (NMR: nuclear magnetic resonance, HRMS: high-resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal clustering conditions were determined for both methods, and the cophenetic coefficient was used to assess which method clustered the samples most confidently.
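As an illustration of the fuzzy clustering step, the following is a minimal sketch of the standard fuzzy c-means update rules (the algorithm usually attributed to Bezdek 1981), written with the Python standard library only. It is not the authors' implementation. The function names, the fuzzifier value m = 2.0, the farthest-point initialisation, and the stopping tolerance are all assumptions made for this example, since the abstract does not report the settings used.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_point_init(points, k):
    """Deterministic start (an assumption): first point, then farthest points."""
    centres = [list(points[0])]
    while len(centres) < k:
        gaps = [min(squared_distance(p, c) for c in centres) for p in points]
        centres.append(list(points[gaps.index(max(gaps))]))
    return centres


def memberships(points, centres, m):
    """Membership of each point in each cluster; every row sums to 1."""
    rows = []
    for p in points:
        d2 = [squared_distance(p, c) for c in centres]
        if min(d2) == 0.0:
            # The point sits exactly on a centre: give it full membership there.
            hits = [1.0 if d == 0.0 else 0.0 for d in d2]
            rows.append([h / sum(hits) for h in hits])
        else:
            # u_ij is proportional to d_ij^(-2/(m-1)); d2 is already squared.
            inv = [d ** (-1.0 / (m - 1.0)) for d in d2]
            rows.append([v / sum(inv) for v in inv])
    return rows


def update_centres(points, u, m):
    """Each centre is the mean of all points weighted by membership**m."""
    dim = len(points[0])
    centres = []
    for j in range(len(u[0])):
        w = [row[j] ** m for row in u]
        total = sum(w)
        centres.append(
            [sum(wi * p[d] for wi, p in zip(w, points)) / total for d in range(dim)]
        )
    return centres


def fuzzy_kmeans(points, k, m=2.0, iters=100, tol=1e-6):
    """Alternate centre and membership updates until memberships stabilise."""
    centres = farthest_point_init(points, k)
    u = memberships(points, centres, m)
    for _ in range(iters):
        centres = update_centres(points, u, m)
        new_u = memberships(points, centres, m)
        shift = max(
            abs(a - b) for old, new in zip(u, new_u) for a, b in zip(old, new)
        )
        u = new_u
        if shift < tol:
            break
    return centres, u
```

In a workflow like the one described, `points` would be the sample coordinates on the retained MFA dimensions. A hard label can be read off as the cluster with the largest membership in each row, while rows whose memberships are all close to 1/k indicate samples that the method cannot assign with confidence.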

RESULTS: Because large data sets were fused, the models were very complex. Clustering patterns were not consistent when the clustering conditions were varied, signalling high similarity between samples. The samples could not be confidently distinguished from one another even under the best optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient to solve the classification problem in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled with data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored further, keeping in mind that it is more sensitive to small differences in the data than classical statistics.

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis


