Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be resolved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using as case study involving Chenin Blanc wines (recently bottled and after two years storage) from young (35 years) vines.

METHODS: Sensory (sorting (Mafata et al. 2020)) and chemical (NMR: nuclear magnetic resonance, HRMS: high resolution mass spectrometry, and UV-Vis: ultraviolet spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods and the cophenetic coefficient was used to assess the most confident clustering method.

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS:

Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata, Jeanne

1South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University & 2School for Data Science and Computational Thinking, Stellenbosch University, South Africa, BRAND, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University

Contact the author

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Citation

Related articles…

REMEDIATION OF SMOKE TAINTED WINE USING MOLECULARLY IMPRINTED POLYMERS

In recent years, vineyards in Australia, the US, Canada, Chile, South Africa and Europe have been exposed to smoke from wildfires. Wines made from smoke-affected grapes often exhibit unpleasant smoky, ashy characters, attributed to the presence of smoke-derived volatile compounds, including volatile phenols (which occur in free and glycosylated forms). Various strategies for remediation of smoke tainted wine have been evaluated. The most effective strategies involve the removal of smoke taint compounds via the addition of adsorbent materials such as activated carbon, which can either be added directly or used in combination with nanofiltration. However, these treatments often simultaneously remove wine constituents responsible for desirable aroma, flavour and colour attributes.

Lactiplantibacillus plantarum – A versatile tool for biological deacidification

Malolactic fermentation (MLF) is a secondary wine fermentation conducted by lactic acid bacteria (LAB). This fermentation is important in winemaking as it deacidifies the wine, converting L-malic acid into L-lactic acid and carbon dioxide, and it contributes to microbial stability. Wine pH is highly selective, and at pH below 3.5 generally only strains of O. oeni can survive and express malolactic activity, while under more favorable growth conditions above pH 3.5, species of Lactobacillus and Pediococcus may conduct the MLF. Among the LAB species Lactiplantibacillus plantarum strains have shown most interesting results under hot climate conditions, not only for their capacity to induce MLF, but also for their homo-fermentative properties towards hexose sugars, which makes them suitable for induction of MLF in high pH and high alcohol wines, when inoculated at the beginning of alcoholic fermentation.

Grapevine yield-gap: identification of environmental limitations by soil and climate zoning in Languedoc-Roussillon region (south of France)

Grapevine yield has been historically overlooked, assuming a strong trade-off between grape yield and wine quality. At present, menaced by climate change, many vineyards in Southern France are far from the quality label threshold, becoming grapevine yield-gaps a major subject of concern. Although yield-gaps are well studied in arable crops, we know very little about grapevine yield-gaps. In the present study, we analysed the environmental component of grapevine yield-gaps linked to climate and soil resources in the Languedoc Roussillon. We used SAFRAN data and IGP Pays d’Oc wine yields from 2010 to 2018. We selected climate and soil indicators proving to have a significant effect on average wine yield-gaps at the municipality scale. The most significant factors of grapevine yield were the Soil Available Water Capacity; followed by the Huglin Index and the Climatic Dryness Index. The Days of Frost; the Soil pH; and the Very Hot Days were also significant. Then, we clustered geographical zones presenting similar indicators, facilitating the identification of resources yield-gaps. We discussed the number of zones with the experts of IGP Pays d’Oc label, obtaining 7 zones with similar limitations for grapevine yield. Finally, we analysed the main resources causing yield-gaps and the grapevine varieties planted on each zone. Mapping grapevine resource yield-gaps are the first stage for understanding grapevine yield-gaps at the regional scale.

Plastid genomics of Vitis vinifera L. for understanding the molecular basis of  grapevine (Vitis vinifera L.) domestication

The precise molecular mechanisms underlying the domestication of grapevine (Vitis vinifera L.) Are still not fully understood. In the recent years, next-generation sequencing (NGS) of plastid genomes has emerged as a powerful and increasingly effective tool for plant phylogenetics and evolution. To uncover the biological profile of the grapevine domestication process comprehensively, an investigation should encompass both the cultivated varieties (V. vinifera subsp. Vinifera) and their wild ancestors V. vinifera subsp. Sylvestris) across all potential sites of their distribution and domestication.

Defining gene regulation and co-regulation at single cell resolution in grapevine

Conventional molecular analyses provide bulk genomic/transcriptomic data that are unable to reveal the cellular heterogeneity and to precisely define how gene networks orchestrate organ development. We will profile gene expression and identify open chromatin regions at the individual cells level, allowing to define cell-type specific regulatory elements, developmental trajectories and transcriptional networks orchestrating organ development and function. We will perform scRNA-seq and snATAC-seq on leaf/berry protoplasts and nuclei and combine them with the leaf/berry bulk tissues obtained results, where the analysis of transcripts, chromatin accessibility, histone modification and transcription factor binding sites showed that a large fraction of phenotypic variation appears to be determined by regulatory rather than coding variation and that many variants have an organ-specific effect.

Macrowine 2021
IVES 9 IVES Conference Series 9 Beyond classical statistics – data fusion coupled with pattern recognition

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

Content of the article

References

Section for all references

DOI:

Publication date: September 7, 2021

Issue: (ex: Issue: Terclim 2023)

Type: typeofpublication

Authors

author1, author2, author3

Presenting author

Description

List of affiliations ¹ ² ³

Contact the author

Email address (with mailto: link)

Keywords

List of different keywords (keyword1, keyword2, keyword3)

Tags

Citation

Related articles…

New food trend ahead? Highlighting the nutritional benefits of grapevine leaves

The wine industry produces an enormous amount of waste every year. A wider inclusion of disregarded by-products in the human diet or its use as a source of bioactive compounds is a good strategy for reducing waste. It will not only introduce an added value to a waste product but also come upon the European Union and United Nations’ demands towards more sustainable agricultural approaches and circular economy.

Impact of water stress on the phenolic composition of cv. Merlot grapes, in a typical terroir of the La Mancha region (Spain)

The study was carried out in 2006 with Merlot grapes from vines grown using the trellis system, where four treatments were compared with different levels of water stress.

Impact of copper residues in grape must on alcoholic fermentation: effects on yeast performance, acetaldehyde and SO2 production

A relevant trend in winemaking is to reduce the use of chemical compounds in both the vineyard and winery.

La Région Délimitée du Douro et le Vin de Porto — un terroir historique —

The viticulture of the Douro Delimited Region, one of the heirs of ancestral viticulture, traditionally empirical and of quality, while integrating modernity and contemporary tools, respects and has always present the principles on which it was developed.

Tomatoes and Grapes: berry fruits with a (bright) biotech future?

Tomatoes and Grapes are berries that are genetically related and therefore at least partially their developmental pathways leading to a fleshy fruit should share some of the components. In a sense knowledge obtained from the model plant tomato could be useful for grape and conversely the more amenable tomato can be used to test some hypothesis that would be difficult to obtain in grape. Research in my lab and other labs have led to a better understanding of the molecular genetics mechanisms underlying fruit development and ripening in tomato and more specifically those related to metabolite accumulation that may lead to changes in fruit nutritional and flavor composition. This research has involved the use of genetic variability in natural population, but also biparental population and genetically engineered lines that are easy to develop in tomato tomato but not in grape. NGTs also can be easily implemented in tomato to not only speed up the gene-to-trait but also develop new tomato varieties.