Macrowine 2021
IVES 9 IVES Conference Series 9 Data fusion approaches for sensory and multimodal chemistry data applied to storage conditions

Data fusion approaches for sensory and multimodal chemistry data applied to storage conditions

Abstract

AIM: The need to combine multimodal data for complex samples is due to the different information captured in each of the techniques (modes). The aim of the study was to provide a critical evaluation of two approaches to fusing multi-modal chemistry and sensory data, namely, multiblock multiple factor analysis (MFA) and concatenation using principal component analysis (PCA).

METHODS: Wines were submitted to sensory analysis using Pivot©Profile (Thuillier et al. 2015) and chemical analysis in four modes: antioxidant measurements (AM), volatile compounds composition (VCC), ultraviolet-visible light (UV-Vis) spectrophotometry (Mafata et al. 2019), and infra-red (IR) spectroscopy. Correspondence analysis (CA), principal component analysis (PCA), and multiple factor analysis (MFA) were used to model data under the data analysis steps involving data cleaning, visualizing, modelling and evaluation (Pagès 2004). Percentage explained variation (%EV) and regression vector (RV) coefficients were used as comparative evaluation parameters between data models (Abdi 2007).

RESULTS: IR spectral data were used as an example of the assessment of the need for data cleaning/pre-processing. Similarities in MFA and high RV coefficients indicated that the raw (unprocessed data) could be used for the data fusion. High RV coefficients and MFA proximity between the antioxidants and UV-Vis measurements indicated an overlap between the type of information contained in the two. The differences between the information captured in each of the five modes can be seen in the different measurements, from the knowledge of the theory/ ontext behind the technique, and statistically. Statistically, the differences are measured and visualised by a lack of overlap (redundancy) in the MFA and its accompanying cluster analysis. 

CONCLUSIONS

The %EV when performing PCA are higher than with MFA, a consequence of fusing big data sets from various modes and not necessarily a direct result of the relationships among the data sets. Therefore, the %EV was ruled out as a reliable measure of the differences in informational value between MFA and PCA fusion strategies. RV coefficients, of which MFA were highest, were the best measurements of the performance of data fusion approaches. MFA demonstrated greater appropriateness as a statistical tool for fusing multi-modal data.

DOI:

Publication date: September 13, 2021

Issue: Macrowine 2021

Type: Article

Authors

Jeanne Brand

South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa,Mpho, MAFATA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Martin, KIDD, Centre for Statistical Consultation, Stellenbosch University, South Africa Andrei, MEDVEDOVICI, Faculty of Chemistry, University of Bucharest, Romania Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa

Contact the author

Keywords

data fusion; sensory evaluation; chemical composition; white wines; storage

Citation

Related articles…

Nitrogen isotope ratio (δ15N) as a tool to trace the major nitrogen source in vineyards

Aim: to elucidate if it is possible to detect variations in the source of nitrogen (organic vs. inorganic) measuring nitrogen isotope ratio (δ15N) in berries and to examine the degree of variation occurring for this parameter naturally within a vineyard.

Multi-omics methods to unravel microbial diversity in fermentation of Riesling wines

Wine aroma is shaped by the wine’s chemical compositions, in which both grape constituents and microbes play crucial roles. Although wine quality is influenced by the microbial communities, less is known about their population interactions.

VineAI: artificial intelligence for fungal disease

Early and accurate grapevine disease detection and surveillance are crucial for optimizing vineyard management practices.

Mathematical modeling of fermentation kinetics: a tool to better understand interactions between Torulaspora delbrueckii and Saccharomyces cerevisiae in mixed cultures

Nowadays the use of Torulaspora delbrueckii is more and more common in winemaking. However, its behavior in presence of Saccharomyces cerevisiae is not always predictable.

The modification of cultural practices in grapevine cv. Syrah, does it modify the characteristics of the musts?

The work shows the results of a year of experimentation (2020) in a Syrah variety vineyard in La Roda (Castilla-La Mancha, Spain). The trial approach was on a randomized block design with two factors: Irrigation (I) and Pruning (P).
Irrigation schedules were adjusted to apply amounts close to 1,500 m3/ha. With this provision, 2 different irrigation treatments were proposed: I1) Start of irrigation from pea-sized grape to post-harvest (providing at least 20 % of the total amount of irrigation water to be provided post-harvest); I2) Start of irrigation from pea-sized grape to harvest (usual irrigation practice in the study area). Pruning was proposed with two treatments, one at the end of January (P1), which is pruning on a conventional date; and P2) pruning carried out at the beginning of budding. In total, 4 repetitions were designed with 4 elementary plots, each one of them representing one of the proposed treatments (I1P1; I1P2; I2P1; I2P2). In total, 16 plots were worked on and each elementary plot consisted of 30 strains, distributed in 3 lines.
The productive response was evaluated with the yield results of the harvest harvested at 23 ºBrix. The qualitative response was measured in the musts through the indices of technological (acidity, pH and potassium) and phenolic maturity and aromatic compounds in free and glycosylated fractions. The treatments tested had, in general, an effect on the different variables analyzed.