Macrowine 2021
IVES 9 IVES Conference Series 9 Data fusion approaches for sensory and multimodal chemistry data applied to storage conditions

Data fusion approaches for sensory and multimodal chemistry data applied to storage conditions

Abstract

AIM: The need to combine multimodal data for complex samples is due to the different information captured in each of the techniques (modes). The aim of the study was to provide a critical evaluation of two approaches to fusing multi-modal chemistry and sensory data, namely, multiblock multiple factor analysis (MFA) and concatenation using principal component analysis (PCA).

METHODS: Wines were submitted to sensory analysis using Pivot©Profile (Thuillier et al. 2015) and chemical analysis in four modes: antioxidant measurements (AM), volatile compounds composition (VCC), ultraviolet-visible light (UV-Vis) spectrophotometry (Mafata et al. 2019), and infra-red (IR) spectroscopy. Correspondence analysis (CA), principal component analysis (PCA), and multiple factor analysis (MFA) were used to model data under the data analysis steps involving data cleaning, visualizing, modelling and evaluation (Pagès 2004). Percentage explained variation (%EV) and regression vector (RV) coefficients were used as comparative evaluation parameters between data models (Abdi 2007).

RESULTS: IR spectral data were used as an example of the assessment of the need for data cleaning/pre-processing. Similarities in MFA and high RV coefficients indicated that the raw (unprocessed data) could be used for the data fusion. High RV coefficients and MFA proximity between the antioxidants and UV-Vis measurements indicated an overlap between the type of information contained in the two. The differences between the information captured in each of the five modes can be seen in the different measurements, from the knowledge of the theory/ ontext behind the technique, and statistically. Statistically, the differences are measured and visualised by a lack of overlap (redundancy) in the MFA and its accompanying cluster analysis. 

CONCLUSIONS

The %EV when performing PCA are higher than with MFA, a consequence of fusing big data sets from various modes and not necessarily a direct result of the relationships among the data sets. Therefore, the %EV was ruled out as a reliable measure of the differences in informational value between MFA and PCA fusion strategies. RV coefficients, of which MFA were highest, were the best measurements of the performance of data fusion approaches. MFA demonstrated greater appropriateness as a statistical tool for fusing multi-modal data.

DOI:

Publication date: September 13, 2021

Issue: Macrowine 2021

Type: Article

Authors

Jeanne Brand

South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa,Mpho, MAFATA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa  Martin, KIDD, Centre for Statistical Consultation, Stellenbosch University, South Africa Andrei, MEDVEDOVICI, Faculty of Chemistry, University of Bucharest, Romania Astrid, BUICA, South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa

Contact the author

Keywords

data fusion; sensory evaluation; chemical composition; white wines; storage

Citation

Related articles…

Sviluppo di una metodologia di tracciabilità e definizione dell’impronta petrochimica in suoli e vini della Sicilia occidentale nella piana di Marsala (TP)

I risultati delle ricerche condotte in un vigneto sperimentale di Marsala (TP), scelto per omogeneità di fattori bio-agronomici (età, tecniche colturali, potenzialità vegetativa e produttiva)

Impact of grapevine rootstock genotypes on nitrogen status of the scion and phenolic composition in Pinot noir berries and wine

Context and purpose of the study. Nitrogen (N) limitation enhances the production of phenolic compounds in grapes due to the downregulation of the flavonoid biosynthesis pathway.

Climate and mesoclimate zonification in the Miño valley (Galicia, NW Spain)

Galicia est une région située dans le Nord-Ouest de l’Espagne avec une longe tradition de culture de la vigne. A jour d’oui la vigne occupe en Galicia presque 28.500 ha, desquelles 8.100 correspondent aux 5 zones ayant droit à l’appellation DO (« Denominación de Origen ») équivalent aux AOC françaises.

α-Terpinyl ethyl ether: stereoselective GC × GC confirmation and identification of its precursors in wine

Wines exhibit profound chemical complexity which arise from a diverse array of compounds that contribute to its sensory profile.

Qualitative modelling of factors influencing the development of Black rot, for the prediction of damage to bunches

Vines are one of the most pesticide-intensive crops in France, and reducing their use is a major challenge for both the environment and human health.