Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Berry shrivel causes – summarizing current hypotheses

Diverse ripening disorders affect grapevine resulting in high economic losses worldwide. The common obvious symptom is shriveling berries, however the shriveling pattern and the consequences for berry quality traits are distinct in each disorder. Among them, the disorder berry shrivel is characterized by a reduced sugar accumulation short after the onset of berry ripening leaving the clusters unsuitable for wine processing. Although our knowledge on BS increased recently, potential internal or external triggers contributing to the induction of BS are yet to be explored.

Effect of non-Saccharomyces yeast and lactic acid bacteria on selected sensory attributes and polyphenols of Syrah wines

Consumers predominantly use visual, aromatic and texture cues as quality/preference indicators to describe olfactory sensations. In this study, the effect of micro-organism in wine production was investigated using analytical and sensory techniques to achieve relevant analytical characterisation. Selected anthocyanins, flavan-3-ols, flavonols and phenolic acids were quantified in Syrah wines using RP-HPLC-DAD. Standard oenological parameters were also measured. Syrah grape must was fermented with various combinations of Saccharomyces cerevisiae (S. cerevisiae) and non-Saccharomyces (Metschnikowia pulcherrima or Hanseniaspora uvarum) yeasts, which was followed by sequential inoculation of lactic acid bacteria (LAB) (Oenococcus oeni or Lactobacillus plantarum).

Growing characteristics of new PIWI varieties from the breeding program in the Czech Republic

Context and purpose of the study. The breeding of PIWI varieties has a long tradition in the Czech Republic. In the last two years, 9 new PIWI varieties have been registered.

A multivariate approach using attenuated total reflectance mid-infrared spectroscopy to measure the surface mannoproteins and β-glucans of yeast cell walls during wine fermentations

Yeast cells possess a cell wall comprising primarily glycoproteins, mannans, and glucan polymers. Several yeast phenotypes relevant for fermentation, wine processing, and wine quality are correlated with cell wall properties. To investigate the effect of wine fermentation on cell wall composition, a study was performed using mid-infrared (MIR) spectroscopy coupled with multivariate methods (i.e., PCA and OPLS-DA). A total of 40 yeast strains were evaluated, including Saccharomyces strains (laboratory and industrial) and non-Saccharomyces species. Cells were fermented in both synthetic MS300 and Chardonnay grape must to stationery phase, processed, and scanned in the MIR spectrum.

Terroir et variabilité microclimatique : pour une approche à l’échelle de la parcelle

The climatic component is one of the elements of the zoning of viticultural potential, alongside the geological and pedological components (Morlat, 1989; Lebon et al , 1993). Many climatic indices have thus been defined to estimate the potential for wine production at the scale of a region or a country (Carbonneau et al ., 1992). The main climatic variables used are temperature and radiation. We note in particular the indices of Branas, Huglin and Ribereau-Gayon (Huglin, 1986). However, few studies have been undertaken on the spatial variability of microclimatic conditions at the scale of a vineyard, a valley, or even a municipality.