Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The wine microbial ecosystem: Molecular interactions between yeast species and evidence for higher order interactions

Fermenting grape juice represents one of the oldest continuously maintained anthropogenic microbial environments and supports a well-mapped microbial ecosystem. Several yeast and bacterial species dominate this ecosystem, and some of these species are part of the globally most studied and best understood individual organisms. Detailed physiological, cellular and molecular data have been generated on these individual species and have helped elucidate complex evolutionary processes such as the domestication of wine yeast strains of the species Saccharomyces cerevisiae. These data support the notion that the wine making environment represents an ecological niche of significant evolutionary relevance. Taken together, the data suggest that the wine fermentation ecosystem is an excellent model to study fundamental questions about the working of microbial ecosystems and on the impact of biotic selection pressures on microbial ecosystem functioning. Indeed, and although well mapped, the rules and molecular mechanisms that govern the interactions between microbial species within this, and other, ecosystems remain underexplored. Here we present data derived from several converging approaches, including microbiome data of spontaneous fermentations, the population dynamics of constructed consortia, the application of biotic selection pressures in directed laboratory evolution, and the physiological and molecular analysis of pairwise and higher order interactions between yeast species. The data reveal the importance of cell wall-related elements in interspecies interactions and in evolutionary adaptation and suggest that predictive modelling and biotechnological control of the wine ecosystem during fermentation are promising strategies for wine making in future.

Genomic perspective of Lachancea thermotolerans in wine bioacidification

We have sequenced two commercial strains of Lachancea thermotolerans (Lt) from the company Lallemand: Laktia™ y Blizz™.

Which risk assessment of water quality in pdo vineyards in Burgundy (France)?

To meet the demand of assessment tool of water managers we adapted to the vine production the INDIGO® method to developed initially for arable farming at the field scale.

Pratiques de taille et développement des jeunes vignes

Dans le cadre de TerclimPro 2025, Gonzaga Santesteban a présenté l’article IVES Technical Reviews. Retrouvez la présentation ci-dessous ainsi que l’article associé : https://ives-technicalreviews.eu/article/view/8465

The effect of soil and climate on the character of Sauvignon blanc wine

Un projet multidisciplinaire sur l’effet du sol et du climat sur la qualité du vin a débuté en Afrique du Sud il y a 5 ans. Des mesures sont effectuées sous culture sèche dans des vignes de Sauvignon Blanc dans six localités différentes, cinq dans le district de Stellenbosch et une à Durbanville.