Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

NEUROPROTECTIVE AND ANTI-INFLAMMATORY PROPERTIES OF HYDROXYTYROSOL: A PROMISING BIOACTIVE COMPONENT OF WINE

Hydroxytyrosol (HT) is a phenolic compound present in olives, virgin olive oil and wine. HT has attracted great scientific interest due to its biological activities which have been related with the ortho-dihydroxy conformation in the aromatic ring. In white and red wines, HT has been detected at concentrations ranging from 0.28 to 9.6 mg/L and its occurrence has been closely related with yeast metabolism of aromatic amino acids by Ehrlich pathway during alcoholic fermentation. One of the most promising properties of this compound is the neuroprotective activity against pathological mechanisms related with neurode-generative disorders including Alzheimer’s and Parkinson’s disease.

Exploring grapevine water relations in the context of fruit growth at pre- and post-veraison

Climate change is increasing the frequency of water deficit in many grape-growing regions. Grapevine varieties differ in their stomatal behavior during water deficit, and their ability to regulate water potential under dry soil conditions is commonly differentiated using the concept of isohydricity. It remains unclear whether stomatal behavior, water potential regulation, and the resulting degree of isohydricity has a relationship with changes to fruit growth during water deficit. This study was conducted on four varieties (`Cabernet Franc`, `Semillon`, `Grenache`, and `Riesling`) subjected to both short-term, severe water deficit and long-term, moderate water deficit applied at both pre- and post-veraison.

Zoning mountain landscapes for a valorisation of high identity products

Mountain agriculture is made difficult by the geomorphological complexity of the territory. This is especially true for viticulture: over the centuries the work of men in such a difficult environment

Aromatic stability of Syrah and Petit Verdot tropical wines from Brazil

The production of fine wines in the Sub-middle of the São Francisco River Valley, Northeast of Brazil, is relatively recent, about twenty-five years ago. This region presents different characteristics

Methodology to assess vine cultivation suitability using climatic ranges for key physiological processes: results for three South African regions

Le climat a de fortes implications sur le bon fonctionnement physiologique de la vigne et a besoin d’être quantifié afin de déterminer l’aptitude des régions à la culture de la vigne. Une méthode, qui pourrait éventuellement servir à prévoir l’aptitude des régions à la culture de la vigne, est proposée.