Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of the chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
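
The first two steps of this workflow can be illustrated with a minimal sketch. It is not the implementation described in [1, 2]: the array layout (samples x scans x m/z), the fixed segment length, the function names and the random data are all assumptions made for illustration. A truncated SVD of the unfolded segment is used only as a simple stand-in for the multiway decomposition. This simplification does not accommodate retention time shifts between samples, so it is not a substitute for the decomposition named above.

```python
import numpy as np


def segment_chromatograms(data, segment_len):
    """Split a (samples, scans, mz) array into windows along the scan axis.

    The scan axis plays the role of the retention time axis. The last
    window is shorter when the number of scans is not a multiple of
    segment_len. Both the layout and the fixed window are assumptions.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_len, :]
            for start in range(0, n_scans, segment_len)]


def segment_scores(segment, n_components):
    """Return per-sample scores for one segment.

    Stand-in for a multiway decomposition: each sample's (scans, mz)
    slab is unfolded into one row and a truncated SVD is applied.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]


def build_feature_table(data, segment_len, n_components):
    """Concatenate the scores of all segments into one samples x features table."""
    blocks = [segment_scores(seg, n_components)
              for seg in segment_chromatograms(data, segment_len)]
    return np.hstack(blocks)


# Hypothetical data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))
features = build_feature_table(data, segment_len=30, n_components=2)
```

A table of this kind, with one row per sample, could then be passed together with class labels to a gradient boosted tree classifier for the supervised step.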

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
