Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account. They are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
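
The first two steps of such a workflow can be illustrated with a minimal sketch. It is not the implementation of [1, 2]: the abstract does not name the decomposition model, so a plain CP (PARAFAC) model fitted by alternating least squares is assumed here as a stand-in, the transformation of the segments is omitted, and the function names (`segment_chromatograms`, `khatri_rao`, `cp_als`), the equal-width segmentation and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np

def segment_chromatograms(X, n_segments):
    """Split a (samples, scans, m/z) array into retention-time windows."""
    return np.array_split(X, n_segments, axis=1)

def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x R) and B (J x R) -> (I*J x R)."""
    R = A.shape[1]
    return (A[:, None, :] * B[None, :, :]).reshape(-1, R)

def cp_als(T, rank, n_iter=50, seed=0):
    """CP decomposition of a 3-way array by alternating least squares.

    Assumed stand-in for the multiway decomposition step; returns the
    sample, elution-profile and spectral factor matrices.
    """
    rng = np.random.default_rng(seed)
    I, J, K = T.shape
    A = rng.random((I, rank))
    B = rng.random((J, rank))
    C = rng.random((K, rank))
    T0 = T.reshape(I, J * K)                      # mode-0 unfolding
    T1 = np.moveaxis(T, 1, 0).reshape(J, I * K)   # mode-1 unfolding
    T2 = np.moveaxis(T, 2, 0).reshape(K, I * J)   # mode-2 unfolding
    for _ in range(n_iter):
        # Each update is the least-squares solution with the other two factors fixed.
        A = T0 @ np.linalg.pinv(khatri_rao(B, C).T)
        B = T1 @ np.linalg.pinv(khatri_rao(A, C).T)
        C = T2 @ np.linalg.pinv(khatri_rao(A, B).T)
    return A, B, C

# Synthetic data (hypothetical): 6 samples, 40 scans, 15 m/z channels,
# with one Gaussian peak in each half of the retention-time axis.
rng = np.random.default_rng(1)
n_samples, n_scans, n_mz = 6, 40, 15
t = np.arange(n_scans)
profiles = np.stack(
    [np.exp(-0.5 * ((t - c) / 2.0) ** 2) for c in (10, 30)], axis=1
)
spectra = rng.random((n_mz, 2))
conc = rng.random((n_samples, 2)) + 0.5
X = np.einsum("ir,jr,kr->ijk", conc, profiles, spectra)

# Segment along the retention-time axis, decompose each segment, and
# collect the sample-mode scores as one feature column per component.
segments = segment_chromatograms(X, n_segments=2)
features = np.hstack([cp_als(seg, rank=1)[0] for seg in segments])
```

In this sketch `features` has one row per sample and one column per segment component. A matrix of that form could then be passed, together with class labels, to a gradient boosted tree classifier (for example scikit-learn's `GradientBoostingClassifier`) for the supervised step.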

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
