Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Novel contribution to the study of mouth-feel properties in wines

In general, there is a well-established lexicon related to wine aroma and taste properties; however mouth-feel-related vocabulary usually includes heterogeneous, multimodal and personalized terms. Gawel et al.
(2000) published a wheel related to mouthfeel properties of red wine. However, its use in scientific publications has been limited. The authors accepted that the approach had certain limitations as it included redundant and terms with hedonic tone and some others were absent. It is of high interest to generate a mouth-feel lexicon and finding the chemical compound or group of compounds responsible for such properties in red wine. In the present work a chemical fractionation method has been developed.

Unique resistance traits against downy mildew from the domestication center of grapevine

The Eurasian grapevine (Vitis vinifera), an Old World species now cultivated worldwide for high-quality wine production, is extremely susceptible to the agent of downy mildew, Plasmopara viticola.

What defines the aging signature of Chasselas wines?

Chasselas is a refined grape variety renowned for its subtlety and its remarkable ability to reflect terroir characteristics [1]. Typically consumed young, it is appreciated for its low acidity and delicate fruity and floral aromas.

Sensory significance of aroma carry-over during bottling from aromatized wine-based beverages into regular wine

In 2020 one out of  eight wine bottles were filled with a flavoured wine-based beverage.

Characterization of a Sémillon clonal population: exploring genetic diversity, metabolomic profiles, and phenotypic variations

Sémillon is a cultivated grape variety known for contributing to dry and sweet white wine production. However, only seven approved clones have been officially recognized in France[1]. In this study, we aimed to characterize the genetic diversity and metabolomic profiles of a Sémillon clonal population, shedding light on the potential variations within this important grape variety.