Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises three steps: segmentation of the chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification applied to the decomposed tensor [1, 2].
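
The first two steps can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the equal-width segmentation, and the synthetic data are assumptions made for illustration, and a truncated SVD of the unfolded segment is swapped in as a simple stand-in for the multiway decomposition. Unlike a true multiway model, this stand-in does not tolerate retention time shifts between samples.

```python
import numpy as np

def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z channels) array into equal-width
    retention-time segments. Trailing scans that do not fill a whole
    segment are dropped in this sketch (an assumption)."""
    n_samples, n_scans, n_mz = data.shape
    n_segments = n_scans // segment_width
    trimmed = data[:, : n_segments * segment_width, :]
    # Result shape: (segments, samples, scans per segment, m/z channels)
    return trimmed.reshape(n_samples, n_segments, segment_width, n_mz).swapaxes(0, 1)

def segment_scores(segment, n_components):
    """Stand-in decomposition: unfold one segment sample-wise and keep
    the leading SVD scores, one row per sample."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]

def build_feature_table(data, segment_width, n_components):
    """Concatenate the per-segment scores into one sample-by-feature table."""
    segments = segment_chromatograms(data, segment_width)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])

# Synthetic placeholder data: 6 samples, 100 scans, 20 m/z channels
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))
features = build_feature_table(data, segment_width=25, n_components=2)
```

A table like `features`, together with class labels, could then be passed to a gradient boosted tree classifier (for example scikit-learn's `GradientBoostingClassifier`) for the supervised step.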

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
