Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Il paesaggio delle alberate aversane ed il vino Asprinio

Nel corso del 2009, in alcuni vigneti allevati ad alberata in provincia di Caserta (Italia), è stata avviata una ricerca per valutare la variabilità genetica della popolazione del vitigno ‘Asprinio’, la condizione sanitaria delle piante e le caratteristiche del vino sia rispetto alla forma di allevamento (alberata tradizionale e controspalliera) che all’altezza della fascia produttiva.

FLAVONOID POTENTIAL OF MINORITY RED GRAPE VARIETIES

The alteration in the rainfall pattern and the increase in the temperatures associated to global climate change are already affecting wine production in many viticultural regions all around the world (1). In fact, grapes are nowadays ripening earlier from a technological point of view than in the past, but they are not necessarily mature from a phenolic point of view. Consequently, the wines made from these grapes can be unbalanced or show high alcohol content. Dramatic shifts in viticultural areas are currently being projected for the future (2).

Aroma composition of mono-varietal white wines for the production of Custoza

AIM: The appellation “Bianco di Custoza” or “Custoza”, born in 1971, is one of the oldest white wines Protected Designation of Origin in Italy.

Estimation of plant hydraulics of grapevine in various «terroirs» in the Canton of Vaud (Switzerland)

The study of the physiological behaviour of the grapevine (cv. Chasselas), and of plant hydraulics in particular, was conducted on various « terroirs » in the Canton of Vaud (Switzerland) between 2001 and 2003 by Agroscope Changins-Wädenswil ACW, in collaboration with the firm I. Letessier (SIGALES) in Grenoble and the Federal Polytechnic School of Lausanne (EPFL). An evaluation of the vine plant hydraulics was made by means of physiological indicators (leaf and stem water potentials, transpiration and leaf stomatal conductance, carbon isotope discrimination and a model of transpirable soil water), in relation to estimations of the soil water reservoir and climatic factors.

The effect of short and long-term water deficit on physiological performance and leaf microbiome of different rootstock and scion combinations

Climate change, particularly drought stress, threatens viticulture sustainability. Understanding scion-rootstock interactions and their link to the grapevine microbiome is key to improving vine health, productivity, and drought resilience.