Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Are dicysteinyl polysulfanes responsible for post-bottling release of hydrogen sulfide?

Hydrogen sulfide (H2S) has a significant impact on wine aroma attributes and wine quality when present at concentrations above its aroma threshold of 1.1 to 1.6 μg/L.

Responses of grape yield and quality, soil physicochemical and microbial properties to different planting years

As an economically important fruit crop, continuous cropping of grapes can potentially impact soil health resulting in decreased yields.

Microbial ecosystems in wineries – molecular interactions between species and modelling of population dynamics

Microbial ecosystems are primary drivers of viticultural, oenological and other cellar-related processes
such as wastewater treatment. Metagenomic datasets have broadly mapped the vast microbial species
diversity of many of the relevant ecological niches within the broader wine environment, from vineyard
soils to plants and grapes to fermentation. The data highlight that species identities and diversity
significantly impact agronomic performance of vineyards as well as wine quality, but the complexity
of these systems and of microbial growth dynamics has defeated attempts to offer actionable
tools to guide or predict specific outcomes of ecosystem-based interventions.

Rootstock differences in soil-water uptake during drying-wetting cycles imaged with 3d electrical resistivity tomography

Limited knowledge has been acquired on grapevine roots and rhizosphere processes because of harder access when compared to aerial parts. There is need for new methods to study root behavior in undisturbed field conditions, and relate these effects on canopy and yield. The aim of this multidisciplinary study was to image and quantify spatial-temporal differences in soil-water uptake by genetically different rootstocks and to assess the response of the canopy during drought and rewetting.

Responses of grapevine cells to physiological doses of ethanol, among which induced resistance to heat stress

Grapevine naturally endures stresses like heat, drought, and hypoxia. A recent study showed very low oxygen levels inside grape berries, linked to ethanol content.