Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
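The segmentation and decomposition steps can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fixed window length, the random toy data, and the use of a truncated SVD on the unfolded segment as a simple stand-in for the multiway decomposition are all assumptions made for illustration. The resulting per-sample scores would then be passed to a gradient boosted tree classifier, which is omitted here.

```python
import numpy as np


def segment_chromatograms(data, segment_length):
    """Split a (samples, scans, m/z channels) array into equal windows
    along the scan (retention time) axis.

    Trailing scans that do not fill a whole window are dropped.
    Returns an array of shape (segments, samples, segment_length, m/z).
    """
    n_samples, n_scans, n_mz = data.shape
    n_segments = n_scans // segment_length
    trimmed = data[:, : n_segments * segment_length, :]
    windows = trimmed.reshape(n_samples, n_segments, segment_length, n_mz)
    return windows.swapaxes(0, 1)


def segment_scores(segment, n_components):
    """Per-sample scores for one segment.

    ASSUMPTION: a truncated SVD of the sample-wise unfolded segment is
    used here as a simple stand-in for a multiway decomposition.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_matrix(data, segment_length, n_components):
    """Concatenate the scores of all segments into one feature matrix."""
    segments = segment_chromatograms(data, segment_length)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Hypothetical toy data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
toy_data = rng.random((6, 100, 20))

features = build_feature_matrix(toy_data, segment_length=25, n_components=2)
print(features.shape)
```

Because each window is decomposed independently across samples, no peak has to be detected or matched between chromatograms in this sketch. In practice, the window length and the number of components per segment would need to be chosen for the data at hand.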

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
