Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

OmicBots – An innovative and intelligent multi-omics platform facing wine sector challenges

To face emerging competition and challenges, wine producers globally rely on precision viticulture (PV) solutions to boost productivity, enhance quality, increase profitability, and reduce the environmental impact of vineyards. Current pv methods predominantly use multispectral sensor data from several platforms (satellites or vineyard installations). However, these applications generally use data analysis strategies lacking physiological grapevine support.

Biotic and abiotic factors affecting physiological aspects underlying vegetative vigour in two commercial grapevine varieties

Grapevine vigour, defined as the propensity to assimilate, store and/or use non-structural sugars for allowing fast growth of shoots and producing large canopies[1], is crucial to optimize vineyard management. Recently, a model has been proposed for predicting the vigor of young grapevines through the measurement of the vegetative growth and physiological parameters, such as water status and gas exchange[2]. Our objectives were (1) to explore the influence of the association of two grapevine varieties (Tempranillo and Cabernet Sauvignon, grafted onto R110 rootstocks) with arbuscular mycorrhizal fungi (AMF) on the vegetative vigour of young plants; and (2) to assess the effect of environmental factors linked to climate change on the vegetative vigour of Cabernet Sauvignon.

Terroir in Slovak viticulture area

Terroir method has been used for assessment of growing site in the world for years. In Slovakia actually regionalisation is used as the similar method which does not cover all the elements of wine quality evaluation however.

Genetic traceability of ‘Nebbiolo’ musts and wines by single nucleotide polymorphism (SNP) genotyping assays

AIM: ‘Nebbiolo’ (Vitis vinifera L.) is one of the most ancient and prestigious Italian grape cultivars. It is renowned for its use in producing monovarietal high-quality red wines, such Barolo and Barbaresco. Wine quality and value can be heavily modified if cultivars other than those allowed are employed.

The use of δ13C as an indicator of water use efficiency for the selection of drought tolerant grapevine varieties

In the context of climate change with increasing evaporative demand, understanding the water use behavior of different grapevine cultivars is of critical importance. Carbon isotope discrimination (δ13C) measurements in wine provide a precise and integrated assessment of the water status of the vines during the sugar accumulation period in grape berries. When collected over multiple vintages on different cultivars, δ13C measurements can also provide insights into the effects of genotype on water use efficiency.