Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Influence of temperature and light on vegetative growth and bud fruitfulness of grapevine cv. Semillon

Aim: To investigate the effects of different levels of temperature and light intensity on grapevine vegetative growth and bud fruitfulness, which includes the number and size of inflorescence primordia in primary buds.

Metodología para la zonificación de áreas vitícolas: aplicación en un area modelo del Penedés

Se propone una metodología para la zonificación del viñedo, a partir de las características climáticas, edáficas y geomorfológicas, en una área de 3700 ha del Penedés

Vineyard yield estimation using image analysis: assessing bunch occlusions and its dependency on fruiting zone canopy features

Performing accurate vineyard yield estimation is of upmost importance as it provides important benefits to the whole vine and wine industry. Recently, image-analysis approaches have been explored to address this issue however this approach has as main challenge the bunch occlusion, mostly by vegetation but also by neighboring bunches. The present work aims at assessing the magnitude of bunch occlusion by neighboring bunches and to evaluate its dependency on a selection of vegetative and reproductive vine parameters assessed at fruiting zone. Forty vine segments (1 m) of two vineyard plots of the white cultivars ‘Alvarinho’ and ‘Arinto’ were assessed for vegetative and reproductive features at fruiting zone and imaged with a 2D camera.

Application of zoning for wine production, digitalisation and traceability

Depuis la création des outils d’amélioration et de suivi de la qualité, le CREDO développe et réalise des zonages de potentialités viticoles.

Is it possible to approximate the technological and phenolic maturity of grapes by foliar application of elicitors?

The increase in the temperature and the more severe water stress conditions, trends observed in recent years as a consequence of climate change, are leading a mismatch between the technological and phenolic maturity of grapes