Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be error-prone, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
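The three steps named above (segmentation, decomposition, gradient boosted tree classification) can be sketched roughly as follows. This is a minimal illustration on synthetic data and not the authors' implementation: the function names, segment length, number of components, and the simulated peak are all assumptions made for the example. To keep the sketch short, a truncated SVD of each unfolded segment is swapped in for the multiway decomposition, and the transformation of segments mentioned in the abstract is omitted.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier


def segment_chromatograms(data, segment_length):
    """Split a (samples, scans, m/z) array into equal windows along the scan axis.

    Trailing scans that do not fill a whole window are dropped.
    """
    n_segments = data.shape[1] // segment_length
    return [
        data[:, i * segment_length:(i + 1) * segment_length, :]
        for i in range(n_segments)
    ]


def segment_scores(segment, n_components):
    """Return per-sample scores for one segment.

    Stand-in for a multiway decomposition (assumption): the segment is
    unfolded to (samples, scans * m/z), mean-centred, and reduced with a
    truncated SVD.
    """
    unfolded = segment.reshape(segment.shape[0], -1)
    unfolded = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_features(data, segment_length=50, n_components=2):
    """Concatenate the per-segment sample scores into one feature matrix."""
    segments = segment_chromatograms(data, segment_length)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Synthetic data (hypothetical): 40 samples, 200 scans, 30 m/z channels.
rng = np.random.default_rng(0)
n_samples, n_scans, n_mz = 40, 200, 30
labels = np.repeat([0, 1], n_samples // 2)
data = rng.random((n_samples, n_scans, n_mz))

# Class 1 carries an extra Gaussian peak around scan 70 in m/z channel 5.
peak = np.exp(-0.5 * ((np.arange(n_scans) - 70) / 4.0) ** 2)
data[labels == 1, :, 5] += 5.0 * peak

# No peak picking or retention time alignment: the classifier only sees
# the per-segment scores.
features = build_features(data)
clf = GradientBoostingClassifier(n_estimators=50, random_state=0)
clf.fit(features, labels)
train_accuracy = clf.score(features, labels)
```

The accuracy computed here is measured on the training data and only confirms that the pieces fit together; a real evaluation would need cross-validation or a held-out test set.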

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive browser application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 
