Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Analyse et modélisation des transferts thermiques dans un sol de vignoble. Effets des techniques culturales

Natural factors such as the environment in which the vine is grown play an important role in the quality of the wine. If you want to produce a good wine, it is indeed essential to produce quality grapes. To do this, we must enhance and optimize the terroir effect which, for the moment, plays a role that is not very well known. It is therefore essential, for example, to have scientifically established and well quantifiable relationships in order to have the system of areas of controlled origin accepted. R. Morlat (1989) and G. Seguin (1970) have already carried out studies on the role of certain soil factors on grape quality. In particular, they showed the importance of soil temperature and water content.

Managing Grapevine Powdery Mildew with Ultraviolet-C Light in Washington State

Germicidal ultraviolet-C (UV-C) light has shown promising results for suppression of several plant-pathogenic microorganims, including Erysiphe necator, which attacks grapevine. In Washington State the majority of winegrape production is in a semi-arid steppe environment, with historically low powdery mildew disease pressure, making it a promising area to deploy UV-C as a disease management tool. Trials focusing on UVC application timing and frequency will assist in developing regionally-appropriate application recommendations for eastern Washington State.

Studio dell’ambiente viticolo attraverso la parametrazione (punto di incrocio) delle curve di maturazione delle uve (pinot nero, oltrepo’ pavese pv italia settentrionale – 45° parallelo Nord)

Sono stati presi in considerazione alcuni dati agrometeorologici dell’Oltrepò Pavese (temperature e piovosità degli ultimi 80 anni) e gli studi delle curve di maturazione condotti in zona sul Pinot nero da spumante negli anni (1988-1991, 1999-2000, 2006-2008), si nota che l’aumento progressivo negli anni delle temperature attive (indice di Winkler) ha determinato un anticipo dell’invaiatura, definita dal parametro “punto di incrocio” (intersezione delle funzioni di zuccheri ed acidità nel tempo), con conseguente anticipo della data di vendemmia di circa 12-15 gg.

Sensory and physicochemical impact of proanthocyanidic tannins on red wine fruity aroma

AIM: Previous research on the fruity character of red wines highlighted the role of esters [1]. Literature provides evidence that, besides these esters, other compounds that are not necessarily volatiles may have an important impact on the overall fruity aroma of wine, contributing to a masking effect [2][3]. The goal of this work was to assess the olfactory consequences of a mixture between esters and proanthocyanidic tannins, through sensory and physico-chemical approaches.

Terroir and precision viticulture: are they compatible?

The concept of terroir or sense of place is almost as old as the wine industry. It is generally used as an all-encompassing term to reflect the effects of the biophysical environment in which grapes and their resultant wines are produced on the character of those wines. Historically, terroir has generally been considered at the regional or property scale.