Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The drought, the temperature, and the time: drivers of osmotic adjustment?

Context and purpose of the study. Leaf osmotic adjustment (i.e., active accumulation of osmolytes in the cells) has been reported in grapevines in response to drought and as a natural process throughout the growing season (seasonal osmotic adjustment).

Analysis of temporal variability of cv. Tempranillo phenology within Ribera del Duero Do (Spain) and relationships with climatic characteristics

The Ribera del Duero Designation of Origin (DO) has acquired great recognition during the last decades, being considered one of the highest quality wine producing regions in the world. This DO has grown from 6,460 ha of vineyards officially registered in 1985 to approximately 21,500 ha in 2013. The total grape production stands at around 90 million kg, with an average yield that approaches nearly 4,500 kg/ha. Most vineyards are cultivated under rainfed conditions.

The environmental footprint of selected vineyard management practices: A case study from Logroño (La Rioja) Spain

Viticulture is globally important for socioeconomic and environmental reasons. The EU is globally leading grape and wine production, and Spain is among the top grape and wine producers. As climate change affects viticulture, mitigation and adaptation are crucial for protecting grape production. In this research work, data on viticultural management practices such as soil cultivation, irrigation, energy, machinery, plant protection and the use of fertilizers from vineyards located in Logroño (La Rioja) have been obtained.

Estudios de zonificación vitícola en España

La delimitación y caracterización de zonas vitícolas plantea en España problemas específicos no sólo por las características peculiares del territorio sino también por el tamaño

Caracterización de las tierras de viña de Navarra

Este programa se enmarca dentro de las líneas de trabajo del Departamento de Agricultura, Ganadería y Alimentación del Gobiemo de Navarra y su objetivo general es conocer adecuadamente las