Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Botrytis cinerea: Coconut or Catastrophe? Quantification of γ-Nonalactone in Botrytised and Non-Botrytised New Zealand Wines

g-Nonalactone has been identified as a significant contributor to the aroma profile of a range of wines and is associated with stonefruit and coconut descriptors.

Effect of row direction in the upper part of the hillside vineyard of Somló, Hungary

Hillside vineyards have a great potential to produce world class wines. The unique microclimate lead to the production of rich, flavory wines.

Fructose implication in the Sotolon formation in fortified wines: preliminary results

Sotolon (3-hydroxy-4,5-dimethyl-2(5H)-furanone) is a naturally occurring odorant compound with a strong caramel/spice-like scent, present in many foodstuffs. Its positive contribution for the aroma of different fortified wines such as Madeira, Port and Sherry is recognized. In contrast, it is also known to be responsible for the off-flavor character of prematurely aged dry white wines. The formation mechanisms of sotolon in wine are still not well elucidated, particularly in Madeira wines, which are submitted to thermal processing during its traditional ageing. The sotolon formation in these wines has been related to sugar degradation mechanisms, particularly from fructose [1].

«Observatoire Mourvèdre»: (2) climatic mapping for successful plantation of Cv. Mourvèdre

A statistical model of sugar potential for Mourvèdre grapevine cultivar has been obtained using a group of 32 plots all around de south-east french mediterranean area.

Ampelograpic and genetic characterisation of grapevine genetic resources from Ozalj-Vivodina region (Croatia)

Ozalj- vivodina region is small vine growing area (only about 100 hectares of vineyards), but with significant number of old, ancient vineyards planted between 50 and 100 years ago. Trend of abandoning or replanting ancient vineyards takes place for the last 30 years. This trend results in grapevine germplasm erosion because traditional varieties are replaced with well known international varieties.Few known traditional varieties are dominantly present in ancient vineyards together with many others of unknown identity. Historical data about prevalence and characteristic of varieties on this area are very poor.