Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Greek and Cypriot grape varieties as a sustainable solution to mitigate climate change

Aim: The aim of this report is to present evidence on the potential of Greek and Cypriot grape varieties to serve as a sustainable solution to mitigate climate change.

Methods and Results: The work provides a review of recent works involving Greek and Cypriot varieties’ performance under high temperatures and increased dryness.

Elicitors application in two maturation stages of Vitis vinifera L. cv Monastrell: changes on the skin cell walls

AIM: In a recent study, it was determined that the mid-ripening period is the most suitable for the application of methyl jasmonate (MeJ), benzothiadiazole BTH and MeJ+BTH on Monastrell grapes, to favor maximum accumulation of phenolic compounds at the time of harvest. However, the increase in the anthocyanin content of

VOLATILE, PHENOLIC AND COLORIMETRIC CHARACTERIZATION OF THREE DIFFERENT LAMBRUSCO APPELLATIONS

Lambrusco is a commercially successful sparkling red and rosé wine. With 13.06 million litres sold in 2021 was the second best-selling Italian wine after Chianti. According to National Catalogue of Vine Varieties there are thirteen Lambrusco Varieties with which to date are produced seven PDO wines. Among these, “Lambrusco Salamino di Santa Croce”, “Lambrusco Grasparossa di Castelvetro” and “Lambrusco di Sorbara” are the only ones that can be considered mono-varietal appellations, all located in Modena area. The PDOs contemplate the possibility of producing wines by secondary fermentation either in tank (Charmat method), or in bottle (Classico method). Sur lie is a third method commonly employed for Lambrusco, similar to the Classico method, from which differs for the absence of disgorgement.

“Vinhos de mesa” et oenophilie : quand les caractéristiques organoleptiques des cépages américains empêchent l’intégration des consommateurs à l’univers de l’appréciation esthétique

Au Brésil, 80 % du vignoble national et 90 % du vignoble de l’État du Rio Grande do Sul (principale région productrice de vins dans le pays) sont plantés avec des cépages issus de vitis labrusca ou de cépages hybrides (DEBASTIANI, 2015). Une partie de cette production est utilisée pour la préparation de jus de raisin et de concentrés de moût ou de pulpe de raisin. Le restant est consacré à

Innovative sparkling wines, traditional grape varieties and autochthonous yeasts: emerging trends for regional products diversification

Italy, like all the major vine-growing and wine-producing countries, has experienced a decline in wine export volumes in recent years.