Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Effect of Quercus Alba oak barrels from different forests on the polyphenolic composition of Tempranillo red wines

The species and origin used for red wine oak aging determines the physiological composition of the wood and thus the finished wines. In America, oak is grown primarily in the states of Virginia, Missouri, Kentucky, Oregon, Ohio, Minnesota, Wisconsin and California. The aim of this study was to analyze how the choice of barrels made with Quercus Alba oak from different geographic areas of the United States (Missouri, Kentucky, Ohio and Pennsylvania) influences the polyphenolic composition of Tempranillo red wines.

How the management of ph during winemaking affects acetaldehyde evolution and the formation of polymeric phenolics over the red wine aging

The aim of this study is to evaluate the role of pH on both the acetaldehyde chemistry and wine phenolics evolution over the aging period. In addition, the effect of both an early and late acidification was evaluated

Zonage viticole des surfaces potentielles dans la vallée Centrale de Tarija (Bolivie)

La présente étude de zonage viticole a été faite dans la région de la vallée Central de Tarija(VCT), dans la ville de Tarija, au Sud de la Bolivie; une région avec plus de 400 années de tradition qui présente une vitiviniculture de haute qualité. La Vallée possède une surface total de 332 milles ha.; existant des vignobles entre 1660 y 2300 m.s.n.m. et dans ce rang d’altitude il existe 91 mille ha.

Evaluation of Polarized Projective Mapping as a possible tool for attributing South African Chenin blanc dry wine styles

Multiple Factor Analysis (MFA) According to the Chenin blanc Association of South Africa, there are three recognized dry wine styles, Fresh and Fruity (FF), Rich and Ripe Unwooded (RRU), and Rich and Ripe Wooded (RRW), classically attributed with the help of sensory evaluation. One of the “rapid methods” has drawn our attention for the purpose of simplifying and making style attribution for large sample sets, evaluated during different sessions, more robust. Polarized Projective Mapping (PPM) is a hybrid of Projective Mapping (PM) and Polarised Sensory Positioning (PSP). It is a reference-based method in which poles
(references) are used for the evaluation of similarities and dissimilarities between samples.

Screening of soil yeasts with fermentative capacity from the antarctic continent for their application in the wine industry

AIM: In the last years, many wineries are increasing experimentation to produce more distinguishable beverages. In this sense, the reduction of the fermentation temperature could be a useful tool because it preserves volatile compounds and prevents wines from browning, particularly in the case of white wines.