Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Innovative strategies for reducing astringency in Mandilaria wines 

Mandilaria, a red grape variety indigenous to the Aegean islands, is well known for its robust tannins and pronounced astringency, which can challenge the palatability and marketability of its wines. The aim of this study was the reduction of astringency in wines made exclusively from mandilaria grapes through dehydrations practices and targeted winery applications.

Crop water stress index as a tool to estimate vine water status

Crop Water Stress Index (CWSI) has long been a ratio to quantify relative plant water status in several crop and woody plants. Given its rather well relationship to either leaf or stem water potential and the feasibility to sample big vineyard areas as well as to collect quite a huge quantity of data with airborne cameras and image processing applications, it is being studied as a tool for irrigation monitoring in commercial vineyards. The objective of this paper was to know if CWSI estimated by measuring leaf temperature with an infrared hand held camera could be used to substitute the measure of stem water potential (SWP) without losing accuracy of plant water status measure.

AROMATIC AND FERMENTATIVE PERFORMANCES OF HANSENIASPORA VINEAE IN DIFFERENT SEQUENTIAL INOCULATION PROTOCOLS WITH SACCHAROMYCES CEREVISIAE FOR WHITE WINEMAKING

Hanseniaspora vineae (Hv) is a fermenting non-Saccharomyces yeast that compared to Saccharomyces cerevisiae (Sc) present some peculiar features on its metabolism that make it attractive for its use in wine production. Among them, it has been reported a faster yeast lysis and release of polysaccharides, as well as increased ß-glucosidase activity. Hv also produces distinctive aroma compounds, including elevated levels of fermentative compounds such as ß-phenylethyl acetate and norisoprenoids like safranal. However, it is known for its high nutritional requirements, resulting in prolonged and sluggish fermentations, even when complemented with Sc strain and nutrients.

Crown procyanidin: a new procyanidin sub-family with unusual cyclic skeleton in wine

Condensed tannins (also called proanthocyanidins) are a widely distributed throughout in plants kingdom and are one of the most important classes of secondary metabolites, in addition, they are part of the human diet. In wine, they are extracted during the winemaking process from grape skins and seeds. These compounds play an important role in red wine organoleptic characteristics such as color, bitterness and astringency. Condensed tannins in red wine are oligomers and polymers of flavan-3-ols unit such as catechin, epicatechin, epigallocatechin and epicatechin-3-O-gallate. The monomeric units can be linked among them with direct interflavanoid linkage or mediated by aldehydes.

Impact of defoliation on leaf and berry compounds of Vitis vinifera L. Cv. Riesling investigated using non-destructive methods)

Climate change has a strong impact on the earlier onset of important phenological stages and plant development in viticulture.