Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are error-prone, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach segments chromatograms along the retention time axis, applies a multiway decomposition to the transformed segments, and then runs a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
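
The segmentation step can be illustrated with a minimal sketch. The abstract does not state how segment boundaries are chosen, so the fixed window width, the optional overlap, and the function names `segment_bounds` and `segment_runs` are assumptions made for illustration only and are not taken from [1, 2]. Each run is represented as a scans × m/z intensity matrix, and all runs are assumed to share the same scan grid.

```python
from typing import List, Tuple

# One GC-MS run as a scans x m/z-channels intensity matrix (nested lists).
Run = List[List[float]]


def segment_bounds(n_scans: int, width: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Return (start, stop) scan indices of windows along the retention time axis.

    Fixed-width windows with optional overlap are an assumption of this sketch.
    """
    if width <= 0 or not 0 <= overlap < width:
        raise ValueError("need width > 0 and 0 <= overlap < width")
    step = width - overlap
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:  # last window reached the end of the run
            break
        start += step
    return bounds


def segment_runs(runs: List[Run], width: int, overlap: int = 0) -> List[List[Run]]:
    """Cut every run at the same scan indices.

    Each returned segment is a three-way array (samples x scans x m/z),
    the shape a multiway decomposition would take as input.
    """
    n_scans = len(runs[0])
    if any(len(run) != n_scans for run in runs):
        raise ValueError("all runs must have the same number of scans")
    return [[run[a:b] for run in runs] for a, b in segment_bounds(n_scans, width, overlap)]


# Hypothetical toy data: 2 runs, 10 scans each, 3 m/z channels.
runs = [[[float(s + c) for c in range(3)] for s in range(10)] for _ in range(2)]
segments = segment_runs(runs, width=4)
```

Because every run is cut at the same scan indices, no peak has to be detected or matched across samples before the segments are stacked. Small retention time shifts within a window are left for the subsequent decomposition step to handle.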

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
