Fully automated non-targeted GC-MS data analysis


Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach segments the chromatograms along the retention time axis, applies multiway decomposition to the transformed segments, and then runs a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
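The segmentation and decomposition steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the array layout (samples × scans × m/z channels), the function names, the segment width, and the number of components are all assumptions made for illustration, and a truncated SVD of the unfolded segment is swapped in as a simple stand-in for the multiway decomposition. The data are synthetic random numbers.

```python
import numpy as np


def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z) array into windows along the scan
    (retention time) axis. The last window may be shorter."""
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def segment_scores(segment, n_components):
    """Stand-in for a multiway decomposition (assumption): unfold the
    segment to (samples, scans * m/z) and return truncated SVD scores."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    # One row of scores per sample, one column per retained component.
    return u[:, :k] * s[:k]


def build_feature_matrix(data, segment_width=50, n_components=2):
    """Concatenate the per-segment sample scores into one feature matrix.
    No peak detection or retention time alignment is performed."""
    segments = segment_chromatograms(data, segment_width)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Synthetic example: 6 samples, 120 scans, 30 m/z channels (hypothetical sizes).
rng = np.random.default_rng(0)
demo = rng.random((6, 120, 30))
features = build_feature_matrix(demo, segment_width=50, n_components=2)
```

The resulting feature matrix has one row per sample and could then be passed, together with class labels, to a gradient boosted tree classifier such as scikit-learn's `GradientBoostingClassifier`. That step is omitted here to keep the sketch dependent on NumPy only.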

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005


Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article


Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern



metabolomics, non-targeted, GC-MS, exploratory data analysis 


IVES Conference Series | OENO IVAS 2019

