Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
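The segmentation step named above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `segment_len` and `overlap` parameters, and the assumption that all samples share the same number of scans are hypothetical, since the abstract does not state segment widths or whether segments overlap. Each chromatogram is treated as a list of scans (retention time) by m/z intensities, and the same window from every sample is stacked into a samples × scans × m/z array, the shape a multiway decomposition would take as input.

```python
from typing import List, Tuple

Scan = List[float]  # intensities across m/z channels for one retention-time scan


def segment_chromatogram(
    scans: List[Scan], segment_len: int, overlap: int = 0
) -> List[Tuple[int, int, List[Scan]]]:
    """Split one chromatogram into windows along the retention time axis.

    Returns (start, stop, window) tuples, where stop is exclusive.
    segment_len and overlap are hypothetical parameters, counted in scans.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    if not 0 <= overlap < segment_len:
        raise ValueError("overlap must satisfy 0 <= overlap < segment_len")
    step = segment_len - overlap
    segments = []
    start = 0
    while start < len(scans):
        stop = min(start + segment_len, len(scans))
        segments.append((start, stop, scans[start:stop]))
        if stop == len(scans):
            break  # the last window reached the end of the run
        start += step
    return segments


def build_segment_tensors(
    samples: List[List[Scan]], segment_len: int, overlap: int = 0
) -> List[List[List[Scan]]]:
    """Stack the same window from every sample.

    Returns one samples x scans x m/z nested list per segment.
    Assumes every sample has the same number of scans (an assumption
    made for this sketch only).
    """
    if not samples:
        return []
    per_sample = [segment_chromatogram(s, segment_len, overlap) for s in samples]
    n_segments = len(per_sample[0])
    if any(len(p) != n_segments for p in per_sample):
        raise ValueError("all samples must yield the same number of segments")
    return [[p[i][2] for p in per_sample] for i in range(n_segments)]
```

Each returned array would then be decomposed separately, and the resulting per-sample scores would serve as input features for the classifier. Working segment by segment is what allows the approach to skip peak detection and alignment across the full run.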

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
