Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The chemical composition of disease resistant hybrid grape cultivars and its impact on wine quality: an exploratory enquiry into sustainable wines

Disease resistant hybrid grape cultivars are now allowed in a number of EU wine PDOs, and are also accepted in a number of countries outside the EU. There is increasing interest in diseases resistant hybrid grape cultivars (RHGCs) because they allow for the production of healthy, high quality grapes with limited use of pesticides and the associated environmental and public health

From vineyard to bottle. Rationalizing grape compositional drivers of the expression of “Amarone della Valpolicella” terroir

Valpolicella is a famous Italian wine-producing region. One of its main characteristic is the intensive use of grapes that are submitted to post-harvest withering. This is rather unique in the context of red wine, especially for the production of a dry red wine such as Amarone. Amarone wines produced in Valpolicella different geographic origin are anecdotally believed to be aromatically different, although there is no systematic study addressing the chemical bases of such diversity. Aroma is the product of a biochemical and technological series of steps, resulting from the contribution of different volatile molecules deriving from grapes, fermentations, and reactions linked to aging, as well as one of the most important features in the expression of the geographic identity and sensory uniqueness of a wine.

The use of Hanseniaspora vineae on the production of base sparkling wine

Non-Saccharomyces yeasts have been associated, for many years, with challenging alcoholic fermentation processes. However, during the last decade the use of non-Saccharomyces yeasts in wine production has become increasingly widespread due to the advantages they can offer in mixed inoculations with Saccharomyces cerevisiae (Sc). In this respect, Hanseniaspora vineae (Hv), in synergy with Saccharomyces spp, represents an interesting opportunity to impart a positive contribution to the aroma complexity of wines. In fact, it is a well-known producer of pleasant esters, such as 2-phenylethyl acetate. This study compares the performances of Hv (strain Hv-205) in sequential inoculation modality to Sc in three Chardonnay musts for base sparkling wine production. No significant differences were observed in basic chemical parameters between wines except for titratable acidity, with a significantly decrease (up to 1.5 g/L) in Hv processes due to malic acid degradation. The analysis of the aroma compounds revealed remarkable differences in concentration of volatile metabolites, among others up to 37-fold increase of 2-phenylethyl acetate. In contrast, lower concentration of its alcohol were detected, suggesting higher acetylation activity by Hv.

IDENTIFICATION AND LEVELS OF PHENOLIC COMPOUNDS (TANINS, ANTHO-CYANS) IN RED VARIETAL WINES (PROKUPAC AND BLACK TAMJANIKA) FROM SERBIA

The phenolic compounds of red wines represent a source of numerous benefits for human health, which is why they are a constant subject of scientific research. Winemaking in Serbia has a growing economic significance, with particularly autochthonous varieties included [1]. This research identifies and quantifies phenolic compounds of Serbian red varietal wines of Prokupac and Black Tamjanika varieties. Quantification of the level of phenolics has been conducted, including molecular tannins [(+)-catechin, (-)-epicatechin, procyanidin dimers B1, B2, B3, B4], molecular anthocyanins, and the mean degree of polymerization of tannins by HPLC by UV detection, total antioxidant capacity via spectrophotometric methods and chromatic characteristics via CIELAB.

Volatile compounds production during ripening of cv. “Sangiovese” grapes from different terroir

“Sangiovese” (Vitis vinifera L. sativa cv. Sangiovese) is the main grape variety to be established in Italy, being the only country in Europe where this grape is commonly found.