Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
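
The first two steps of such a strategy can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the fixed window width, the rank of 2, and all function names (`segment_chromatograms`, `cp_als`, `relative_error`) are assumptions made for illustration, and a plain CP/PARAFAC model fitted by alternating least squares stands in for whichever multiway model and segment transformation the published pipeline uses. The sketch shows how a samples × scans × m/z array can be cut into retention time windows and how each window can be decomposed without peak detection or alignment.

```python
import numpy as np

def segment_chromatograms(data, width):
    """Split a (samples, scans, m/z) array into fixed-width retention time windows."""
    n_scans = data.shape[1]
    return [data[:, start:start + width, :] for start in range(0, n_scans, width)]

def khatri_rao(a, b):
    """Column-wise Kronecker product of a (I, R) and b (J, R), giving (I*J, R)."""
    rank = a.shape[1]
    return (a[:, None, :] * b[None, :, :]).reshape(-1, rank)

def cp_als(tensor, rank, n_iter=200, seed=0):
    """Fit a three-way CP model by alternating least squares (illustrative stand-in)."""
    rng = np.random.default_rng(seed)
    dims = tensor.shape
    factors = [rng.random((d, rank)) for d in dims]
    for _ in range(n_iter):
        for mode in range(3):
            a, b = [factors[m] for m in range(3) if m != mode]
            # Unfold the tensor so that the current mode indexes the rows.
            unfolded = np.moveaxis(tensor, mode, 0).reshape(dims[mode], -1)
            gram = (a.T @ a) * (b.T @ b)
            factors[mode] = unfolded @ khatri_rao(a, b) @ np.linalg.pinv(gram)
    return factors

def relative_error(tensor, factors):
    """Relative Frobenius norm of the residual between tensor and CP model."""
    approx = np.einsum("ir,jr,kr->ijk", *factors)
    return np.linalg.norm(tensor - approx) / np.linalg.norm(tensor)

# Synthetic data (assumption): 6 samples, 40 scans, 15 m/z channels, 3 compounds.
rng = np.random.default_rng(42)
n_samples, n_scans, n_mz = 6, 40, 15
scans = np.arange(n_scans)
centers = [7, 13, 30]  # two co-eluting compounds and one isolated compound
elution = np.stack(
    [np.exp(-0.5 * ((scans - c) / 2.5) ** 2) for c in centers], axis=1
)
spectra = rng.random((n_mz, len(centers)))
amounts = rng.random((n_samples, len(centers))) + 0.1
data = np.einsum("ir,jr,kr->ijk", amounts, elution, spectra)

# Step 1: segmentation along the retention time axis.
segments = segment_chromatograms(data, width=20)

# Step 2: multiway decomposition of each segment.
models = [cp_als(segment, rank=2) for segment in segments]
errors = [relative_error(segment, model) for segment, model in zip(segments, models)]

# The sample-mode loadings of all segments form one feature row per sample.
features = np.hstack([model[0] for model in models])
```

In this sketch, `features` holds one row per sample and could serve as input to a supervised classifier. The gradient boosted tree classification step described above is omitted here; scikit-learn's `GradientBoostingClassifier` would be one possible choice for it, although the abstract does not state which implementation the authors use.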

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
