Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
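The first two steps of such a workflow can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation from [1, 2]: the function names, the fixed segment length, the synthetic data, and the array layout (samples × scans × m/z channels) are all hypothetical. A truncated SVD of the unfolded segment is used only as a simple stand-in for the multiway decomposition. Unlike a proper multiway model, this stand-in is not tolerant of retention time shifts.

```python
import numpy as np


def segment_chromatograms(data, segment_length):
    """Split a (samples, scans, m/z) array into windows along the scan axis.

    The last window may be shorter than segment_length.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_length, :]
            for start in range(0, n_scans, segment_length)]


def segment_scores(segment, n_components):
    """Return per-sample scores for one segment.

    Stand-in for multiway decomposition (assumption): each sample's
    scans x m/z slab is unfolded into a row, the rows are mean-centred,
    and a truncated SVD yields n_components scores per sample.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    unfolded = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]


def build_feature_table(data, segment_length, n_components):
    """Concatenate the scores of all segments into one samples x features table."""
    blocks = [segment_scores(seg, n_components)
              for seg in segment_chromatograms(data, segment_length)]
    return np.hstack(blocks)


# Synthetic example: 6 samples, 100 scans, 20 m/z channels (random values).
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))
segments = segment_chromatograms(data, segment_length=30)
features = build_feature_table(data, segment_length=30, n_components=2)
```

The resulting table has one row per sample and could then be passed, together with class labels, to a gradient boosted tree classifier such as scikit-learn's `GradientBoostingClassifier`. That choice of library is again an assumption made for illustration.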

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
