Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Photo-oxidative stress and light-struck defect in Corvina rosé wines: influence of yeast nutritional strategies

Light exposure is one of the major factors affecting the sensory quality of rosé wines and resulting in the light-struck fault.

Saccharomyces cerevisiae intraspecies differentiation by metabolomic signature and sensory patterns in wine

AIM: The composition and quality of wine are directly linked to microorganisms involved in the alcoholic fermentation. Several studies have been conducted on the impact of Saccharomyces cerevisiae on volatile compounds composition after fermentation. However, if different studies have dealt with combined sensory and volatiles analyses, few works have compared so far the impact of distinct yeast strains on the global metabolome of the wine.

Red wine substituted esters involved in fruity aromatic expression: an enantiomeric approach to understand their sensory impact and their pathway formation

Among red wines ethyl esters, those from short hydroxylated and branched-chain aliphatic acids constitute a family with a particular behavior and sensory importance. They have been previously discussed in the literature [1] and recent studies have established that some of them were strongly involved in of red wines’ fruity aroma [2]. As some among them have an asymmetrical carbon atom, it seemed important to separate their different enantiomers to obtain an accurate assessment of their organoleptic impact. Three chiral esters have been identified, presenting alkyl and/or hydroxyle substituants: ethyl 2-hydroxy-4-methylpentanoate, ethyl 2-methylbutanoate, and ethyl 3-hydroxybutanoate.

Evolution of oak barrels C-glucosidic ellagitannins

During oak wood contact, wine undergoes important modifications that modulate its organoleptic quality and complexity, including its aroma, structure, astringency, bitterness and color. Vescalagin and castalagin are the two main C-glucosidic ellagitannins found in oak wood used for wine aging wood but lyxose/xylose derivatives (grandinin and roburin e) and dimeric forms (roburins a,b, c and d) are also present. The presence of several hydroxyl groups in the ortho-positions at the periphery of the structure of the ellagitannin isomers allows these molecules to undergo oxidation or condensation reactions with other compounds.