Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Screening of Italian red wines for quercetin precipitation risk index

Quercetin (Q), a phenolic compound released from grape skins during red wine maceration, has been identified as a source of instability in bottled wines, particularly Sangiovese, due to crystallisation. This phenomenon represents an economic challenge for producers and affects wine clarity and consumer perception.

Inactivated yeasts: a case study for the future of precision enology

Yeasts serve as highly versatile tools in oenology. They do more than just perform alcoholic fermentation. Nowadays, yeasts from various species, naturally present in grapes, are selected for specific non-fermentative applications. For example, the use of selected non-saccharomyces at the early stage of winemaking has become a common practice to limit the growth of unwanted microorganisms. When inactivated, yeasts can be fractionated into soluble and insoluble fractions providing a wide range of benefits related to structural components or specific metabolites.

CHANGES IN CU FRACTIONS AND RIBOFLAVIN IN WHITE WINES DURING SHORT-TERM LIGHT EXPOSURE: IMPACTS OF OXYGEN AND BOTTLE COLOUR

Copper in white wine can be associated with Cu(II) organic acids (Cu fraction I), Cu(I) thiol species (Cu fraction II), and Cu sulfides (Cu fraction III). The first two fractions are associated with the repression of reductive aromas in white wine, but these fractions gradually decrease in concentration during the normal bottle aging of wine. Although exposure of white wine to fluorescent light is known to induce the accumulation of volatile sulfur compounds, causing light-struck aroma, the influence on the loss of protective Cu fractions is uncertain. Riboflavin is known to be a critical initiator of photochemical reac-tions in wine, but the rate of its decay under short-term light exposure in different coloured bottles and for wine of different oxygen concentrations is not well understood.

Guyot or pergola for dehydration of Rondinella grape

Pergola veronese is the most important vine training system in Valpolicella area but Guyot in the last decades is diffusing. Rondinella is one of the three most important varieties

Key learnings about the chemical bases of wine uniqueness and quality, essential companions for future developments

This presentation aims to demonstrate that the value attributed to wine as we today know it is based on three factors: 1) sensory balance, 2) personality, and 3) bioactivity.