Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Microbial consortia as a tool for sustainable vineyard management: A study on their acceptance among Veneto region’s grape-growers

Sustainability is a key focus in viticulture, where managing abiotic and biotic stress presents a major challenge.

The role of climate/soil of different zones/terroirs on grape characteristics

According to the different concern of the ‘traditional’ and the ‘new’ wine-producing Countries, a variable importance is recognized to the climate/soil and to grapevine cultivars as factors affecting the wine quality. However, the viticultural experience can state that, within each area, climate and soil plays an incontestable role in affecting grape quality, and consequently wine quality, as well as the genetic characteristics of the cultivar.

Interest in measuring the grape texture to characterise grapes from different cultivation areas – Example of Cabernet franc from the Loire Valley

A two-bite compression test was applied on Cabernet franc grapes during two harvest seasons. The evolution of the texture parameters from véraison to harvest was studied and a new mechanical ripeness notion was introduced.

Chemical activation of ABA signaling in grapevine through ABA receptor agonists

Grapevine (Vitis vinifera) and its derived products, in terms of cultivated area and economic volume, constitute the most relevant fruit crop in the world (7.5 million cultivated hectares). In the current context of climate change, the wine sector faces unprecedented challenges to satisfy a growing demand for wines of greater quality through sustainable viticulture. Global warming threatens quality wine production in Mediterranean wine regions in particular. The increase in heatwaves and drought episodes accelerate the vine phenology and alter the ripening and composition of grapes and wine. Extreme abiotic stress episodes compromise grape production and plant survival, intensifying the pressure on the use of limited resources like water. Abscisic acid (ABA) is an important hormone in the ripening of certain fruits and in plant response to abiotic stress.

Defining gene regulation and co-regulation at single cell resolution in grapevine

Conventional molecular analyses provide bulk genomic/transcriptomic data that are unable to reveal the cellular heterogeneity and to precisely define how gene networks orchestrate organ development. We will profile gene expression and identify open chromatin regions at the individual cells level, allowing to define cell-type specific regulatory elements, developmental trajectories and transcriptional networks orchestrating organ development and function. We will perform scRNA-seq and snATAC-seq on leaf/berry protoplasts and nuclei and combine them with the leaf/berry bulk tissues obtained results, where the analysis of transcripts, chromatin accessibility, histone modification and transcription factor binding sites showed that a large fraction of phenotypic variation appears to be determined by regulatory rather than coding variation and that many variants have an organ-specific effect.