Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
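The three steps named in this paragraph (segmentation, decomposition, gradient boosted classification) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The data are synthetic and the function names (`segment_bounds`, `segment_scores`, `build_features`), the segment length and the number of components are hypothetical. A truncated SVD of the sample-mode unfolding stands in for the multiway decomposition, and scikit-learn's `GradientBoostingClassifier` is assumed as the classifier.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier


def segment_bounds(n_scans, seg_len):
    """Return (start, stop) scan-index pairs covering the retention time axis."""
    return [(s, min(s + seg_len, n_scans)) for s in range(0, n_scans, seg_len)]


def segment_scores(segment, n_components):
    """Sample scores for one segment tensor (samples x scans x m/z).

    Assumption: a truncated SVD of the sample-mode unfolding is used here
    as a simple stand-in for a true multiway decomposition.
    """
    unfolded = segment.reshape(segment.shape[0], -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_features(data, seg_len, n_components):
    """Concatenate per-segment sample scores into one feature matrix."""
    blocks = [
        segment_scores(data[:, start:stop, :], n_components)
        for start, stop in segment_bounds(data.shape[1], seg_len)
    ]
    return np.hstack(blocks)


# Synthetic data (hypothetical): 20 samples, 60 scans, 8 m/z channels.
rng = np.random.default_rng(0)
n_samples, n_scans, n_mz = 20, 60, 8
data = rng.random((n_samples, n_scans, n_mz))
labels = np.array([0, 1] * (n_samples // 2))
# Class 1 receives an extra peak in scans 20-29 on m/z channel 3.
data[labels == 1, 20:30, 3:4] += 5.0

# No peak picking or alignment: each fixed segment is decomposed directly.
features = build_features(data, seg_len=15, n_components=2)

clf = GradientBoostingClassifier(n_estimators=50, random_state=0)
clf.fit(features, labels)
train_accuracy = clf.score(features, labels)
```

Because every sample is cut at the same fixed scan boundaries, no individual peaks have to be detected or matched across samples. Only the per-segment scores reach the classifier. In practice, accuracy would be estimated on held-out samples rather than on the training set as done here.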

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
