Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Downscaling of remote sensing time series: thermal zone classification approach in Gironde region

In viticulture, the challenges of local climate modelling are multiple: taking into account the local environment, fine temporal and spatial scales, reliable time series of climate data, ease of implementation and reproducibility of the method. At the local scale, recent studies have demonstrated the contribution of spatialization methods for ground-based climate observation data considering topographic factors such as altitude, slope, aspect, and geographic coordinates (Le Roux et al, 2017; De Rességuier et al, 2020). However, these studies have shown questions in terms of the reproducibility and sustainability of this type of climate study. In this context, we evaluated the potential of MODIS thermal satellite images validated with ground-based climate data (Morin et al, 2020). Previous studies have been encouraging, but questions remain to be explored at the regional scale, particularly in the dynamics of the massive use of bioclimatic indices to classify the climate of wine regions. The results at the local scale were encouraging, but this approach was tested in the current study at the regional scale. Several objectives were set: 1) to evaluate the downscaling method for land surface temperature time series, 2) to identify regional thermal structure variations. We used weekly minimum and maximum surface temperature time series acquired by MODIS satellites at a spatial resolution of 1000 m and downscaled at 500 m using topographical variables. Two types of analyses were performed:

Modulation of the tannic structure of Tannat wines through maceration techniques: cross analytical and sensory study

The Tannat grape, native to the foothills of the Pyrenees in France, is known for producing wines with intense colour, exceptional tannic structure, and remarkable aging potential. These distinctive characteristics are attributed to its unique genome, making Tannat one of the grape varieties with the highest tannins concentration.

Induction of polyphenols in seedlings of Vitis vinifera cv. Monastrell by the application of elicitors

Contamination problems arising from the use of pesticides in viticulture have raised concerns. One of the alternatives to reduce contamination is the use of elicitors, molecules capable of stimulating the natural defences of plants, promoting the production of phenolic compounds (PC) that offer protection against biotic and abiotic stress. Previous studies on Cabernet-Sauvignon seedlings demonstrated that foliar application of elicitors methyl jasmonate (MeJ) and benzothiadiazole (BTH) increased proteins and PC involved in grapevine defence mechanisms. However, no trials had been conducted on Monastrell seedlings, a major winegrape variety in Spain.

The influence of soil management practices on functional traits and biodiversity of weed communities in Swiss vineyards

Green cover in vine rows provides many ecological services, but can also negatively impact the crop, depending on the weed species. The composition of a vineyard weed community is influenced by many parameters. Ensuring an evolution of the vine row flora into a desired direction is therefore very complex. A key step towards this goal is to know which factors influence the establishment of the weed community and which types of communities are best suited for vineyards. In this study, we analysed the weed communities of several vineyards in the Lake Geneva region (379 botanical surveys on 117 plots), with the aim to highlight the links between soil management practices (chemical and mechanical weeding, mowing, mulching roll) and phytosociological profiles, biodiversity and selected functional traits (growth forms, life strategies, root depth). T

What triggers the decision to ripen 

The decision for grape berries to ripen involves a complex interplay of genetic regulation and environmental cues. This review explores the molecular mechanisms underlying the transition from vegetative growth to ripening, focusing on transcriptomic studies and the role of the NAC gene family. Transcriptomic analyses reveal a significant rearrangement of gene expression patterns during this transition, with up-regulation of ripening-related genes and down-regulation of those associated with vegetative growth. A molecular phenology scale providing a high-precision map of berry transcriptomic development, indicates that key molecular changes occur well before the onset of ripening.