Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

New highlights of polyphenols from red wine to counteract ocular degenerative diseases

More recently, studies have shown that polyphenols could also prevent or improve vision in patients with ocular diseases and especially age-related macular degeneration (AMD) which is an eye disease characterized by damage to the central part of the retina, the macula, and that affects millions of people worldwide. Despite therapeutic advances thanks to the use of anti-vascular endothelial growth factor (VEGF), many resistance mechanisms have been found to accentuate the visual deficit.

Aroma profile evaluation in whole grape juices

Table grapes (Vitis labrusca and hybrids) are widely cultivated in Brazil [1] due to the climate, their resistance to disease and the way they are consumed and commercialized, either in-natura or for processing, producing whole juices, jams and table wines.

Phenolic profiles of minor red grape cultivars autochthonous from the Spanish region of La Mancha

The phenolic profiles of little known red grape cultivars, namely Garnacho, Moribel and Tinto Fragoso, which are autochthonous from the Spanish region of La Mancha (ca. 600,000 ha of vineyards) have been studied over the consecutive seasons of years 2013 and 2014. The study was separately performed over the skins, the pulp and the seeds, and comprised the following phenolic types: anthocyanins, flavonols, hydroxycinnamic acid derivatives (HCADs), total proanthocyanidins (PAs) and their structural features. The selected grape cultivars belong to the Vine Germplasm Bank created in this region in order to preserve the great diversity of genotypes grown in La Mancha.

Outside and inside grapevine roots: arbuscular mycorrhizal fungal communities in a ‘nebbiolo’ vineyard 

In field conditions, grapevine roots are colonized by arbuscular mycorrhizal fungi (AMF). Little is known about the species composition of AMF communities associated to grapevine.

Implication of secondary viral infections on grafting success rated in nurseries

Grapevine grafting is a complex process that since the establishment of phylloxera has become mandatory for grapevine. Grafting success in grapevine nurseries considerably varies among years and batches with most variety/rootstock combinations reach a high success rate (between 75% and 90%), but some combinations show lower success rates of around 40-50%. The causes of this variation are unknown, although biotic stresses like those caused by some viral infections have been demonstrated to affect the process. European certification schemes for the vegetative propagation of the vine include five major viruses (Arabis mosaic virus, Grapevine Fanleaf Virus, Grapevine Fleck Virus, and Grapevine-associated Leafroll Virus 1 and 3).