Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
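The three steps named above can be illustrated with a minimal sketch. This is not the published implementation. It assumes the data are already arranged as a samples × scans × m/z array, uses fixed-length windows for the segmentation, and replaces the multiway decomposition with a much simpler stand-in: a truncated SVD of each segment unfolded along the sample mode. Unlike a true multiway model, this stand-in does not tolerate retention time shifts, so the simulated peak is kept at a fixed position. All function names, the simulated data, and the parameter values are hypothetical; only NumPy and scikit-learn calls are used.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score


def segment_chromatograms(data, segment_length):
    """Split a (samples, scans, m/z) array into windows along the scan axis."""
    n_scans = data.shape[1]
    return [data[:, start:start + segment_length, :]
            for start in range(0, n_scans, segment_length)]


def sample_scores(segment, n_components):
    """Stand-in decomposition (assumption): truncated SVD of the
    sample-mode unfolding, returning one score vector per sample."""
    unfolded = segment.reshape(segment.shape[0], -1)
    unfolded = unfolded - unfolded.mean(axis=0)  # center each variable
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_features(data, segment_length=50, n_components=2):
    """Concatenate the per-segment sample scores into one feature table."""
    segments = segment_chromatograms(data, segment_length)
    return np.hstack([sample_scores(seg, n_components) for seg in segments])


def simulate(n_samples=40, n_scans=200, n_mz=30, seed=0):
    """Hypothetical data: one peak whose amount differs between two classes."""
    rng = np.random.default_rng(seed)
    labels = np.repeat([0, 1], n_samples // 2)
    scans = np.arange(n_scans)
    peak = np.exp(-0.5 * ((scans - 120) / 4.0) ** 2)  # elution profile
    spectrum = rng.random(n_mz)                       # mass spectrum
    data = rng.normal(0.0, 0.05, size=(n_samples, n_scans, n_mz))
    amounts = 1.0 + 2.0 * labels                      # class 1: three-fold peak
    data += amounts[:, None, None] * peak[None, :, None] * spectrum[None, None, :]
    return data, labels


data, labels = simulate()
features = build_features(data)

# Gradient boosted trees on the decomposed segments, scored by 5-fold CV.
model = GradientBoostingClassifier(random_state=0)
accuracy = cross_val_score(model, features, labels, cv=5).mean()
print(features.shape, round(accuracy, 2))
```

Because each segment is reduced to a few sample scores, no peak has to be detected or matched across samples, and the classifier works directly on the concatenated score table. In this sketch the decomposition is fitted on all samples before cross-validation; a stricter evaluation would fit it inside each training fold.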

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
