Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Soil quality in Beaujolais vineyard. Importance of pedology and cultural practices

A pedological study was carried out from 2009 to 2017 in Beaujolais vineyard, to improve physical and chemical knowledge of soils. It was completed in 2016 and 2017 by the current study, dealing with microbial aspects, in order to build a reference frame for improved advice in soil management. Microbial biomass was measured on representative plots of the six most common soil types identified in Beaujolais and, for each soil type, on plots with different levels of the main impacting parameters: total organic carbon, pH, cation exchange capacity, extractable copper. A total of 59 soil samples were collected. Confirming the results of various trials carried out in Beaujolais over the past 20 years, the results of the present study showed that the soils were still alive, but exhibited a large variability of biological parameters, which appeared dependant on both pedological and anthropic factors. Therefore, a good interpretation of biological parameters and advice for vine growers must rely on a pedologically-based referential with differentiated main driving factors. For example, the control of pH is of primary importance in granitic soils and in no way organic matter addition can improve soil quality if pH is too low. Conversely, in calcareous soils, biological parameters are more directly affected by direct or indirect (cover crops for example) inputs of organic matter. The use of biological parameters, such as microbial biomass, is of great potential value to improve advice on agro-viticultural practices (soil management, fertilization, liming, etc.), basis of a sustainable wine production on fragile soils.

Creativini: an augmented reality card game to promote the learning of the reasoning process of a technical management route for making wine 

Nowadays, the entire viticultural and enological process is wisely thought out according to the style of wine to be produced and the local climatic conditions. Acquiring the approach of a technical management route specific for wine production remains a complex learning process for students. To enhance such learning, The Ecole d’Ingénieurs de PURPAN (PURPAN), an engineering school located in Toulouse southwest France, has recently developed Creativini, a collaborative card game in English made of 150 cards spread into 14 batches. Students in groups of 3 to 6 must design a technical production route, from plant material to bottling.

Soil monoliths, soil variability and terroir

Aim: The aim of this work is educating people about soil variability and terroir. Soil monoliths are used to educate the wine industry about how to describe a soil profile, interpret the soil formation processes operating in a particular soil profile and consequently the impact of soil properties on vine growth, fruit quality and wine production. Soil monoliths are a permanent artistic tool for educating, research and management of soil variability.  

Implication of secondary viral infections on grafting success rated in nurseries

Grapevine grafting is a complex process that since the establishment of phylloxera has become mandatory for grapevine. Grafting success in grapevine nurseries considerably varies among years and batches with most variety/rootstock combinations reach a high success rate (between 75% and 90%), but some combinations show lower success rates of around 40-50%. The causes of this variation are unknown, although biotic stresses like those caused by some viral infections have been demonstrated to affect the process. European certification schemes for the vegetative propagation of the vine include five major viruses (Arabis mosaic virus, Grapevine Fanleaf Virus, Grapevine Fleck Virus, and Grapevine-associated Leafroll Virus 1 and 3).

Biosurfactant from corn-milling industry improves the release of phenolic compounds during red winemaking

AIM: Biosurfactants can be used as emulsifier agents to improve the taste, flavour, and quality of food-products with minimal health hazards [1]. They are surface-active compounds with antioxidant and solubilizing properties [2].