Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
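
The segmentation and decomposition steps can be illustrated with a minimal Python sketch. It is not the authors' implementation: the data are synthetic, the function names (`segment_chromatograms`, `segment_scores`, `build_feature_matrix`) and parameters (`segment_width`, `n_components`) are hypothetical, and a truncated SVD of each unfolded segment is used as a simple stand-in for the multiway decomposition described in [1, 2]. It only shows how fixed retention time windows can yield a per-sample feature matrix without peak detection or alignment.

```python
import numpy as np


def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z) array into windows along the scan axis.

    The last window may be shorter than segment_width.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def segment_scores(segment, n_components):
    """Stand-in decomposition (assumption): truncated SVD of the unfolded segment.

    Each sample's (scans x m/z) slice is flattened to one row, the columns are
    mean-centered, and the leading component scores are returned per sample.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    unfolded = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_matrix(data, segment_width=50, n_components=2):
    """Concatenate the per-segment scores into one row of features per sample."""
    blocks = [segment_scores(seg, n_components)
              for seg in segment_chromatograms(data, segment_width)]
    return np.hstack(blocks)


# Synthetic stand-in data: 6 samples, 120 scans, 10 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 120, 10))
features = build_feature_matrix(data, segment_width=50, n_components=2)
print(features.shape)
```

A feature matrix of this kind, one row per sample, could then be passed to a gradient boosted tree classifier together with the class labels of the samples.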

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive browser application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
