Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Evolution of the amino acids content through grape ripening: Effect of foliar application of methyl jasmonate with or without urea

The parameters that determine the grape quality, and therefore the optimal harvest time, suffer variations during berry ripening, related to climate change, with the widely known problem of the gap between technological and phenolic maturities. However, there are few studies about its incidence on grape nitrogen composition. For this reason, the use of an elicitor, methyl jasmonate (MeJ), alone or with urea, is proposed as a tool to reduce climatic decoupling, allowing to establish the harvest time in order to achieve the optimum grape quality. The aim was to study the effect of MeJ and MeJ+Urea foliar applications on the evolution of Tempranillo amino acids content throughout the grape maturation. Three treatments were foliarly applied, at veraison and 7 days later: control (water), MeJ (10 mM) and MeJ+Urea (10 mM+6 kg N/ha). Grape samples were taken at five stages of maturation: day before the first and second applications, 15 days after the second application (pre-harvest), harvest day, and 15 days after harvest (post-harvest). The amino acids analysis of the samples was carried out by HPLC. Results showed that the evolution of amino acids was similar regardless of the treatment; however, foliar applications influenced the nitrogen compounds content, i.e., there was no qualitative effect but quantitative one. Most of the amino acids reached their maximum concentration in pre-harvest, being higher in grapes from the treatments than in the control. In general, no differences in grape amino acids content were observed between MeJ and MeJ+Urea treatments. Foliar applications with MeJ and MeJ+Urea enhanced the grape amino acids content, without affecting their profile, helping to optimize their quality and allowing to establish a more complete grape ripening standard. Therefore, MeJ and MeJ+Urea foliar applications can be a simple agronomic practice, which has shown promising results in order to enhance the grape quality.

Estimating bulk stomatal conductance of grapevine canopies

In response to changes in their environment, grapevines regulate transpiration using various physiological mechanisms that alter conductance of water through the soil-plant-atmosphere continuum. Expressed as bulk stomatal conductance at the canopy scale, it varies diurnally in response to changes in vapor pressure deficit and net radiation, and over the season to changes in soil water deficits and hydraulic conductivity of both soil and plant. It is necessary to characterize the response of conductance to these variables to better model how vine transpiration also responds to these variables. Furthermore, to be relevant for vineyard-scale modeling, conductance is best characterized using data collected in a vineyard setting. Applying a crop canopy energy flux model developed by Shuttleworth and Wallace, bulk stomatal conductance was estimated using measurements of individual vine sap flow, temperature and humidity within the vine canopy, and estimates of net radiation absorbed by the vine canopy. These measurements were taken on several vines in a non-irrigated vineyard in Bordeaux France, using equipment that did not interfere with ongoing vineyard operations. An inverted Penman-Monteith equation was then used to calculate bulk stomatal conductance on 15-minute intervals from July to mid-September 2020. Time-series plots show significant diurnal variation and seasonal decreases in conductance, with overall values similar to those in the literature. Global sensitivity analysis using non-parametric regression found transpiration flux and vapor pressure deficit to be the most important input variables to the calculation of bulk stomatal conductance, with absorbed net radiation and bulk boundary layer conductance being much less important. Conversely, bulk stomatal conductance was one of the most important inputs when calculating vine transpiration, further emphasizing the need for characterizing its response to environmental changes for use in vineyard water use modeling.

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Is wine terroir a valid concept under a changing climate?

The OIV[i] defines terroir as a concept referring to an area in which collective knowledge of the interactions between the physical and biological environment (soil, topography, climate, landscape characteristics and biodiversity features) and vitivinicultural practices develops, providing distinctive wine characteristics. Those are perceptible in the taste of wine, which drives consumer preference and, therefore, wine’s value in the marketplace. Geographical indications (GI) are recognized regulatory constructs formalizing and protecting the nexus between wine taste and the terroir generating it. Despite considering updates, GIs do not consider the nexus as a dynamic one and do not anticipate change, namely of climate. Being climate a fundamental feature of terroir, it strongly impacts wine characteristics, such as taste. According to IPCC[ii], many widespread, rapid and unprecedented changes of climate occurred, some being irreversible over hundreds to thousands of years. Climatic shifts and atmospheric-driven extreme events have been widely reported worldwide. Recent climatic trends are projected to strengthen in upcoming decades, whereas extremes are expected to increase in frequency and intensity, forcing wines away from GI definitions. Geographical shifts of viticultural suitability are projected, often moving into regions and countries different from current ones. Some authors propose adaptation in viticulture, winemaking and product innovation. We show evidence of climate changing wine characteristics in the Douro valley, home of 270-year-old Port GI. We discuss herein resist or adapt stances for when climate changes the nexus between terroir and wine characteristics. Using the MED-GOLD[iii] dashboard, a tool allowing for easy visual navigation of past and future climates, we demonstrate how policymakers can identify future moments, throughout the 21st century under different emission scenarios, when GI specifications will likely need updates (e.g., boundaries, varieties) to reduce climate-change impacts.

The use of rootstock as a lever in the face of climate change and dieback of vineyard

As viticulture faces challenges such as climate change or vineyard dieback, the choice of the variety and rootstock becomes more and more crucial. To study rootstock levers in the Bordeaux region, a parcel of Cabernet Sauvignon (CS) was planted with four rootstocks in 2014. Twenty repetitions of each of the following four rootstocks were set up: 101-14 MGt, Nemadex AB, 420A MGt and Gravesac. The number of bunches, yields and pruning weights of the vine shoots were measured individually on 240 vines from 2017 to 2021. Since 2020, nitrogen status assessed by assimilable nitrogen level, hydric status assessed by δ13C and berry maturity were measured on 80 samples taken from 20 repetitions of the four rootstocks. A lower yield was measured for CS grafted onto Nemadex AB due to the lower number of bunches and the lower weight of berries. The differences between the other three rootstocks are small, but CS grafted onto 420A MGt was the most productive. The CS grafted onto Nemadex AB had the lowest pruning weight while 101-14 MGt had the highest. In 2020, δ13C showed a more moderate water stress with 101-14 MGt and 420A MGt than with Nemadex AB. Surprisingly, the Gravesac was under more stress than the 101-14 MGt. The nitrogen status in the berries was better for Nemadex AB but this was perhaps due to the significantly lower weight of the berries.Rootstock 101-14 MGt attained the highest accumulation of sugars in the berries while 420A MGt allows to preserve higher acidity. The parcel is still young which may explain some of the results. These measures must therefore be continued over the next several years to fully assess the effects of these rootstocks on the development of the vines and the quality of the production under new climatic conditions.