Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Different soil types and relief influence the quality of Merlot grapes in a relatively small area in the Vipava Valley (Slovenia) in relation to the vine water status

Besides location and microclimatic conditions, soil plays an important role in the quality of grapes and wine. Soil properties influence…

Estimating bulk stomatal conductance of grapevine canopies

In response to changes in their environment, grapevines regulate transpiration using various physiological mechanisms that alter conductance of water through the soil-plant-atmosphere continuum. Expressed as bulk stomatal conductance at the canopy scale, it varies diurnally in response to changes in vapor pressure deficit and net radiation, and over the season to changes in soil water deficits and hydraulic conductivity of both soil and plant. It is necessary to characterize the response of conductance to these variables to better model how vine transpiration also responds to these variables. Furthermore, to be relevant for vineyard-scale modeling, conductance is best characterized using data collected in a vineyard setting. Applying a crop canopy energy flux model developed by Shuttleworth and Wallace, bulk stomatal conductance was estimated using measurements of individual vine sap flow, temperature and humidity within the vine canopy, and estimates of net radiation absorbed by the vine canopy. These measurements were taken on several vines in a non-irrigated vineyard in Bordeaux France, using equipment that did not interfere with ongoing vineyard operations. An inverted Penman-Monteith equation was then used to calculate bulk stomatal conductance on 15-minute intervals from July to mid-September 2020. Time-series plots show significant diurnal variation and seasonal decreases in conductance, with overall values similar to those in the literature. Global sensitivity analysis using non-parametric regression found transpiration flux and vapor pressure deficit to be the most important input variables to the calculation of bulk stomatal conductance, with absorbed net radiation and bulk boundary layer conductance being much less important. Conversely, bulk stomatal conductance was one of the most important inputs when calculating vine transpiration, further emphasizing the need for characterizing its response to environmental changes for use in vineyard water use modeling.

Modulation of berry composition by different vineyard management practices

High concentration of sugars in grapes and alcohol in wines is one of the consequences of climate change on viticulture production in several wine-growing regions. In order to investigate the possibilities of adaptation of vineyard management practices aimed to reduce the accumulation of sugar during the maturation phase without reducing the accumulation of anthocyanins in grapes, a study with severe shoot trimming, shoot thinning, cluster thinning and date of harvest was conducted on Merlot variety in Istria region (Croatia), under the Mediterranean climate. Four factors which may affect grape maturation and its composition at harvest were investigated in a two-years experiment; severe shoot trimming applied at veraison when >80% of berries changed colour (in comparison to untreated control), shoot thinning (0 and 30%), cluster thinning (0 and 30%), and the date of harvest (early and standard harvest dates). Shoot thinning had no significant impact on berry composition, despite the obtained reduction in yield per vine. Lower Brix in grapes were obtained with earlier harvest date and if no cluster thinning was applied, although at the same time a reduction in the concentration of anthocyanins in berries was observed in these treatments. On the other hand, if severe shoot trimming was applied when >80% of berries changed colour, a reduction of Brix was obtained without a negative impact on berry anthocyanins concentration. We conclude that in cases when undesirably high sugar concentrations at harvest are expected, severe shoot trimming at 80% veraison may effectively be used in order to obtain moderate sugar concentration in berries together with the adequate phenolic composition.

20-Year-Old data set: scion x rootstock x climate, relationships. Effects on phenology and sugar dynamics

Global warming is one of the biggest environmental, social, and economic threats. In the Douro Valley, change to the climate are expected in the coming years, namely an increase in average temperature and a decrease in annual precipitation. Since vine cultivation is extremely vulnerable and influenced by the climate, these changes are likely to have negative effects on the production and quality of wine.
Adaptation is a major challenge facing the viticulture sector where the choice of plant material plays an important role, particularly the rootstock as it is a driver for adaptation with a wide range of effects, the most important being phylloxera, nematode and salt, tolerance to drought and a complex set of interactions in the grafted plant.
In an experimental vineyard, established in the Douro Region in 1997, with four randomized blocs, with five varieties, Touriga Nacional, Tinta Barroca, Touriga Franca and Tinta Roriz, grafted in four rootstocks, Rupestris du Lot, R110, 196-17C, R99 and 1103P, data was collected consecutively over 20 years (2001-2020). Phenological observations were made two to three times a week, following established criteria, to determine the average dates of budbreak, flowering and veraison. During maturation, weekly berry samples were taken to study the dynamics of sugar accumulation, amongst other parameters. Climate data was collected from a weather station located near the vineyard parcel, with data classified through several climatic indices.
The results achieved show a very low coefficient of variations in the average date of the phenophases and an important contribution from the rootstock in the dynamic of the phenology, allowing a delay in the cycle of up to10-12 days for the different combinations. The Principal Component Analysis performed, evaluating trends in the physical-chemical parameters, highlighted the effect of the climate and rootstock on fruit quality by grape varieties.