Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Pruned vine biomass exclusion from a clay loam vineyard soil – examining the impact on physical/chemical properties

The wine industry worldwide faces increasing challenges to achieve sustainable levels of carbon emission mitigation. This project seeks to establish the feasibility of harvesting winter pruned vineyard biomass (PVB) for potential use in carbon footprint reduction, through its use as a renewable biofuel for energy production. In order to make this recommendation, technical issues such as the potential environmental impact, chemical composition and fuel suitability, and logistical challenges of harvesting biomass needs to be understood to compare with the results from similar studies. Of particular interest is the role PVB plays as a carbon source in vineyard soils and what effect annual removal might have on soil carbon sequestration. A preliminary trial was established in the Waite Campus vineyard (University of Adelaide) to test current management strategies. Vines are grown in a Eutrophic, Red Dermosol clay loam soil with well managed midrow swards. A comparison was undertaken of mid-row treatments in two 0.25 Ha blocks (Shiraz and Semillon), including annual cultivation for seed bed preparation, the deliberate exclusion of PVB (25 years) and incorporation of PVB (13 years) at an average of 3.4 and 5.5 Mg/Ha-1 for Shiraz and Semillon respectively. In both 0-10cm and 10-30cm soil core sample depths, combined soil carbon % measures in the desired range of 1.80 to 3.50, were not significantly different between treatments or cultivars and yielded an estimated 42 Mg/ha-1 of sequestered soil carbon. Other key physical and chemical measures were likewise not significantly different between treatments. Preliminary results suggest that in a temperate zone vineyard, managed such as the one used in this study, there is no long term negative impact on soil carbon sequestration through removing PVB. This implies that growers could confidently harvest PVB for use in several end fates including as a bio fuel.

Impact of climate variability and change on grape yield in Italy

Viticulture is entangled with weather and climate. Therefore, areas currently suitable for grape production can be challenged by climate change. Winegrowers in Italy already experiences the effect of climate change, especially in the form of warmer growing season, more frequent drought periods, and increased frequency of weather extremes.
The aim of this study is to investigate the impact of climate variability and change on grape yield in Italy to provide winegrowers the information needed to make their business more sustainable and resilient to climate change. We computed a specific range of bioclimatic indices, selected by the International Organisation of Vine and Wine (OIV), and correlated them to grape yield data. We have worked in collaboration with some wine consortiums in northern and central Italy, which provided grape yield data for our analysis.
Using climate variables from the E-OBS dataset we investigate how the bioclimatic indices changed in the past, and the impact of this change on grape productivity in the study areas. The climate impact on productivity is also investigated by using high-resolution convection-permitting models (CPMs – 2.2 horizontal resolution), with the purpose of estimating productivity in future emission scenarios. The CPMs are likely the best available option for this kind of impact studies since they allow a better representation of small-scale processes and features, explicitly resolve deep convection, and show an improved representation of extremes. In our study, we also compare CPMs with regional climate models (RCMs – 12 km horizontal resolution) to assess the added value of high-resolution models for impact studies. Further development of our study will lead to assessing the future suitability for vine cultivation and could lead to the construction of a statistical model for future projection of grape yield.

The concept of terroir: what place for microbiota?

Microbes play key roles on crop nutrient availability via biogeochemical cycles, rhizosphere interactions with roots as well as on plant growth and health. Recent advances in technologies, such as High Throughput Sequencing Techniques, allowed to gain deeper insight on the structure of bacterial and fungal communities associated with soil, rhizosphere and plant phyllosphere. Over the past 10 years, numerous scientific studies have been carried out on the microbial component of the vineyard. Whether the soil or grape compartments have been taken into account, many studies agree on the evidence of regional delineations of microbial communities, that may contribute to regional wine characteristics and typicity. Some authors proposed the term “microbial terroir” including “yeast terroir” for grapes to describe the connection between microbial biogeography and regional wine characteristics. Many factors are involved in terroir including climate, soil, cultivar and human practices as well as their interactions. Studies considering “microbial terroir” greatly contributed to improve our knowledge on factors that shape the vineyard microbial structure and diversity. However, the potential impact of “microbial terroir” on wine composition has yet not received strong scientific evidence and many questions remain to be addressed, related to the functional characterization of the microbial community and its impact on plant physiology and grape composition, the origins and interannual stability of vineyard microbiota, as well as their impact on wine sensorial attributes. The presentation will give an overview on the role of microbiota as a terroir component and will highlight future perspectives and challenges on this key subject for the wine industry.

Effect of multi-level and multi-scale spectral data source on vineyard state assessment

Currently, the main goal of agriculture is to promote the resilience of agricultural systems in a sustainable way through the improvement of use efficiency of farm resources, increasing crop yield and quality under climate change conditions. This last is expected to drastically modify plant growth, with possible negative effects, especially in arid and semi-arid regions of Europe on the viticultural sector. In this context, the monitoring of spatial behavior of grapevine during the growing season represents an opportunity to improve the plant management, winegrowers’ incomes, and to preserve the environmental health, but it has additional costs for the farmer. Nowadays, UAS equipped with a VIS-NIR multispectral camera (blue, green, red, red-edge, and NIR) represents a good and relatively cheap solution to assess plant status spatial information (by means of a limited set of spectral vegetation indices), representing important support in precision agriculture management during the growing season. While differences between UAS-based multispectral imagery and point-based spectroscopy are well discussed in the literature, their impact on plant status estimation by vegetation indices is not completely investigated in depth. The aim of this study was to assess the performance level of UAS-based multispectral (5 bands across 450-800nm spectral region with a spatial resolution of 5cm) imagery, reconstructed high-resolution satellite (Sentinel-2A) multispectral imagery (13 bands across 400-2500 nm with spatial resolution of <2 m) through Convolutional Neural Network (CNN) approach, and point-based field spectroscopy (collecting 600 wavelengths across 400-1000 nm spectral region with a surface footprint of 1-2 cm) in a plant status estimation application, and then, using Bayesian regularization artificial neural network for leaf chlorophyll content (LCC) and plant water status (LWP) prediction. The test site is a Greco vineyard of southern Italy, where detailed and precise records on soil and atmosphere systems, in-vivo plant monitoring of eco-physiological parameters have been conducted.

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.