Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

1H-NMR-based Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

The aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through untargeted and targeted 1H-NMR metabolomics. One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz. The spectra were recorded by applying the NOESYGPPS1D pulse sequency, to achieve water and ethanol signals suppression. No modification of the pH was performed to avoid any chemical alteration of the matrix. The generation of input variables for untargeted analysis was done via bucketing the spectra. The resulting dataset was preprocessed prior to perform unsupervised PCA, by means of MetaboAnalyst web-based tool suite. The identification of compounds for the targeted analysis was performed by comparison to pure compounds spectra by means of SMA plug-in of MNova 14.2.3 software. The dataset containing the concentrations (%) of identified compounds was subjected to one-way analysis of variance (ANOVA) to highlight significant differences among the wines. The untargeted analysis, carried out through the PCA, revealed a clear differentiation among the wines. The fragments of the spectra contributing mostly to the separation were attributed to flavonoids, aroma compounds and amino acids. The targeted analysis leaded to the identification of 68 compounds, whose concentrations were significant different among the wines. The results were related to soils physical-chemical analysis and showed that: 1) high concentrations of flavan-3-ols and flavonols are correlated with high clay content in soils; 2) high concentrations of anthocyanins, amino acids, and aroma compounds are correlated with neutral and moderately alkaline soil pH; 3) low concentrations of flavonoids and aroma compounds are correlated with high soil organic matter content and acidic pH. The 1H-NMR metabolomic analysis proved to be an excellent tool to discriminate between wines originating from grapes grown on different soil types and revealed that soils in the Mediterranean area exert a strong impact on the chemical composition of the wines.

Better understand the soil wet bulb formation with subsurface or aerial drip irrigation in viticulture

The gradual change in rainfall patterns experienced in the south of France vineyards, especially around the Mediterranean sea, means that the vines are increasingly subject to summer drought. The winegrowers developped the use of irrigation techniques to ensure the maintenance of competitive yields in the production of wines under Protected Geographical Indication label. In practice, drip irrigation pipes can be installed above the ground or buried into the soil as well as at different distances from the vine row. The objective of this study was to examine the profiles of the wet bulbs of the soil obtained from two drip irrigation systems : aerial drip located under the vine row and subsurface drip placed in the middle of the inter-row. This experiment took place over two consecutive seasons (2020-2021) on a 3.4 ha Viognier plot in the Mediterranean region (PGI Oc, France) on sandy clay soil. The annual rainfalls were less than 400 mm. Soil water content probes were installed at different depths (20 – 40 – 60 – 80 cm) and at different lateralities from the vine row (30 – 60 – 90 – 120 cm) to control the formation of the soil wet bulb during irrigation. The mapping and the analysis of the data allowed a better understanding and differentiation of the water percolation when irrigating with subsurface or aerial drip. For the same amount of water and without differences of vine water status, it is shown that in a subsurface drip irrigation situation, the size of the wet bulb formed is larger than in aerial drip irrigation system.

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Heatwaves and grapevine yield in the Douro region, crop model simulations

Heatwaves or extreme heat events can be particularly harmful to agriculture. Grapevines grown in the Douro winemaking region are particularly exposed to this threat, due to the specificities of the already warm and dry climatic conditions. Furthermore, climate change simulations point to an increase in the frequency of occurrence of these extreme heat events, therefore posing a major challenge to winegrowers in the Mediterranean type climates. The current study focuses on the application of the STICS crop model to assess the potential impacts of heatwaves in grapevine yields over the Douro valley winemaking region. For this purpose, STICS was applied to grapevines using high-resolution weather, soil and terrain datasets over the Douro. To assess the impact of heatwaves, the weather dataset (1989-2005) was artificially modified, generating periods with anomalously high temperatures (+5 ºC), at certain onset dates and with specific durations (from 5 to 9 days). The model was run with this modified weather dataset and results were compared to the original unmodified runs. The results show that heatwaves can have a very strong impact on grapevine yields, strongly depending on the onset dates and duration of the heatwaves. The highest negative impacts may result in a decrease in the yield by up to -35% in some regions. Despite some uncertainties inherent to the current modelling assessment, the present study highlights the negative impacts of heatwaves on viticultural yields in the Douro region, which is critical information for stakeholders within the winemaking sector for planning suitable adaptation measures.

Traditional agroforestry vineyards, sources of inspiration for the agroecological transition of viticulture

A unique “terroir” can be found in southern Bolivia, which combines the specific features of climate, topography and altitude of high valleys, with the management of grapevines staked on trees. It is one of the rare remnants of agroforestry viticulture. A survey was carried out among 29 grapegrowers in three valleys, to characterize the structure and management of these vineyards, and identify the services they expect from trees. Farms were small (2.2 ha on average) and 85% of vineyards were less than 1 ha. Viticulture was associated with vegetable, fruit and fodder production, sometimes in the same fields. Molle trees were found in all plots, together with one or two other native tree species. Traditional grapevine varieties such as Negra Criolla, Moscatel de Alejandría and Vicchoqueña were grown with a large range of densities from 1550 to 9500 vines ha-1. From 18 to 30% of them were staked on trees, with 1.2 to 4.9 vines per tree. The management of these vineyards (irrigation, fertilization and grapevine protection) was described, the most particular technical operation being the coordinated pruning of trees and grapevines. Three types of management could be identified in the three valleys. Grapegrowers had a clear idea of the ecosystem services they expected from trees in their vineyards. The main one was protection against climate hazards (hail, frost, flood). Then they expected benefits in terms of pest and disease control, improvement of soil fertility and resulting yield. At last, some producers claimed that tree-staking was quicker and cheaper than conventional trellising. It can be hypothesized then that agroforestry is a promising technique for the agroecological transition of viticulture. Its contribution to the “terroir” of the high valleys of southern Bolivia and its link with the specificities of the wines and spirits produced there remain to be explored.