Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Use of a new, miniaturized, low-cost spectral sensor to estimate and map the vineyard water status from a mobile 

Optimizing the use of water and improving irrigation strategies has become increasingly important in most winegrowing countries due to the consequences of climate change, which are leading to more frequent droughts, heat waves, or alteration of precipitation patterns. Optimized irrigation scheduling can only be based on a reliable knowledge of the vineyard water status.

In this context, this work aims at the development of a novel methodology, using a contactless, miniaturized, low-cost NIR spectral tool to monitor (on-the-go) the vineyard water status variability. On-the-go spectral measurements were acquired in the vineyard using a NIR micro spectrometer, operating in the 900–1900 nm spectral range, from a ground vehicle moving at 3 km/h. Spectral measurements were collected on the northeast side of the canopy across four different dates (July 8th, 14th, 21st and August 12th) during 2021 season in a commercial vineyard (3 ha). Grapevines of Vitis vinifera L. Graciano planted on a VSP trellis were monitored at solar noon using stem water potential (Ψs) as reference indicators of plant water status. In total, 108 measurements of Ψs were taken (27 vines per date).

Calibration and prediction models were performed using Partial Least Squares (PLS) regression. The best prediction models for grapevine water status yielded a determination coefficient of cross-validation (r2cv) of 0.67 and a root mean square error of cross-validation (RMSEcv) of 0.131 MPa. This predictive model was employed to map the spatial variability of the vineyard water status and provided useful, practical information towards the implementation of appropriate irrigation strategies. The outcomes presented in this work show the great potential of this low-cost methodology to assess the vineyard stem water potential and its spatial variability in a commercial vineyard.

Evolution of the amino acids content through grape ripening: Effect of foliar application of methyl jasmonate with or without urea

The parameters that determine the grape quality, and therefore the optimal harvest time, suffer variations during berry ripening, related to climate change, with the widely known problem of the gap between technological and phenolic maturities. However, there are few studies about its incidence on grape nitrogen composition. For this reason, the use of an elicitor, methyl jasmonate (MeJ), alone or with urea, is proposed as a tool to reduce climatic decoupling, allowing to establish the harvest time in order to achieve the optimum grape quality. The aim was to study the effect of MeJ and MeJ+Urea foliar applications on the evolution of Tempranillo amino acids content throughout the grape maturation. Three treatments were foliarly applied, at veraison and 7 days later: control (water), MeJ (10 mM) and MeJ+Urea (10 mM+6 kg N/ha). Grape samples were taken at five stages of maturation: day before the first and second applications, 15 days after the second application (pre-harvest), harvest day, and 15 days after harvest (post-harvest). The amino acids analysis of the samples was carried out by HPLC. Results showed that the evolution of amino acids was similar regardless of the treatment; however, foliar applications influenced the nitrogen compounds content, i.e., there was no qualitative effect but quantitative one. Most of the amino acids reached their maximum concentration in pre-harvest, being higher in grapes from the treatments than in the control. In general, no differences in grape amino acids content were observed between MeJ and MeJ+Urea treatments. Foliar applications with MeJ and MeJ+Urea enhanced the grape amino acids content, without affecting their profile, helping to optimize their quality and allowing to establish a more complete grape ripening standard. Therefore, MeJ and MeJ+Urea foliar applications can be a simple agronomic practice, which has shown promising results in order to enhance the grape quality.

Effect of partial net shading on the temperature and radiation in the grapevine canopy, consequences on the grape quality of cv. Gros Manseng in PDO Pacherenc-du-vic-Bilh

As elsewhere, southwestern France vineyards face more recurrent summer heat waves these last years. Among the possibilities of adaptation to this climate changing parameter, the use of net shading is a technique that allow for limiting canopy exposure to radiations. In this trial, we tested net shading installed on one face of the canopy, on a north-south row-oriented plot of cv. Gros Manseng trained on VSP system in the PDO Pacherenc-du-Vic-Bilh. The purpose was to characterize the effects on the ambient canopy temperatures and radiations during the season and to observe the consequences on the composition of grapes and wines. Two sorts of net were used with two levels of obstruction (50% and 75%) of the photosynthesis active radiation (PAR). They have been installed on the west side of the canopy and compared to a netless control. Temperature and PAR sensors registered hourly data during the season. On specific summer day (hot and sunny) manual measurements took also place on bunches (temperature) and in different spots of the canopy (PAR). The results showed that, on clear days, the radiation is lowered by the shade nets respecting the supplier criteria. The effects on the ambient canopy temperature were inconstant on this plot when we observed the data from the global period of shading between fruit set and harvest. However, during hot days (>30°C), the temperature in the canopy was reduced during afternoon and the temperature of the bunch surface was reduced as well comparing to the control. A decrease of the maturity parameters of the berries, sugar and acidity, was also observed. Concerning the wine aromatic potential, no differences clearly appeared.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.

Grape must quality and mesoclimatic variability in Fruška Gora wine-growing region, Serbia

The Fruška Gora mountain is a traditional wine-growing region in Serbia situated in the Pannonian Basin. Due to such a position, the vicinity of the Danube River and the presence of concave configuration, it is suitable for grape production. This paper provides analyses of spatial variations in meteorological parameters and grape juice quality within Fruška Gora wine region over three consecutive vintages (2018-2020). The examined period can be defined as warm with cool nights during September (AVG 18,9°C; GDD 1918°C; CI 12°CF) and with the presence of mesoclimatic variability. The East part of the study area was somewhat drier and hotter compared to other parts of the region. The analyses of grape must samples (190 in total) of five cultivars (Cabernet-Sauvignon, Merlot, Chardonnay, Sauvignon blanc and Grašac (Welschriesling)) commonly grown across the region (19 sites), were performed using Fourier Transform Infrared Technology (FTIR). Among all cultivars, Sauvignon blanc was harvested first in the East area (DOY=246±5, GDD at harvest=1552±74, 22.2±0.7 °Brix), while the latest harvest was recorded for Cabernet-Sauvignon in the West (DOY=283±5, GDD at harvest=1936±187, 23.4±1.0 °Brix ). Both the red and white cultivars had higher acidity and YAN in the grape must if the vines were grown in the North and East compared to South and West areas. According to PCA analysis, Grašac showed the lowest variation in grape must chemical composition. Thus, the results confirm that Grašac is the most stable cultivar in Fruška Gora. All monitored cultivars reached technological fruit ripeness by the end of the growing season. However, it was difficult to reach full ripeness of red cultivars, mostly beacuse of uncoupling of technolocical and phenolic ripeness. Thus, Cabernet-Sauvignon had higher variations in GDD sums at harvest compared to other cultivars, which probably increased variations in grape must quality.