Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
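
The first two steps of this strategy can be illustrated with a minimal sketch. It is not the authors' implementation: the array layout (samples × scans × m/z channels), the fixed segment width, the function names, and the synthetic data are all assumptions made for illustration, and a truncated SVD of each unfolded segment is used here as a simple stand-in for the multiway decomposition, whose model and transformation are not specified in this abstract.

```python
import numpy as np


def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z) array into equal-width retention-time windows.

    Trailing scans that do not fill a whole window are dropped (an assumption
    made to keep the sketch short).
    Returns an array of shape (segments, samples, segment_width, m/z).
    """
    n_samples, n_scans, n_mz = data.shape
    n_segments = n_scans // segment_width
    trimmed = data[:, : n_segments * segment_width, :]
    return trimmed.reshape(n_samples, n_segments, segment_width, n_mz).swapaxes(0, 1)


def segment_scores(segment, n_components):
    """Stand-in decomposition: truncated SVD of the sample-wise unfolded segment.

    Returns one row of component scores per sample.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_matrix(data, segment_width, n_components):
    """Concatenate the per-segment sample scores into one feature matrix."""
    segments = segment_chromatograms(data, segment_width)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Synthetic stand-in data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
demo = rng.random((6, 100, 20))
features = build_feature_matrix(demo, segment_width=25, n_components=2)
print(features.shape)  # (6, 8): 4 segments x 2 components per sample
```

Because each segment is decomposed on its own, no peak has to be detected or matched across samples. The resulting per-sample scores could then be passed to a gradient boosted tree classifier (for example scikit-learn's `GradientBoostingClassifier`) for the supervised step, which is omitted here.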

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions, leading to grapevine phenological shifts as well as altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening, revealing altered traits in the time needed to reach specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies for forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in the DOY to reach the specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series of hourly air temperature and rainfall recorded at a ground weather station were analyzed. Climate statistics were obtained, and weather anomalies were identified for each time period (month, bimester, quarter and year). A linear regression analysis was performed to assess possible correlations between ripening and weather anomalies. For each cultivar, ripeness advances or delays, expressed as the number of days to reach the specific ripening threshold, were assessed in relation to the registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and of the spring quarter is key to understanding the effect of climate change on sugar ripeness.
Minimum temperatures of the May-June bimester and maximum temperatures of the spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.

Variations of soil attributes in vineyards influence their reflectance spectra

Knowledge of the reflectance spectrum of soil is potentially useful, since it carries information on soil chemical composition that can be used for planning agricultural practices. Compared with analytical methods such as conventional chemical analysis, reflectance measurement provides non-destructive, economical, near real-time data. This paper reports results from reflectance measurements performed by spectroradiometry on soils from two vineyards in south Brazil. The vineyards are close to each other and lie on different geological formations, but were subjected to the same management. The objective was to detect spectral differences between the two areas and to correlate these differences with variations in their chemical composition, in order to assess the technique’s potential to predict soil attributes from reflectance data. To that end, soil samples were collected from ten selected vine parcels. Chemical analysis yielded data on the concentration of twenty-one soil attributes, and spectroradiometry was performed on the samples. Chemical differences significant at a 95% confidence level between the two studied areas were found for six soil attributes, and the average reflectance spectra were separated at this same level along most of the observed spectral domain. Correlations between soil reflectance and concentrations of soil attributes were sought, and for ten soil traits it was possible to define wavelength domains where reflectance and concentrations are correlated at confidence levels from 95% to 99%. Partial Least Squares Regression (PLSR) analyses were performed comparing measured and predicted concentrations, and for fifteen out of twenty-one soil traits we found Pearson correlation coefficients r > 0.8. These preliminary results, which have to be validated, suggest that variations of concentration in the investigated soil attributes induce differences in reflectance that can be detected by spectroradiometry.
Applications of these observations include the assessment of the chemical content of soils by spectroradiometry as a fast, low-cost alternative to chemical analytical methods.

Making sense of available information for climate change adaptation and building resilience into wine production systems across the world

Effects of climate change on viticulture systems and winemaking processes are being felt across the world. The IPCC 6th Assessment Report concluded that widespread and rapid changes have occurred, the scale of recent changes being unprecedented over many centuries to many thousands of years. These changes will continue under all emission scenarios considered, including increases in the frequency and intensity of hot extremes, heatwaves, heavy precipitation and droughts. Wine companies need tools and models that allow them to peer into the future and identify the moment for intervention and the measures for mitigation and/or avoidance. Previously, we presented conceptual guidelines for a 5-stage framework for defining adaptation strategies for wine businesses. That framework allows for direct comparison of different solutions to mitigate perceived climate change risks. Recent global climatic evolution and multiple reports of severe events since then (smoke taint, heatwaves and droughts, frost, hail and floods, rising sea levels) imply urgency in providing effective tools to tackle the multiple perceived risks. A coordinated drive towards a higher level of resilience is therefore required. Recent publications such as the Australian Wine Future Climate Atlas and results from projects such as H2020 MED-GOLD inform on expected climate change impacts on the wine sector, foreseeing the climate to expect at regional and vineyard scale in coming decades. We present examples of practical application of the Climate Change Adaptation Framework (CCAF) to impacts affecting wine production in two wine regions: Barossa (Australia) and Douro (Portugal). We demonstrate the feasibility of using the framework for climate adaptation with available data and tools to estimate historical climate-induced profitability loss, to project it into the future, and to identify critical moments when disruptions may occur if timely measures are not implemented.
Finally, we discuss adaptation measures and respective timeframes for successful mitigation of disruptive risk while enhancing resilience of wine systems.

A multidisciplinary approach to evaluate the effects of the training system on the performance of “Aglianico del Vulture” vineyards

Vineyards are complex agro-ecosystems with high spatial and temporal variability. An efficient training system may counteract the adverse effects of this variability. Moreover, considering the climate change issues, choosing an efficient training system that enhances water use and protects the vines from radiative thermal stress has become a priority for the farmers. A multidisciplinary approach that assesses the soil-crop-yield-wine relationships of vineyards in a distributed and holistic way could bring added knowledge on the behavior of the different training systems. This ongoing research aimed to implement a multidisciplinary approach to study the behavior of “Aglianico del Vulture” grapevines trained with two different systems: a spurred cordon (SC) and an “Alberello in parete” (AL), grown in a high-quality wine production area of Basilicata region (Italy). The approach merged several methods and scales of soil, ecophysiology, must/wine quality, and spectral data collection to assess the influence of the training system. Homogeneous zones (HZs) in both training systems were defined through a procedure based on geomorphological classification, unmanned aerial vehicles (UAV) images analysis, and a traditional soil survey supported by geophysical scanning. During the 2021 season, TDR probes monitored soil water content, while grapevine health status was assessed using eco-physiological measurements (LWP, chlorophyll content, PSII photosynthetic efficiency, LAI, and point-based field spectroscopy). These grapevine in-vivo measurements validated the spectral vegetation indexes (NDVI, RENDVI, CVI, and TVI) derived from the UAV multispectral imagery, which monitored the grapevine status in a distributed and non-invasive way. Grape yield, quality of berries, must and wine were measured to assess the effects of the training systems. 
The first experimental year results showed the variability of the vineyards and revealed relationships among soil parameters, crop characteristics, and vegetation indices of the SC and AL training systems. This multidisciplinary study could bring new insights into the vineyard training system’s effects on grape yield and wine quality.

Grapevine yield-gap: identification of environmental limitations by soil and climate zoning in Languedoc-Roussillon region (south of France)

Grapevine yield has historically been overlooked, on the assumption of a strong trade-off between grape yield and wine quality. At present, menaced by climate change, many vineyards in Southern France are far from the quality label threshold, making grapevine yield-gaps a major subject of concern. Although yield-gaps are well studied in arable crops, we know very little about grapevine yield-gaps. In the present study, we analysed the environmental component of grapevine yield-gaps linked to climate and soil resources in Languedoc-Roussillon. We used SAFRAN data and IGP Pays d’Oc wine yields from 2010 to 2018. We selected climate and soil indicators that proved to have a significant effect on average wine yield-gaps at the municipality scale. The most significant factor for grapevine yield was the Soil Available Water Capacity, followed by the Huglin Index and the Climatic Dryness Index. The Days of Frost, the Soil pH, and the Very Hot Days were also significant. Then, we clustered geographical zones presenting similar indicators, facilitating the identification of resource yield-gaps. We discussed the number of zones with the experts of the IGP Pays d’Oc label, obtaining 7 zones with similar limitations for grapevine yield. Finally, we analysed the main resources causing yield-gaps and the grapevine varieties planted in each zone. Mapping grapevine resource yield-gaps is the first step towards understanding grapevine yield-gaps at the regional scale.