Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Frost risk projections in a changing climate are highly sensitive in time and space to frost modelling approaches

Late spring frost is a major challenge for various winegrowing regions across the world, its occurrence often leading to important yield losses and/or plant failure. Despite a significant increase in minimum temperatures worldwide, the spatial and temporal evolution of spring frost risk under a warmer climate remains largely uncertain. Recent projections of spring frost risk for viticulture in Europe throughout the 21st century show that its evolution strongly depends on the model approach used to simulate budburst. Furthermore, the frost damage modelling methods used in these projections are usually not assessed through comparison to field observations and/or frost damage reports.
The present study aims at comparing frost risk projections simulated using six spring frost models based on two approaches: a) models considering a fixed damage threshold after the predicted budburst date (e.g BRIN, Smoothed-Utah, Growing Degree Days, Fenovitis) and b) models considering a dynamic frost sensitivity threshold based on the predicted grapevine winter/spring dehardening process (e.g. Ferguson model). The capability of each model to simulate an actual frost event for the Vitis vinifera cv. Chadonnay B was previously assessed by comparing simulated cold thermal stress to reports of events with frost damage in Chablis, the northernmost winegrowing region of Burgundy. Models exhibited scores of κ > 0.65 when reproducing the frost/non-frost damage years and an accuracy ranging from 0.82 to 0.90.
Spring frost risk projections throughout the 21st century were performed for all winegrowing subregions of Bourgogne-Franche-Comté under two CMIP5 concentration pathways (4.5 and 8.5) using statistically downscaled 8×8 km daily air temperature and humidity of 13 climate models. Contrasting results with region-specific spring frost risk trends were observed. Three out of five models show a decrease in the frequency of frost years across the whole study area while the other two show an increase that is more or less pronounced depending on winegrowing subregion. Our findings indicate that the lack of accuracy in grapevine budburst and dehardening models makes climate projections of spring frost risk highly uncertain for grapevine cultivation regions.

Short-term relationships between climate and grapevine trunk diseases in southern French vineyards

[lwp_divi_breadcrumbs home_text="IVES" use_before_icon="on" before_icon="||divi||400" module_id="publication-ariane" _builder_version="4.19.4" _module_preset="default" module_text_align="center" module_font_size="16px" text_orientation="center"...

Underpinning terroir with data: rethinking the zoning paradigm

Agriculture, natural resource management and the production and sale of products such as wine are increasingly data-driven activities. Thus, the use of remote and proximal crop and soil sensors to aid management decisions is becoming commonplace and ‘Agtech’ is proliferating commercially; mapping, underpinned by geographical information systems and complex methods of spatial analysis, is widely used. Likewise, the chemical and sensory analysis of wines draws on multivariate statistics; the efficient winery intake of grapes, subsequent production of wines and their delivery to markets relies on logistics; whilst the sales and marketing of wines is increasingly driven by artificial intelligence linked to the recorded purchasing behaviour of consumers. In brief, there is data everywhere!

Opinions will vary on whether these developments are a good thing. Those concerned with the ‘mystique’ of wine, or the historical aspects of terroir and its preservation, may find them confronting. In contrast, they offer an opportunity to those interested in the biophysical elements of terroir, and efforts aimed at better understanding how these impact on vineyard performance and the sensory attributes of resultant wines. At the previous Terroir Congress, we demonstrated the potential of analytical methods used at the within-vineyard scale in the development of Precision Viticulture, in contributing to a quantitative understanding of regional terroir. For this conference, we take this approach forward with examples from contrasting locations in both the northern and southern hemispheres. We show how, by focussing on the vineyards within winegrowing regions, as opposed to all of the land within those regions, we might move towards a more robust terroir zoning than one derived from a mixture of history, thematic mapping, heuristics and the whims of marketers. Aside from providing improved understanding by underpinning terroir with data, such methods should also promote improved management of the entire wine value chain.

A multidisciplinary approach to evaluate the effects of the training system on the performance of “Aglianico del Vulture” vineyards

Vineyards are complex agro-ecosystems with high spatial and temporal variability. An efficient training system may counteract the adverse effects of this variability. Moreover, considering the climate change issues, choosing an efficient training system that enhances water use and protects the vines from radiative thermal stress has become a priority for the farmers. A multidisciplinary approach that assesses the soil-crop-yield-wine relationships of vineyards in a distributed and holistic way could bring added knowledge on the behavior of the different training systems. This ongoing research aimed to implement a multidisciplinary approach to study the behavior of “Aglianico del Vulture” grapevines trained with two different systems: a spurred cordon (SC) and an “Alberello in parete” (AL), grown in a high-quality wine production area of Basilicata region (Italy). The approach merged several methods and scales of soil, ecophysiology, must/wine quality, and spectral data collection to assess the influence of the training system. Homogeneous zones (HZs) in both training systems were defined through a procedure based on geomorphological classification, unmanned aerial vehicles (UAV) images analysis, and a traditional soil survey supported by geophysical scanning. During the 2021 season, TDR probes monitored soil water content, while grapevine health status was assessed using eco-physiological measurements (LWP, chlorophyll content, PSII photosynthetic efficiency, LAI, and point-based field spectroscopy). These grapevine in-vivo measurements validated the spectral vegetation indexes (NDVI, RENDVI, CVI, and TVI) derived from the UAV multispectral imagery, which monitored the grapevine status in a distributed and non-invasive way. Grape yield, quality of berries, must and wine were measured to assess the effects of the training systems. The first experimental year results showed the variability of the vineyards and revealed relationships among soil parameters, crop characteristics, and vegetation indices of the SC and AL training systems. This multidisciplinary study could bring new insights into the vineyard training system’s effects on grape yield and wine quality.

The plantation frame as a measure of adaptation to climate change

The mechanization of vineyard work originally led to a reduction in planting densities due to the lack of machinery adapted to the vineyard. The current availability of specific machinery makes it possible to establish higher planting densities. In this work, three planting densities (1.40×0.80 m, 1.80×1 m and 2.20×1.20 m, corresponding to 8928, 5555 and 3787 plants/ha respectively) were studied with four varieties autochthonous of Galicia (northwestern Spain): Albariño and Treixadura (white), Sousón and Mencía (red). The vines were trained in a vertical shoot positioning system using a single Royat cordon, and pruned to spurs with two buds each. Agronomic data (yield, pruning wood weight, Ravaz index) and oenological data in must were collected. The higher planting density (1.40×0.80 m) had no significant effect on grape yield per vine in white varieties, although production per hectare was much higher due to the greater number of plants. In red varieties, this planting density resulted in a significantly lower production per vine, compensated by the greater number of plants. In addition, it significantly reduced the Brix degree in the must of the Albariño, Treixadura and Sousón varieties, and increased the total acidity in the latter two and Mencía. It also caused an increase in extractable and total anthocyanins and IPT in red grapes. The effects of high planting density on grapes are of great interest for the adaptation of varieties in the context of climate change. In the future, it could be advisable to modify the limits imposed by the appellations of origin on the planting density of these varieties in order to obtain more balanced wines.