Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Sustaining wine identity through intra-varietal diversification

With contemporary climate change, cultivated Vitis vinifera L. is at risk as climate is a critical component in defining ecologically fitted plant materiel. While winegrowers can draw on the rich diversity among grapevine varieties to limit expected impacts (Morales-Castilla et al., 2020), replacing a signature variety that has created a sense of local distinctiveness may lead to several challenges. In order to sustain wine identity in uncertain climate outcomes, the study of intra-varietal diversity is important to reflect the adaptive and evolutionary potential of current cultivated varieties. The aim of this ongoing study is to understand to what extent can intra-varietal diversity be a climate change adaptation solution. With a focus on early (Sauvignon blanc, Riesling, Grolleau, Pinot noir) to moderate late (Chenin, Petit Verdot, Cabernet franc) ripening varieties, data was collected for flowering and veraison for the various studied accessions (from conservatory plots) and clones. For these phenological growing stages, heat requirements were established using nearby weather stations (adapted from the GFV model, Parker et al., 2013) and model performances were verified. Climate change projections were then integrated to predict the future behaviour of the intra-varietal diversity. Study findings highlight the strong phenotypic diversity of studied varieties and the importance of diversification to enhance climate change resilience. While model performances may require improvements, this study is the first step towards quantifying heat requirements of different clones and how they can provide adaptation solutions for winegrowers to sustain local wine identity in a global changing climate. As genetic diversity is an ongoing process through point mutations and epigenetic adaptations, perspective work is to explore clonal data from a wide variety of geographic locations.

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Elevational range shifts of mountain vineyards: Recent dynamics in response to a warming climate

Increasing temperatures worldwide are expected to cause a change in spatial distribution of plant species along elevational gradients and there are already observable shifts to higher elevations as a consequence of climate change for many species. Not only naturally growing plants, but also agricultural cultivations are subject to the effects of climate change, as the type of cultivation and the economic viability depends largely on the prevailing climatic conditions. A shift to higher elevations therefore represents a viable adaptation strategy to climate change, as higher elevations are characterized by lower temperatures. This is especially important in the case of viticulture because a certain wine-style can only be achieved under very specific climatic conditions. Although there are several studies investigating climatic suitability within winegrowing regions or longitudinal shifts of winegrowing areas, little is known about how fast vineyards move to higher elevations, which may represent a viable strategy for winegrowers to maintain growing conditions and thus wine-style, despite the effects of climate change. We therefore investigated the change in the spatial distribution of vineyards along an elevational gradient over the past 20 years in the mountainous wine-growing region of Alto Adige (Italy). A dataset containing information about location and planting year of more than 26000 vineyard parcels and 30 varieties was used to perform this analysis. Preliminary results suggest that there has been a shift to higher elevations for vineyards in general (from formerly 700m to currently 850 m a.s.l., with extreme sites reaching 1200 m a.s.l.), but also that this development has not been uniform across different varieties and products (i.e. vitis vinifera vs hybrid varieties and still vssparkling wines). This is important for climate change adaptation as well as for rural development. Mountain areas, especially at mid to high elevations, are often characterized by severe land abandonment which can be avoided to some degree if economically viable and sustainable land management strategies are available.

Short-term relationships between climate and grapevine trunk diseases in southern French vineyards

[lwp_divi_breadcrumbs home_text="IVES" use_before_icon="on" before_icon="||divi||400" module_id="publication-ariane" _builder_version="4.19.4" _module_preset="default" module_text_align="center" module_font_size="16px" text_orientation="center"...

Amino nitrogen content in grapes: the impact of crop limitation

As an essential element for grapevine development and yield, nitrogen is also involved in the winemaking process and largely affects wine composition. Grape must amino nitrogen deficiency affects the alcoholic fermentation kinetics and alters the development of wine aroma precursors. It is therefore essential to control and optimize nitrogen use efficiency by the plant to guarantee suitable grape nitrogen composition at harvest. Understanding the impact of environmental conditions and cultural practices on the plant nitrogen metabolism would allow us to better orientate our technical choices with the objective of quality and sustainability (less inputs, higher efficiency). This trial focuses on the impact of crop limitation – that is a common practice in European viticulture – on nitrogen distribution in the plant and particularly on grape nitrogen composition. A wide gradient of crop load was set up in a homogeneous plot of Chasselas (Vitis vinifera) in the experimental vineyard of Agroscope, Switzerland. Dry weight and nitrogen dynamics were monitored in the roots, trunk, canopy and grapes, during two consecutive years, using a 15N-labeling method. Grape amino nitrogen content was assessed in both years, at veraison and at harvest. The close relationship between fruits and roots in the maintenance of plant nitrogen balance was highlighted. Interestingly, grape nitrogen concentration remained unchanged regardless of crop load to the detriment of the growth and nitrogen content of the roots. Meanwhile, the size and the nitrogen concentration of the canopy were not affected. Leaf gas exchange rates were reduced in response to lower yield conditions, reducing carbon and nitrogen assimilation and increasing intrinsic water use efficiency. The must amino nitrogen profiles could be discriminated as a function of crop load. These findings demonstrate the impact of plant balance on grape nitrogen composition and contribute to the improvement of predictive models and sustainable cultural practices in perennial crops.