Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
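
The first two steps of such a workflow (segmentation along the retention time axis and decomposition of each segment) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the transformation or the multiway model, so an uncentered truncated SVD of the sample-mode unfolding is used here as a simple stand-in for the multiway decomposition. The function names, the equal-width segmentation, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into windows along the scan axis.

    Equal-width windows are an assumption; the source does not say how
    segment boundaries are chosen.
    """
    return np.array_split(data, n_segments, axis=1)


def sample_scores(segment, n_components):
    """Return one score row per sample for a single segment.

    Stand-in for a multiway decomposition: the segment is unfolded to
    (samples, scans * m/z) and reduced with a truncated SVD.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_matrix(data, n_segments=4, n_components=2):
    """Concatenate the per-segment sample scores into one feature matrix."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([sample_scores(seg, n_components) for seg in segments])


# Synthetic stand-in for real GC-MS data: 6 samples, 40 scans, 5 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 40, 5))
features = build_feature_matrix(data, n_segments=4, n_components=2)
```

The resulting matrix has one row per sample and n_segments × n_components columns. Because each segment is decomposed across all samples at once, no peak picking or alignment step is needed in this sketch. Together with class labels, such a matrix could then be passed to a gradient boosted tree classifier, for example scikit-learn's GradientBoostingClassifier, as the supervised step.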

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldom thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using the Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Loss Rate (YLR). Additionally, we investigate whether an advanced flowering-veraison phase may help alleviate the water stress and improve yield. The process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest that wine regions with stronger flowering-veraison CWSI tend to have higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tend to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter show severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, which are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR. For those of France-Germany-Luxembourg, it may have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because of a phase shift towards a cooler growing season with reduced evaporative demand. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.

Soil quality in Beaujolais vineyard. Importance of pedology and cultural practices

A pedological study was carried out from 2009 to 2017 in the Beaujolais vineyard to improve physical and chemical knowledge of its soils. It was completed in 2016 and 2017 by the current study, dealing with microbial aspects, in order to build a reference frame for improved advice in soil management. Microbial biomass was measured on representative plots of the six most common soil types identified in Beaujolais and, for each soil type, on plots with different levels of the main impacting parameters: total organic carbon, pH, cation exchange capacity, and extractable copper. A total of 59 soil samples were collected. Confirming the results of various trials carried out in Beaujolais over the past 20 years, the results of the present study showed that the soils were still alive, but exhibited a large variability of biological parameters, which appeared dependent on both pedological and anthropic factors. Therefore, a good interpretation of biological parameters and advice for vine growers must rely on a pedologically based reference frame with differentiated main driving factors. For example, the control of pH is of primary importance in granitic soils, and organic matter addition cannot improve soil quality if the pH is too low. Conversely, in calcareous soils, biological parameters are more directly affected by direct or indirect (cover crops, for example) inputs of organic matter. The use of biological parameters, such as microbial biomass, is of great potential value to improve advice on agro-viticultural practices (soil management, fertilization, liming, etc.), the basis of sustainable wine production on fragile soils.

A multidisciplinary approach to evaluate the effects of the training system on the performance of “Aglianico del Vulture” vineyards

Vineyards are complex agro-ecosystems with high spatial and temporal variability. An efficient training system may counteract the adverse effects of this variability. Moreover, considering climate change, choosing an efficient training system that enhances water use and protects the vines from radiative thermal stress has become a priority for farmers. A multidisciplinary approach that assesses the soil-crop-yield-wine relationships of vineyards in a distributed and holistic way could bring added knowledge on the behavior of the different training systems. This ongoing research aimed to implement a multidisciplinary approach to study the behavior of “Aglianico del Vulture” grapevines trained with two different systems: a spurred cordon (SC) and an “Alberello in parete” (AL), grown in a high-quality wine production area of the Basilicata region (Italy). The approach merged several methods and scales of soil, ecophysiology, must/wine quality, and spectral data collection to assess the influence of the training system. Homogeneous zones (HZs) in both training systems were defined through a procedure based on geomorphological classification, unmanned aerial vehicle (UAV) image analysis, and a traditional soil survey supported by geophysical scanning. During the 2021 season, TDR probes monitored soil water content, while grapevine health status was assessed using eco-physiological measurements (LWP, chlorophyll content, PSII photosynthetic efficiency, LAI, and point-based field spectroscopy). These grapevine in-vivo measurements validated the spectral vegetation indices (NDVI, RENDVI, CVI, and TVI) derived from the UAV multispectral imagery, which monitored the grapevine status in a distributed and non-invasive way. Grape yield and the quality of berries, must, and wine were measured to assess the effects of the training systems.
The first experimental year results showed the variability of the vineyards and revealed relationships among soil parameters, crop characteristics, and vegetation indices of the SC and AL training systems. This multidisciplinary study could bring new insights into the vineyard training system’s effects on grape yield and wine quality.

Measurement of redox potential as a new analytical winegrowing tool

Excell laboratory has initiated the development of an analytical method based on electrochemistry to evaluate the ability of wines to undergo or resist oxidative phenomena. Electrochemistry is a powerful tool to probe reactions involving electron transfers and offers the possibility of real-time measurements. In that context, the laboratory has implemented electrochemical analysis to assess the oxidation state of different wine matrices, and also to evaluate the oxidized or reduced character of leaves and soil. Initially, our laboratory focused on the quantification of compounds involved in plant stress responses, and we were also interested in the microbiological activity of soils. These analyses were compared with measurements of redox potential (Eh) and pH, which are two fundamental variables involved in the modulation of plant metabolism. Indeed, the variation of the redox state of the plant reflects its biological activity but also its capacity to absorb nutrients. The Eh-pH conditions mainly determine the metabolic processes involved in soil and leaf, and our goal is to determine whether this combined analytical approach is sufficiently precise to detect biological changes (plant health, parasitic attack…).

Grapevine xylem embolism resistance spectrum reveals which varieties have a lower mortality risk in a future dry climate

Wine growing regions have recently faced intense and frequent droughts that have led to substantial economic losses, and the maintenance of grapevine productivity under a warmer and drier climate will rely notably on planting drought-resistant cultivars. Given that plant growth and yield depend on water transport efficiency and maintenance of photosynthesis, and thus on the preservation of vascular system integrity during drought, a better understanding of drought-related hydraulic traits that have a significant impact on physiological processes is urgently needed. We have worked towards this end by assessing vulnerability to xylem embolism in 30 commercial grapevine varieties encompassing red and white Vitis vinifera varieties, hybrid varieties characterized by polygenic resistance to powdery and downy mildew, and commonly used rootstocks. These analyses further allowed a global assessment of wine regions with respect to their varietal diversity and resulting vulnerability to stem embolism. Hybrid cultivars displayed the highest vulnerability to embolism, while rootstocks showed the greatest resistance. Significant variability also arose among Vitis vinifera varieties, with Ψ12 and Ψ50 values ranging from -0.4 to -2.7 MPa and from -1.8 to -3.4 MPa, respectively. Cabernet franc, Chardonnay and Ugni blanc featured among the most vulnerable varieties, while Pinot noir, Merlot and Cabernet Sauvignon ranked among the most resistant. In consequence, wine regions bearing a significant proportion of vulnerable varieties, such as Poitou-Charentes, France and Marlborough, New Zealand, turned out to be at greater risk under drought. These results highlight that grapevine varieties may not respond equally to warmer and drier conditions, underlining the importance of integrating hydraulic traits associated with plant drought tolerance into breeding programmes and modeling simulations of grapevine yield maintenance under severe drought.
Finally, they represent a step forward in advising the wine industry about which varieties and regions would have the lowest risk of drought-induced mortality under climate change.