Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high-resolution mass spectrometry such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
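
The segmentation and decomposition steps can be illustrated with a minimal sketch. It is not the authors' implementation: the array layout (samples × scans × m/z channels), the window width, the function names, and the random data are all assumptions made for illustration, and a truncated SVD of each unfolded segment is swapped in as a simple stand-in for the multiway decomposition. The resulting per-sample scores are the kind of feature matrix that a gradient boosted tree classifier could then be trained on.

```python
import numpy as np

def segment_tensor(data, width):
    """Split a (samples, scans, m/z) array into windows along the scan axis.

    The last window may be shorter if the scan count is not a multiple of width.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + width, :] for start in range(0, n_scans, width)]

def segment_scores(segment, n_components):
    """Per-sample scores for one segment.

    Assumption: a truncated SVD of the sample-wise unfolded, mean-centered
    segment is used here as a simple stand-in for a multiway decomposition.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)      # samples x (scans * m/z)
    unfolded = unfolded - unfolded.mean(axis=0)    # center each variable
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]

def build_feature_matrix(data, width, n_components):
    """Concatenate the scores of all segments into one samples x features matrix."""
    blocks = [segment_scores(seg, n_components) for seg in segment_tensor(data, width)]
    return np.hstack(blocks)

# Synthetic stand-in data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))
features = build_feature_matrix(data, width=25, n_components=2)
print(features.shape)
```

Because each segment is decomposed on its own, no peak has to be detected or matched across samples; small retention time shifts only need to stay within a window.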

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive browser application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

Elevational range shifts of mountain vineyards: Recent dynamics in response to a warming climate

Increasing temperatures worldwide are expected to cause a change in the spatial distribution of plant species along elevational gradients, and shifts to higher elevations as a consequence of climate change are already observable for many species. Not only naturally growing plants but also agricultural crops are subject to the effects of climate change, as the type of cultivation and its economic viability depend largely on the prevailing climatic conditions. A shift to higher elevations therefore represents a viable adaptation strategy to climate change, as higher elevations are characterized by lower temperatures. This is especially important in the case of viticulture because a certain wine-style can only be achieved under very specific climatic conditions. Although there are several studies investigating climatic suitability within winegrowing regions or longitudinal shifts of winegrowing areas, little is known about how fast vineyards move to higher elevations, which may represent a viable strategy for winegrowers to maintain growing conditions and thus wine-style, despite the effects of climate change. We therefore investigated the change in the spatial distribution of vineyards along an elevational gradient over the past 20 years in the mountainous wine-growing region of Alto Adige (Italy). A dataset containing information about the location and planting year of more than 26000 vineyard parcels and 30 varieties was used to perform this analysis. Preliminary results suggest that there has been a shift to higher elevations for vineyards in general (from formerly 700 m to currently 850 m a.s.l., with extreme sites reaching 1200 m a.s.l.), but also that this development has not been uniform across different varieties and products (i.e. Vitis vinifera vs. hybrid varieties and still vs. sparkling wines). This is important for climate change adaptation as well as for rural development.
Mountain areas, especially at mid to high elevations, are often characterized by severe land abandonment which can be avoided to some degree if economically viable and sustainable land management strategies are available.

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions, leading to grapevine phenological shifts as well as altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening, revealing altered traits in the time needed to reach specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies for forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in the DOY to reach the specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series of hourly air temperature and rainfall data monitored at a ground weather station were analyzed. Climate statistics were obtained, and for each time period (month, bimester, quarter and year) weather anomalies were identified. A linear regression analysis was performed to assess possible correlations between ripening and weather anomalies. For each cultivar, ripeness advances or delays, expressed as the number of days to reach the specific ripening threshold, were assessed in relation to registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and of the spring quarter is key to understanding the effect of climate change on sugar ripeness.
Minimum temperatures of May-June bimester and maximum temperatures of spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.
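
The anomaly and regression steps described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the numbers are made up (they are not data from the study), the variable names are hypothetical, and an anomaly is taken here simply as the deviation from the mean of the series.

```python
import numpy as np

# Hypothetical values for five vintages (illustrative only, not study data).
spring_precip_mm = np.array([60.0, 70.0, 80.0, 90.0, 100.0])
doy_sugar_target = np.array([236.0, 238.0, 240.0, 242.0, 244.0])

# Anomaly: deviation of each vintage from the mean of the series.
precip_anomaly = spring_precip_mm - spring_precip_mm.mean()

# Ripening advance (negative) or delay (positive) in days relative to the average.
doy_shift = doy_sugar_target - doy_sugar_target.mean()

# Ordinary least-squares line: doy_shift = slope * precip_anomaly + intercept.
slope, intercept = np.polyfit(precip_anomaly, doy_shift, 1)

# Pearson correlation between the weather anomaly and the ripening shift.
r = np.corrcoef(precip_anomaly, doy_shift)[0, 1]

print(f"slope = {slope:.2f} days per mm, r = {r:.2f}")
```

In this toy series the slope is the number of days of ripening delay associated with each additional millimetre of spring precipitation; with real data the fit would be repeated per cultivar, ripeness parameter and reference period.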

Green berries on Gewürztraminer (Vitis vinifera L.) in South Tyrol (Italy)

The grape variety Gewürztraminer is known to be affected by two physiological disorders, namely berry shrivel and bunch stem necrosis. During the 2014 season we noticed a new type of ripening disorder on the variety. With the new symptom, not all berries followed the normal maturation stages; single berries remained at a soft but green stage until harvest. The broad distribution of these so-called “green berries” symptoms across different production sites of our region caused huge damage because of the difficulty of eliminating single berries per bunch before harvesting. Therefore, the Research Centre Laimburg began to investigate the reasons and origins of this new symptom. This work shows the results of first attempts to find causes for the symptom as well as the resulting approach to mitigate symptoms. Applications of magnesium leaf fertilizer showed first promising results against this putative disorder. To study the cause of the green berries, 30 vineyards that were symptomatic in 2014 were selected for monitoring during the 2016 season. To evaluate the foliar nutrient treatment, two vineyards were selected for application of magnesium sulfate and magnesium chloride. Leaf and berry nutrient analyses, as well as analyses of the main quality parameters during ripening, were performed. As soon as “green berries” symptoms appeared, incidence and severity were evaluated. Most of the symptomatic vineyards of the 2016 monitoring showed light to clear magnesium deficiency symptoms on their foliage. Only during the 2020 and 2021 seasons could “green berries” symptoms be found in the leaf fertilizer treatment vineyards. In both seasons the magnesium treatments significantly reduced the incidence and severity of the symptom. It seems that the appearance of the “green berries” symptom on Gewürztraminer is correlated with a disturbed uptake of magnesium by the vines.

Ecophysiological performance of Vitis rootstocks under water stress

The use of rootstocks tolerant to soil water deficit is an interesting strategy to cope with limited water availability. Currently, several nurseries are breeding new genotypes, but the physiological basis of their responses under water stress is largely unknown. To this end, an ecophysiological assessment of the conventional 110-Richter (110R) and SO4, and the new M1 and M4 rootstocks was carried out in potted ungrafted plants. During one season, these Vitis genotypes were grown under greenhouse conditions and subjected to two water regimes, well-watered and water deficit. Water potentials of plants under water deficit down to < -1.4 MPa, and net photosynthesis (AN) <5 μmol m-2 s-1 did not cause leaf oxidative stress damage compared to well-watered conditions in any of the genotypes. The antioxidant capacity was sufficient to neutralize the mild oxidative stress suffered. Under both treatments, gravimetric differences in daily water use were observed among genotypes, leading to differences in the biomass of root, shoot and leaf. Under well-watered conditions, SO4 and 110R were the most vigorous and M1 and M4 the least. However, under water stress, SO4 exhibited the greatest reduction in biomass while M4 showed the lowest. Remarkably, under these conditions, SO4 reached the least negative stem water potential (Ψstem), while M1 reduced stomatal conductance (gs) and AN the most. In addition, SO4 and M1 genotypes also showed the highest and lowest hydraulic conductance values, respectively. Our results suggest that there are differences in water use regulation among genotypes, not only attributed to differences in stomatal regulation or intrinsic water use efficiency at the leaf level. Therefore, because no differences in canopy-to-root ratio were observed, it is hypothesized that xylem vessel anatomical differences may be driving the reported differences in performance among rootstocks.
Results demonstrate that each Vitis rootstock differs in its ecophysiological responses under water stress.

Assessing the relationship between cordon strangulation, dieback, and fungal trunk disease symptom expression

Grapevine trunk diseases including Eutypa dieback are a major factor in the decline of vineyards and may lead to loss of productivity, reduced income, and premature reworking or replanting. Several studies have yielded results indicating that vines may be more likely to express symptoms of vascular disease if their health is already compromised by stress. In Australia and many other wine-growing regions it is a common practice for canes to be wrapped tightly around the cordon wire during the establishment of permanent cordon arms. It is likely that this practice may have a negative effect on health and longevity, as older cordons that have been trained in this manner often display signs of decay and dieback, with the wire often visibly embedded within the wood of the cordon. It is possible that adopting a training method which avoids constriction of the vasculature of the cordon may help to limit the onset of vascular disease symptom expression. A survey was conducted during the spring of two consecutive growing seasons on vineyards in South Australia displaying symptoms of Eutypa lata infection when symptomless shoots were 50–100 cm long. Vines were assessed as follows: (i) the proportion of cordon exhibiting dieback was rated using a 0–100% scale; (ii) the proportion of canopy exhibiting foliar symptoms of Eutypa dieback was rated using a 0–100% scale; (iii) the severity of strangulation was rated using a 0–4 point scale. Images were also taken of each vine for the purpose of measuring plant area index (PAI) using the VitiCanopy App. The goal of the survey was to determine if and to what extent any correlation exists between severity of strangulation and cordon dieback, in addition to Eutypa dieback foliar symptom expression.