Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account. They are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography coupled to unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach consists of segmenting the chromatograms along the retention time axis, applying a multiway decomposition to the transformed segments, and then running a supervised machine learning pipeline, based on gradient boosted tree classification, on the decomposed tensor [1, 2].
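The segmentation and decomposition steps described above can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the abstract: the fixed segment width, the use of a plain CP (PARAFAC) model fitted by alternating least squares as a simple stand-in for the multiway decomposition, the omission of the segment transformation, the function names, and the random stand-in data. The resulting sample-mode loadings form a feature matrix that could then be passed to a gradient boosted tree classifier (for example scikit-learn's GradientBoostingClassifier), which is not shown here.

```python
import numpy as np


def segment_chromatograms(data, width):
    """Split a (samples, scans, m/z) array into retention time segments.

    `width` (scans per segment) is a hypothetical parameter chosen for illustration.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + width, :] for start in range(0, n_scans, width)]


def cp_als(tensor, rank, n_iter=100, seed=0):
    """Fit a rank-`rank` CP model to a 3-way tensor by alternating least squares.

    This is a simplified stand-in for the multiway decomposition step.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_scans, n_mz = tensor.shape
    A = rng.random((n_samples, rank))  # sample-mode loadings
    B = rng.random((n_scans, rank))    # elution-profile loadings
    C = rng.random((n_mz, rank))       # mass-spectral loadings
    for _ in range(n_iter):
        # Each update is a linear least-squares solve with the other two modes fixed.
        A = np.einsum("ijk,jr,kr->ir", tensor, B, C) @ np.linalg.pinv((B.T @ B) * (C.T @ C))
        B = np.einsum("ijk,ir,kr->jr", tensor, A, C) @ np.linalg.pinv((A.T @ A) * (C.T @ C))
        C = np.einsum("ijk,ir,jr->kr", tensor, A, B) @ np.linalg.pinv((A.T @ A) * (B.T @ B))
    # Move the scale of each component into the sample mode.
    b_norm = np.linalg.norm(B, axis=0)
    c_norm = np.linalg.norm(C, axis=0)
    return A * (b_norm * c_norm), B / b_norm, C / c_norm


def build_feature_matrix(data, width, rank):
    """Decompose every segment and concatenate the sample-mode loadings."""
    blocks = [cp_als(segment, rank)[0] for segment in segment_chromatograms(data, width)]
    return np.hstack(blocks)


# Random stand-in data: 6 samples, 40 scans, 12 m/z channels (not real GC-MS data).
rng = np.random.default_rng(42)
data = rng.random((6, 40, 12))
features = build_feature_matrix(data, width=10, rank=2)
print(features.shape)  # (6, 8): 4 segments x 2 components per sample
```

Because each segment is decomposed as a whole, no individual peaks have to be detected, which is the point of the segment-wise design. A plain CP model, however, assumes identical elution profiles across samples, so a decomposition that tolerates retention time shifts would be needed to avoid alignment in practice.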

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, AgroSup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
