Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
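The first two steps, segmentation along the retention time axis and decomposition of each segment, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the segment width, the number of components and the synthetic data are all assumptions made for illustration, and a truncated SVD of the unfolded segment is used as a simple stand-in for the multiway decomposition described in [1, 2].

```python
import numpy as np


def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z) array into windows along the scan axis.

    The last window may be shorter than segment_width.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def decompose_segment(segment, n_components):
    """Return per-sample scores for one segment.

    Assumption: a truncated SVD of the unfolded, mean-centred segment
    replaces the multiway decomposition used in the original work.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)       # samples x (scans * m/z)
    unfolded = unfolded - unfolded.mean(axis=0)     # centre each variable
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]                         # sample scores


def build_feature_matrix(data, segment_width=50, n_components=2):
    """Concatenate the per-segment scores into one samples x features matrix."""
    blocks = [decompose_segment(seg, n_components)
              for seg in segment_chromatograms(data, segment_width)]
    return np.hstack(blocks)


# Synthetic stand-in data: 6 samples, 120 scans, 30 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 120, 30))
features = build_feature_matrix(data, segment_width=50, n_components=2)
```

Because every segment is decomposed on its own, no peak has to be detected or aligned across samples beforehand. The resulting feature matrix could then be passed to a gradient boosted tree classifier, for example scikit-learn's `GradientBoostingClassifier`, together with the class labels of the samples.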

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive browser application presented here was built with the open source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
