Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Is wine terroir a valid concept under a changing climate?

The OIV[i] defines terroir as a concept referring to an area in which collective knowledge of the interactions between the physical and biological environment (soil, topography, climate, landscape characteristics and biodiversity features) and vitivinicultural practices develops, providing distinctive wine characteristics. Those are perceptible in the taste of wine, which drives consumer preference and, therefore, wine’s value in the marketplace. Geographical indications (GI) are recognized regulatory constructs formalizing and protecting the nexus between wine taste and the terroir generating it. Despite considering updates, GIs do not consider the nexus as a dynamic one and do not anticipate change, namely of climate. Being climate a fundamental feature of terroir, it strongly impacts wine characteristics, such as taste. According to IPCC[ii], many widespread, rapid and unprecedented changes of climate occurred, some being irreversible over hundreds to thousands of years. Climatic shifts and atmospheric-driven extreme events have been widely reported worldwide. Recent climatic trends are projected to strengthen in upcoming decades, whereas extremes are expected to increase in frequency and intensity, forcing wines away from GI definitions. Geographical shifts of viticultural suitability are projected, often moving into regions and countries different from current ones. Some authors propose adaptation in viticulture, winemaking and product innovation. We show evidence of climate changing wine characteristics in the Douro valley, home of 270-year-old Port GI. We discuss herein resist or adapt stances for when climate changes the nexus between terroir and wine characteristics. Using the MED-GOLD[iii] dashboard, a tool allowing for easy visual navigation of past and future climates, we demonstrate how policymakers can identify future moments, throughout the 21st century under different emission scenarios, when GI specifications will likely need updates (e.g., boundaries, varieties) to reduce climate-change impacts.

Adaptation to soil and climate through the choice of plant material

Choosing the rootstock, the scion variety and the training system best suited to the local soil and climate are the key elements for an economically sustainable production of wine. The choice of the rootstock/scion variety best adapted to the characteristics of the soil is essential but, by changing climatic conditions, ongoing climate change disrupts the fine-tuned local equilibrium. Higher temperatures induce shifts in developmental stages, with on the one hand increasing fears of spring frost damages and, on the other hand, ripening during the warmest periods in summer. Expected higher water demand and longer and more frequent drought events are also major concerns. The genetic control of the phenotypes, by genomic information but also by the epigenetic control of gene expression, offers a lot of opportunities for adapting the plant material to the future. For complex traits, genomic selection is also a promising method for predicting phenotypes. However, ecophysiological modelling is necessary to better anticipate the phenotypes in unexplored climatic conditions Genetic approaches applied on parameters of ecophysiological models rather than raw observed data are more than ever the basis for finding, or building, the ideal varieties of the future.

De novo Vitis champinii whole genome assembly allows rootstock-specific identification of potential candidate genes for drought and salt tolerance

Vitis champinii cultivars Ramsey and Dog-ridge are main choices for rootstocks to adapt viticulture in semi-arid and arid regions thanks to their distinctive tolerance to drought and salinity. However, genetic studies on non-vinifera rootstocks have heavily relied on the grapevine (Vitis vinifera) reference genome, which difficulted the assessment of the genetic variation between rootstock species and grapevines. In the present study, this limitation is addressed by introducing a novo phased genome assembly and annotation of Vitis champinii. This new Vitis champinii genome was employed as reference for mapping RNA-seq reads from the same species under drought and salt stresses, and for comparison the same reads were also mapped to the Vitis vinifera PN40024.V4 reference genome. A significant increase in alignment rate was gained when mapping Vitis champinii RNA-seq reads to its own genome, compared to the Vitis vinifera PN40024.V4 reference genome, thus revealing the expression levels of genes specific to Vitis champinii. Moreover, differences in coding sequences were observed in ortholog genes between Vitis champinii and Vitis vinifera, which therefore challenges previous differential expression analyses performed between contrasting Vitis genotypes on the same gene from the Vitis vinifera genome. Genes with possible implications in drought and salt tolerance have been identified across the genome of Vitis champinii, and the same genomic data can potentially guide the discovery of candidate genes specific from Vitis champinii for other traits of interest, therefore becoming a valuable resource for rootstock breeding designs, specially towards increased drought and salinity due to climate change.

Climate and the evolving mix of grape varieties in Australia’s wine regions

The purpose of this study is to examine the changing mix of winegrape varieties in Australia so as to address the question: In the light of key climate indicators and predictions of further climate change, how appropriate are the grape varieties currently planted in Australia’s wine regions? To achieve this, regions are classified into zones according to each region’s climate variables, particularly average growing season temperature (GST), leaving aside within-region variations in climates. Five different climatic classifications are reported. Using projections of GSTs for the mid- and late 21st century, the extent to which each region is projected to move from its current zone classification to a warmer one is reported. Also shown is the changing proportion of each of 21 key varieties grown in a GST zone considered to be optimal for premium winegrape production. Together these indicators strengthen earlier suggestions that the mix of varieties may be currently less than ideal in many Australian wine regions, and would become even less so in coming decades if that mix was not altered in the anticipation of climate change. That is, grape varieties in many (especially the warmest) regions will have to keep changing, or wineries will have to seek fruit from higher latitudes or elevations if they wish to retain their current mix of varieties and wine styles.

Late season canopy management practices to reduce sugar loading and improve color profile of Cabernet-Sauvignon grapes and wines in the high irradiance and hot conditions of California Central Valley

Global warming is accelerating grape ripening, leading to unbalanced wines from fruit with high sugar content but poor aroma and colour development. Reducing the size of the photosynthetic apparatus after veraison has been shown to delay technological ripeness in cool climates, but methods have not been tested in areas with high irradiance and temperature where fruit exposure could have disastrous effects on berry composition. In this Cabernet-Sauvignon trial, we compared the application of an antitranspirant (pinolene), to severe canopy topping and above bunch zone leaf removal, all performed at mid-ripening, with an untouched control. We monitored the vines weekly by measuring stem water potential, gas exchange, fruit zone light exposure. We sampled berries to measure berry weight, total soluble solids, pH, titratable acidity, and the anthocyanin profile. At harvest, we assessed yield components, measured carbon isotope discrimination, rated sunburn on clusters, and produced experimental wines. We submitted harvest samples to metabolomic profiling through PFP-Q Exactive MS/MS and wines to sensory analysis. Application of the antitranspirant significantly reduced stomatal conductance and assimilation rate but did not affect the stem water potential. Inversely, leaf removal and topping increased water potential but did not affect leaf gas exchange. The late topping was the only treatment able to decrease sugar content (up to 2Bx), increase titratable acidity and pH, and improve anthocyanin content because of lower degradation of di-hydroxylated forms. Late leaf removal above the bunch zone increased lightning conditions in the canopy and produced the most significant damage on fruits. Yield components were not affected. This work suggests that late-season canopy management can effectively control ripening speeds and improve grapes and wines. Still, the effect on grape exposure in a critical time must be well balanced to avoid problems with the appropriate technique.