Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

FUNGAL DIVERSITY AND DYNAMICS IN CHAMPAGNE VINEYARDS: FROM VINE TO WINE

Champagne is a well-known wine region in Northern France with distinct terroirs and three main grape varieties. As for any vineyard, wine quality is highly linked to the microbiological characteristics of the raw materials. However, Champagne grape microbiota, especially its fungal component, has yet to be fully characterized. Our study focused on describing this mycobiota, from vine to small scale model wine, for the two main Champagne grape varieties, Pinot Noir and Meunier, using complementary cultural and omics approaches.

Exploring the inhibitor effect of different commercial chitosan-based preparations on malolactic fermentation in rosé wine

Chitosan is a natural polymer of β-D-linked N-acetyl-D-glucosamine units (1,2), that has only recently been approved by OIV for its use in winemaking to help with microbial control, metal chelation, clarification, and reducing contaminants.

Extreme canopy management for vineyard adaptation to climate change: is it a good idea?

Climate change constitutes an enormous challenge for humankind and for all human activities, viticulture not being an exception. Long-term strategic changes are probably needed the most, but growers also need to deal with short-term changes: summers that are getting progressively warmer, earlier harvest dates and higher pH in musts and wines. In the last 10-15 years, a relevant corpus of research is being developed worldwide in order to evaluate to which extent extreme canopy management operations, aimed at reducing leaf area and, thus, limiting the source to sink ratio, could be useful to delay ripening. Although extreme canopy management can result in relevant delays in harvest dates, longer term studies, as well as detailed analysis of their implications on carbohydrate reserves, bud fertility and future yield are desirable before these practices can be recommended.

Mycorrhizal symbiosis modulates flavonoid and amino acid profiles in grapes of Tempranillo and Cabernet Sauvignon 

Arbuscular mycorrhizal fungi (AMF) symbiosis is probably the most widespread beneficial interaction between plants and microorganisms. AMF has been widely reported to promote grapevine growth, water and nutrient uptake as well as both biotic and abiotic stress tolerance[1]. However, the impact of AMF on grape composition has been less studied. The aim of this work was to evaluate the effects of the association between two commercial grapevine cultivars (Tempranillo and Cabernet Sauvignon grafted onto 110 rootstock) and AMF on the anthocyanin, flavonol and amino acid concentrations and profiles of grapes.

The rootstock, the neglected player in the scion transpiration even during the night

Water is the main limiting factor for yield in viticulture. Improving drought adaptation in viticulture will be an increasingly important issue under climate change. Genetic variability of water deficit responses in grapevine partly results from the rootstocks, making them an attractive and relevant mean to achieve adaptation without changing the scion genotype. The objective of this work was to characterize the rootstock effect on the diurnal regulation of scion transpiration. A large panel of 55 commercial genotypes were grafted onto Cabernet Sauvignon. Three biological repetitions per genotype were analyzed. Potted plants were phenotyped on a greenhouse balance platform capable of assessing real-time water use and maintaining a targeted water deficit intensity. After a 10 days well-watered baseline period, an increasing water deficit was applied for 10 days, followed by a stable water deficit stress for 7 days. Pruning weight, root and aerial dry weight and transpiration were recorded and the experiment was repeated during two years. Transpiration efficiency (ratio between aerial biomass and transpiration) was calculated and δ13C was measured in leaves for the baseline and stable water deficit periods. A large genetic variability was observed within the panel. The rootstock had a significant impact on nocturnal transpiration which was also strongly and positively correlated with maximum daytime transpiration. The correlations with growth and water use efficiency related traits will be discussed. Transpiration data were also related with VPD and soil water content demonstrating the influence of environmental conditions on transpiration. These results highlighted the role of the rootstock in modulating water deficit responses and give insights for rootstock breeding programs aimed at identifying drought tolerant rootstocks. It was also helpful to better define the mechanisms on which the drought tolerance in grapevine rootstocks is based on.