Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Upscaling the integrated terroir zoning through digital soil mapping: a case study in the Designation of Origin Campo de Borja

homogeneous zones by intersecting several partial zonings of major factors that influence vineyard growth. Each of them follows specific process from their corresponding disciplines. Soil zoning specifically refers to a Soil Resource Inventory map that has traditionally been generated by conventional soil mapping methods. These methods have shortcomings in reaching fine cartographic and categorical details and involve significant expenses, which undermines their applicability. A new framework named Digital Soil Mapping has introduced quantitative models by statistical techniques to establish soil-landscape relationships and is able to provide intensive scale cartography.

In the present study, a microzoning at 1:10.000 scale is generated from an initial zoning, where the conventional soil map with polytaxic map units is replaced by a new one from digital techniques that disaggregates them. The comparison between the zonings considers a quantitative evaluation of capability for each Homogeneous Terroir Unit by means of the Viticultural Quality Index and its categorization based on its distribution by map. The spatial intersection of both maps gives rise to a confusion matrix in which the flows of class variations after the substitution are assessed.

The results show a five-fold increase in the number of Homogeneous Terroir Units identified and a larger differentiation among them, evidenced by a wider range in the capability index distribution. Both elements are accompanied by an increase in the detection of areas of higher potential within previously undervalued uniform zones.These features are a direct effect of the improvements brought by Digital Soil Mapping techniques and would verify the advantages of their implementation in the Integrated Terroir zoning. Eventually, such new highly detailed terroir units would benefit precision viticulture and sustainable management practices.

Effect of vigour and number of clusters on eonological parameters and metabolic profile of Cabernet Sauvignon red wines

Vegetative growth and yield are reported to affect grape and wine quality. They can be controlled through different techniques linked to vine management. The objective of this research was to determine the effect of vine vigour and number of clusters per vine on physicochemical composition and phenolic profile of red wines. The experiment was carried out during two vegetative cycles, with cv. Cabernet Sauvignon grafted onto Paulsen 1103. Three vine vigour were defined, according to shoot weight at previous harvests, being low, medium and high. Five treatments of number of clusters were used for each vigour, with 15, 22, 29, 36, and 45 clusters per vine. Grapes from all treatments were harvested in the same day from Brix and total acidity criteria. Thirty days after bottling, classical analyzes and phenolic compounds were performed. As results, different responses were obtained from each vintage. In 2020, a dry season from veraison to harvest, grapes and wines obtained from low vigour treatment and 45 clusters per vine was the highest in sugar and alcohol content respectively, while grapes and wines from high vigour and 15 clusters presented the lowest sugar and alcohol content. Total anthocyanins were higher in treatment with low vigour and 15 clusters, while the lowest amounts were found in low vigour with 45 clusters, as well as medium and high vigour with 36 clusters per vine. Total tannins were higher in high vigour with 22 clusters and medium vigour with 29 clusters, while were lower in low vigour with 36 clusters. In 2021, a wet season at harvest, responses were different, and great variations were observed between treatments. As conclusions, yield and vine vigour had strong influence on grape and wine quality, promoting different enological potentials on which can be indicated/used for aging strategies of red and even rosé wines.

Influence of climatic conditions on grape composition of Tempranillo in La Mancha DO (Spain)

The aim of this work was to analyze the variability in grape composition of the Tempranillo cultivar related to climatic conditions, in La Mancha Designation of Origin. Grape composition (sugar content, total acidity, pH, malic acid, and total and extractable anthocyanins) recorded during ripening, were analysed for the period 2000-2019. The weather conditions at daily time scale, recorded during the same period, were also evaluated. The relationships between grape parameters with climatic variables related to temperature and to water deficits, referring different periods between phenological events along the growing cycle, were evaluated using regression analysis. High variability in grape composition was observed in the period analysed. Total acidity varied between 3.7 and 7.3 gL-1 while malic acid varied between 1.2 and 4 gL-1. The extractable anthocyanins ranged between 526 and 972 mgL-1, and total anthocyanins ranged between 922 and 1388 mgL-1, being the lowest values recorded in the hottest year (2017). Total acidity decreased 0.77 gL-1 for an increase of 100 GDD, while malic acid decrease in 0.42 gL-1 for the same GDD increase, being the period between veraison and harvest the one that seemed to have higher influence on acidity. In addition, it was confirmed that increasing water deficits decreased acidity. Total and extractable anthocyanins increased in about 210 and 105 mgL-1, respectively, with an increase of 100 GDD from veraison to harvest, and the increase in water deficits favour the increase of anthocyanins, both total and extractable anthocyanins. Total and extractable anthocyanins concentration increased in 35 and 22 mgL-1 per an increase of 10 mm in the water deficit. These results can be of interest to understand the potential changes that grapes composition may suffer under future warmer climates.

Amino nitrogen content in grapes: the impact of crop limitation

As an essential element for grapevine development and yield, nitrogen is also involved in the winemaking process and largely affects wine composition. Grape must amino nitrogen deficiency affects the alcoholic fermentation kinetics and alters the development of wine aroma precursors. It is therefore essential to control and optimize nitrogen use efficiency by the plant to guarantee suitable grape nitrogen composition at harvest. Understanding the impact of environmental conditions and cultural practices on the plant nitrogen metabolism would allow us to better orientate our technical choices with the objective of quality and sustainability (less inputs, higher efficiency). This trial focuses on the impact of crop limitation – that is a common practice in European viticulture – on nitrogen distribution in the plant and particularly on grape nitrogen composition. A wide gradient of crop load was set up in a homogeneous plot of Chasselas (Vitis vinifera) in the experimental vineyard of Agroscope, Switzerland. Dry weight and nitrogen dynamics were monitored in the roots, trunk, canopy and grapes, during two consecutive years, using a 15N-labeling method. Grape amino nitrogen content was assessed in both years, at veraison and at harvest. The close relationship between fruits and roots in the maintenance of plant nitrogen balance was highlighted. Interestingly, grape nitrogen concentration remained unchanged regardless of crop load to the detriment of the growth and nitrogen content of the roots. Meanwhile, the size and the nitrogen concentration of the canopy were not affected. Leaf gas exchange rates were reduced in response to lower yield conditions, reducing carbon and nitrogen assimilation and increasing intrinsic water use efficiency. The must amino nitrogen profiles could be discriminated as a function of crop load. These findings demonstrate the impact of plant balance on grape nitrogen composition and contribute to the improvement of predictive models and sustainable cultural practices in perennial crops.

Teasing apart terroir: the influence of management style on native yeast communities within Oregon wineries and vineyards

Newer sequencing technologies have allowed for the addition of microbes to the story of terroir. The same environmental factors that influence the phenotypic expression of a crop also shape the composition of the microbial communities found on that crop. For fermented goods, such as wine, that microbial community ultimately influences the organoleptic properties of the final product that is delivered to customers. Recent studies have begun to study the biogeography of wine-associated microbes within different growing regions, finding that communities are distinct across landscapes. Despite this new knowledge, there are still many questions about what factors drive these differences. Our goal was to quantify differences in yeast communities due to management style between seven pairs of conventional and biodynamic vineyards (14 in total) throughout Oregon, USA. We wanted to answer the following questions: 1) are yeast communities distinct between biodynamic vineyards and conventional vineyards? 2) are these differences consistent across a large geographic region? 3) can differences in yeast communities be tied to differences in metabolite profiles of the bottled wine? To collect our data we took soil, bark, leaf, and grape samples from within each vineyard from five different vines of pinot noir. We also collected must and a 10º brix sample from each winery. Using these samples, we performed 18S amplicon sequencing to identify the yeast present. We then used metabolomics to characterize the organoleptic compounds present in the bottled wine from the blocks the year that we sampled. We are actively in the process of analysing our data from this study.