Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Impact of climate variability and change on grape yield in Italy

Viticulture is entangled with weather and climate. Therefore, areas currently suitable for grape production can be challenged by climate change. Winegrowers in Italy already experiences the effect of climate change, especially in the form of warmer growing season, more frequent drought periods, and increased frequency of weather extremes.
The aim of this study is to investigate the impact of climate variability and change on grape yield in Italy to provide winegrowers the information needed to make their business more sustainable and resilient to climate change. We computed a specific range of bioclimatic indices, selected by the International Organisation of Vine and Wine (OIV), and correlated them to grape yield data. We have worked in collaboration with some wine consortiums in northern and central Italy, which provided grape yield data for our analysis.
Using climate variables from the E-OBS dataset we investigate how the bioclimatic indices changed in the past, and the impact of this change on grape productivity in the study areas. The climate impact on productivity is also investigated by using high-resolution convection-permitting models (CPMs – 2.2 horizontal resolution), with the purpose of estimating productivity in future emission scenarios. The CPMs are likely the best available option for this kind of impact studies since they allow a better representation of small-scale processes and features, explicitly resolve deep convection, and show an improved representation of extremes. In our study, we also compare CPMs with regional climate models (RCMs – 12 km horizontal resolution) to assess the added value of high-resolution models for impact studies. Further development of our study will lead to assessing the future suitability for vine cultivation and could lead to the construction of a statistical model for future projection of grape yield.

Grape berry size is a key factor in determining New Zealand Pinot noir wine composition

Making high quality but affordable Pinot noir (PN) wine is challenging in most terroirs and New Zealand’s (NZ) situation is no exception. To increase the probability of making highly typical PN wines producers choose to grow grapes in cool climates on lower fertility soils while adopting labour intensive practices. Stringent yield targets and higher input costs necessarily mean that PN wine cost is high, and profitability lower, in line-priced varietal wine ranges. To understand the reasons why higher yielding vines are perceived to produce wines of lower quality we have undertaken an extensive study of PN in NZ. Since 2018, we established a network of twelve trial sites in three NZ regions to find individual vines that produced acceptable commercial yields (above 2.5kg per vine) and wines of composition comparable to “Icon” labels. Approximately 20% of 660 grape lots (N = 135) were selected from within a narrow juice Total Soluble Solids (TSS) range and made into single vine wines under controlled conditions. Principal Component Analysis of the vine, berry, juice and wine parameters from three vintages found grape berry mass to be most effective clustering variable. As berry mass category decreased there was a systematic increase in the probability of higher berry red colour and total phenolics with a parallel increase in wine phenolics, changed aroma fraction and decreased juice amino acids. The influence of berry size on wine composition would appear stronger than the individual effects of vintage, region, vineyard or vine yield. Our observations support the hypothesis that it is possible to produce PN wines that fall within an “Icon” benchmark composition range at yields above 2.5kg per vine provided that the Leaf Area:Fruit Weight ratio is above 12cm2 per g, mean berry mass is below 1.2g and juice TSS is above 22°Brix.

How distinctive are single vineyard Gewürztraminer musts and wines from Alto Adige (Italy) based on untargeted analysis, sensory profiling, and chemometric elaboration?

Vitis vinifera L. ‘Gewürztraminer’ is a historical grape variety of Alto Adige (Südtirol), Italy, which is widely grown in the area of Tramin an der Weinstraße, but is also grown globally. It produces highly aromatic wines that are strongly influenced by the terroir of the vineyard sites where they are grown. This study looked at musts and young wines from ‘Gewürztraminer’ grapes harvested in seven distinct vineyards near Tramin and then processed at Cantina di Termeno, minimizing winemaking protocol variability. Samples were profiled using bidimensional gas chromatography–time-of-flight mass spectrometry, liquid chromatography coupled to electrochemical detection, and near-IR spectrometry. The data were subjected to Principle Component Analysis and Hierarchical Clustering Analysis. Sensory discriminant testing was undertaken using the sorting method with a semi-trained panel, and the data were processed using Multidimensional Scaling. Seven must/wine pairs could be distinguished based on their untargeted volatilome profiles and on sensory evaluation. As expected, there were greater differences in the volatile compounds between the wines than between the musts. The wines from vineyards 4 and 5 were nonetheless quite homogenous in terms of chemical and sensory analyses, as were the wines from vineyards 1 and 3. For the phenolic profile, differences were noted between the musts and wines of vineyards 2, 3, and 4, but the musts from vineyards 5 and 7 were similar. Sensory analysis showed the wines from vineyards 6 and 7 to be distinct from the rest. These results reinforce that the composition of ‘Gewürztraminer’ musts and wines is strongly determined by vineyard site, even in a small geographic area with high variability of the terroir (soil and microclimate), and that these differences are apparent in the flavours and aromas of the finished wines. Further confirmation would require a larger sample of wines, preferably from several vintages.

Modeling the suitability of Pinot Noir in Oregon’s Willamette Valley in a changing climate

Air temperature is the key driver of grapevine phenology and a significant environmental factor impacting yield and quality for a winegrape growing region. In this study the optimal downscaled CMIP5 ensemble for computing thegrowing season average temperature (GST) viticulture climate classification index was determined to spatially compute on a decadal basis predictions of the GST climate index and the grapevine sugar ripeness (GSR) model for Pinot Noir throughout the Willamette Valley (WV) American Viticultural Area (AVA). Forecasts for average temperature and a 220 g/L target sugar concentration level were computed using daily Localized Constructed Analogs (LOCA) downscaled CMIP5 historic and Representative Concentration Pathways (RCP) future climate projections of minimum and maximum daily temperature. We explore spatiotemporal trends of the GST climate classification index and Pinot Noir specific applications of the GSR phenology model for the WV AVA. Spatiotemporal computations of the GST climate index and Pinot Noir specific applications of the GSR model enable the opportunity to explore relationships between their computed values with one intent being to provide updated GST ranges that better align with current temperature-based modeling understanding of Pinot Noir grapevine phenology and the viticultural application of LOCA CMIP5 climate projections for the WV AVA. The Pinot Noir specific applications of the GSR model or the GST index with updated bounds indicate that the percent of the WV AVA area suitable for Pinot Noir production is currently at or near its peak value in the upper 80s to lower 90s of this century.

From a local to an international scale: sensory benchmarking of PDO wines. Quincy and Reuilly PDO wines (Sauvignon blanc) as a case study (France)

In a collective marketing strategy, the Protected Designation of Origin (PDO) can be used as a quality indicator. To highlight terroir specificities, it is useful to know how the wines are positioned on the local, national or international market from a sensory point of view. This is especially true for a comparison of varietal wines (e.g. Sauvignon blanc). We focus on the case of two closed Loire Valley PDO (France): Quincy and Reuilly. Three distinct tastings were organized. Firstly, at the local level comparing the 2 PDO (11 and 9 wines, 17 professional assessors); secondly at a regional level adding 3 closed PDO: Menetou-Salon, Sancerre and Pouilly-Fumé (3 wines per PDO, 16 assessors) and thirdly at an international level comparing these 5 PDO with Sauvignon Blanc wines coming from South Africa, New Zealand and Chile (1 to 3 wines per PDO, 19 assessors). All the wines were from the 2019 vintage and were considered to have a traditional elaboration process without contact with oak. A sensory descriptive analysis was performed using an aroma wheel allowing to combine a Check-All-That-Apply methodology, often used in sensory benchmarking, with a hierarchical structuration of the attributes. The aim is to facilitate data acquisition in a professional context without common training, to consider the hierarchical relationships among the attributes during the data analysis and to be able to characterize wines with a large range of sensorial variability. We use univariate, multivariate and clustering analyses. Similarities and differences between Quincy and Reuilly PDO wines and other Sauvignon blanc wines were identified. Specific attributes can distinguish the two PDO and different proximities exist with other local PDO, while clear differences were observed compared to international wines. Our study contributes to propose and discuss a method to do a wine sensory benchmarking highlighting sensory specificities linked to origin.