Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account. They are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
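The segmentation and decomposition steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the function names (`segment_chromatograms`, `sample_scores`, `build_feature_matrix`) and the parameters `n_segments` and `n_components` are hypothetical, and a truncated SVD of the sample-mode unfolding stands in for the multiway decomposition used in [1, 2]. It assumes the raw data are arranged as a samples x scans x m/z array.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into windows along the retention time axis."""
    return np.array_split(data, n_segments, axis=1)


def sample_scores(segment, n_components):
    """Truncated SVD of the sample-mode unfolding (a stand-in for a multiway model)."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)      # samples x (scans * m/z)
    unfolded = unfolded - unfolded.mean(axis=0)    # centre each column
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]  # one score vector per sample


def build_feature_matrix(data, n_segments=4, n_components=2):
    """Concatenate per-segment sample scores into one samples x features matrix."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([sample_scores(seg, n_components) for seg in segments])


# Synthetic stand-in data (assumption): 6 samples, 40 scans, 10 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 40, 10))
segments = segment_chromatograms(data, 4)
features = build_feature_matrix(data, n_segments=4, n_components=2)
print(features.shape)
```

Because each segment is decomposed on its own, no peak needs to be detected or matched across samples. The resulting feature matrix is what a supervised classifier would be trained on, for example a gradient boosted tree model such as scikit-learn's `GradientBoostingClassifier`; which library the authors used is not stated here.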

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
