Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate and the evolving mix of grape varieties in Australia’s wine regions

The purpose of this study is to examine the changing mix of winegrape varieties in Australia so as to address the question: In the light of key climate indicators and predictions of further climate change, how appropriate are the grape varieties currently planted in Australia’s wine regions? To achieve this, regions are classified into zones according to each region’s climate variables, particularly average growing season temperature (GST), leaving aside within-region variations in climates. Five different climatic classifications are reported. Using projections of GSTs for the mid- and late 21st century, the extent to which each region is projected to move from its current zone classification to a warmer one is reported. Also shown is the changing proportion of each of 21 key varieties grown in a GST zone considered to be optimal for premium winegrape production. Together these indicators strengthen earlier suggestions that the mix of varieties may be currently less than ideal in many Australian wine regions, and would become even less so in coming decades if that mix was not altered in the anticipation of climate change. That is, grape varieties in many (especially the warmest) regions will have to keep changing, or wineries will have to seek fruit from higher latitudes or elevations if they wish to retain their current mix of varieties and wine styles.

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Effect of vigour and number of clusters on eonological parameters and metabolic profile of Cabernet Sauvignon red wines

Vegetative growth and yield are reported to affect grape and wine quality. They can be controlled through different techniques linked to vine management. The objective of this research was to determine the effect of vine vigour and number of clusters per vine on physicochemical composition and phenolic profile of red wines. The experiment was carried out during two vegetative cycles, with cv. Cabernet Sauvignon grafted onto Paulsen 1103. Three vine vigour were defined, according to shoot weight at previous harvests, being low, medium and high. Five treatments of number of clusters were used for each vigour, with 15, 22, 29, 36, and 45 clusters per vine. Grapes from all treatments were harvested in the same day from Brix and total acidity criteria. Thirty days after bottling, classical analyzes and phenolic compounds were performed. As results, different responses were obtained from each vintage. In 2020, a dry season from veraison to harvest, grapes and wines obtained from low vigour treatment and 45 clusters per vine was the highest in sugar and alcohol content respectively, while grapes and wines from high vigour and 15 clusters presented the lowest sugar and alcohol content. Total anthocyanins were higher in treatment with low vigour and 15 clusters, while the lowest amounts were found in low vigour with 45 clusters, as well as medium and high vigour with 36 clusters per vine. Total tannins were higher in high vigour with 22 clusters and medium vigour with 29 clusters, while were lower in low vigour with 36 clusters. In 2021, a wet season at harvest, responses were different, and great variations were observed between treatments. As conclusions, yield and vine vigour had strong influence on grape and wine quality, promoting different enological potentials on which can be indicated/used for aging strategies of red and even rosé wines.

Impact of climate change on the viticultural climate of the Protected Designation of Origin “Jumilla” (SE Spain)

Protected Designation of Origin “Jumilla” (PDO Jumilla) is located in the Spanish provinces of Albacete and Murcia, in the South-eastern part of the Iberian Peninsula, where most of the models predict a severe impact of climate change in next decades. PDO Jumilla covers an area of 247,054 hectares, of which more than 22,000 hectares

Climate ethnography and wine environmental futures

Globalisation and climate change have radically transformed world wine production upsetting the established order of wine ecologies. Ecological risks and the future of traditional agricultural systems are widely debated in anthropology, but very little is understood of the particular challenges posed by climate change to viticulture which is seen by many as the canary in the coalmine of global agriculture. Moreover, wine as a globalised embedded commodity provides a particularly telling example for the study of climate change having already attracted early scientific attention. Studies of climate change in viticulture have focused primarily on the production of systematic models of adaptation and vulnerability, while the human and cultural factors, which are key to adaptation and sustainable futures, are largely missing. Climate experts have been unanimous in recognising the urgent need for a better understanding of the complex dynamics that shape how climate change is experienced and responded to by human systems. Yet this call has not yet been addressed. Climate ethnography, coined by the anthropologist Susan Crate (2011), aims to bridge this growing disjuncture between climate science and everyday life through the exploration of the social meaning of climate change. It seeks to investigate the confrontation of its social salience in different locations and under different environmental guises (Goodman 2018: 340). By understanding how wine producers make sense of the world (and the environment) and act in it, it proposes to focus on the co-production of interdisciplinary knowledge by identifying and foreshadowing problems (Goodman 2018: 342; Goodman & Marshall 2018). It seeks to offer an original, transformative and contrasted perspective to climate change scenarios by investigating human agency -individual or collective- in all its social, political and cultural diversity. An anthropological approach founded on detailed ethnographies of wine production is ideally placed to address economic, social and cultural disruptions caused by the emergence of these new environmental challenges. Indeed, the community of experts in environmental change have recently called for research that will encompass the human dimension and for more broad-based, integrated through interdisciplinarity, useful knowledge (Castree & al 2014). My paper seeks to engage with climate ethnography and discuss what it brings to the study of wine environmental futures while exploring the limitations of the anthropological environmental approach.