Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach consists of segmentation of the chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
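The first two steps of such a pipeline can be illustrated with a minimal sketch. It is not the authors' implementation: the equal-width segmentation, the synthetic two-compound data, the plain CP (PARAFAC) model fitted by alternating least squares, and all function names (`segment_chromatograms`, `khatri_rao`, `cp_als`) are assumptions made for illustration only. The abstract does not specify which multiway model or which transformation of the segments is used, so the transformation step is omitted here.

```python
import numpy as np

def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into retention-time windows."""
    # array_split tolerates scan counts not divisible by n_segments
    return np.array_split(data, n_segments, axis=1)

def khatri_rao(a, b):
    """Column-wise Kronecker product: (I, R) and (J, R) -> (I*J, R)."""
    return (a[:, None, :] * b[None, :, :]).reshape(-1, a.shape[1])

def cp_als(tensor, rank, n_iter=200, seed=0):
    """Fit a CP model to a 3-way array by alternating least squares."""
    rng = np.random.default_rng(seed)
    n_i, n_j, n_k = tensor.shape
    A = rng.random((n_i, rank))   # sample-mode scores
    B = rng.random((n_j, rank))   # elution profiles
    C = rng.random((n_k, rank))   # mass spectra
    # Unfoldings of the tensor along each of its three modes
    X0 = tensor.reshape(n_i, n_j * n_k)
    X1 = np.moveaxis(tensor, 1, 0).reshape(n_j, n_i * n_k)
    X2 = np.moveaxis(tensor, 2, 0).reshape(n_k, n_i * n_j)
    for _ in range(n_iter):
        # Update one factor matrix at a time, holding the other two fixed
        A = X0 @ np.linalg.pinv(khatri_rao(B, C)).T
        B = X1 @ np.linalg.pinv(khatri_rao(A, C)).T
        C = X2 @ np.linalg.pinv(khatri_rao(A, B)).T
    return A, B, C

# Synthetic data (assumption): 6 samples, 60 scans, 5 m/z channels,
# two compounds eluting as Gaussian peaks at scans 10 and 18.
scans = np.arange(60)
elution = np.stack([np.exp(-0.5 * ((scans - c) / 3.0) ** 2) for c in (10, 18)], axis=1)
spectra = np.array([[1.0, 0.2, 0.0, 0.5, 0.0],
                    [0.0, 0.6, 1.0, 0.0, 0.3]]).T
conc = np.array([[1.0, 2.0, 3.0, 1.0, 2.0, 3.0],
                 [3.0, 1.0, 2.0, 2.0, 3.0, 1.0]]).T
data = np.einsum("ir,jr,kr->ijk", conc, elution, spectra)

segments = segment_chromatograms(data, n_segments=2)
scores, profiles, loadings = cp_als(segments[0], rank=2)

# Relative reconstruction error of the fitted model on the first segment
recon = np.einsum("ir,jr,kr->ijk", scores, profiles, loadings)
rel_error = np.linalg.norm(segments[0] - recon) / np.linalg.norm(segments[0])
print(segments[0].shape, rel_error)
```

In a complete pipeline, the sample-mode `scores` of all segments would be concatenated into a feature matrix and passed to a gradient boosted tree classifier. A plain CP model assumes identical elution profiles across samples, so data with retention time shifts would call for a more flexible multiway model than the one sketched here.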

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
