Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate change impacts: a multi-stress issue

With the aim of producing premium wines, it is admitted that moderate environmental stresses may contribute to the accumulation of compounds of interest in grapes. However the ongoing climate change, with the appearance of more limiting conditions of production is a major concern for the wine industry economic. Will it be possible to maintain the vineyards in place, to preserve the current grape varieties and how should we anticipate the adaptation measures to ensure the sustainability of vineyards? In this context, the question of the responses and adaptation of grapevine to abiotic stresses becomes a major scientific issue to tackle. An abiotic stress can be defined as the effect of a specific factor of the physico-chemical environment of the plants (temperature, availability of water and minerals, light, etc.) which reduces growth, and for a crop such as the vine, the yield, the composition of the fruits and the sustainability of the plants. Water stress is in many minds, but a systemic vision is essential for at least two reasons. The first reason is that in natural environments, a single factor is rarely limiting, and plants have to deal with a combination of constraints, as for example heat and drought, both in time and at a given time. The second reason is that plants, including grapevine, have central mechanisms of stress responses, as redox regulatory pathways, that play an important role in adaptation and survival. Here we will review the most recent studies dealing with this issue to provide a better understanding of the grapevine responses to a combination of environmental constraints and of the underlying regulatory pathways, which may be very helpful to design more adapted solutions to cope with climate change.

Long-term drought resilience of traditional red grapevine varieties from a semi-arid region

In recent decades, the scarcity of water resources in agriculture in certain areas has been aggravated by climate change, which has caused an increase in temperatures, changes in rainfall patterns, as well as an increase in the frequency of extreme phenomena such as droughts and heat waves. Although the vine is considered a drought-tolerant specie, it has to satisfy important water requirements to complete its cycle, which coincides with the hottest and driest months. Achieving sustainable viticulture in this scenario requires high levels of efficiency in the use of water, a scarce resource whose use is expected to be severely restricted in the near future. In this regard, the use of drought-tolerant varieties that are able to maintain grape yield and quality could be an effective strategy to face this change. During three consecutive seasons (2018-2020) the behavior in rainfed regime of 13 traditional red grapevine varieties of the Spain central region was studied. These varieties were cultivated in a collection at Centro de Investigación de la Vid y el Vino de Castilla-La Mancha (IVICAM-IRIAF) located in Tomelloso (Castilla-La Mancha, Spain). Yield components (yield, mean bunch and berry weight, pruning weight), physicochemical parameters of the musts (brix degree, total acidity, pH) and some physiological parameters related with water stress during ripening period (δ13C, δ18O) were analysed. The application of different statistical techniques to the results showed the existence of significant differences between varieties in their response to stressful conditions. A few varieties highlighted for their high ability to adapt to drought, being able to maintain high yields due to their efficiency in the use of water. In addition, it was possible quantify to what extent climate can be a determinant in the δ18O of musts under severe water stress conditions.

Postveraison shoot trimming in Tannat and Merlot: preliminary results on yield components, plant balance and berry composition

There is currently a trend towards the production of wines with low alcohol content. To achieve this, grapes with low sugar content must be used. There are techniques at the vineyard level that can delay ripening and avoid excessive sugar accumulation without, a priori, affecting the final polyphenol content. Postveraison shoot trimming (PVST) is experimentally evaluated for these purposes, but its impact under Uruguayan climatic conditions with high interannual variability is not known. The aim of this work is to assess the PVST in Tannat and Merlot cultivars and their impact on yield components, plant balance and berry primary composition. In this study, two commercial vineyards of 10 years old Tannat and Merlot (grafted on SO4) at Canelones Department were selected. During the 2020-201 growing season, grapevines were submitted to PVST when grapes reached 15º Brix. In a randomized block, trimmed (T) and control (C) plants were evaluated with three repetitions each cultivar. Evaluation of the evolution of primary berry composition during ripening, measurement of yield components and plant balance were performed. For both cultivars, PVST did not affect yield components. Merlot reached 5.4 kg per plant and Tannat 7.1 kg, with not statistical significance between treatments. However, statistical differences were observed in terms of plant balance. In Merlot Ravaz Index reached a difference of 5.3 (12.0 in T and 6.7 in C) meanwhile Tannat reached 3.5 of statistical difference (13.7 in T and 10.2 in C). The tendency to imbalance for the treated plants had an impact on the final grape composition. Merlot grapes showed statistical difference in final total acidity (0.3 g of difference between treatments) while treatments impact final sugar content on Tannat grapes (10.0 g of difference between treatments). Further studies are needed to assess the impact of different canopy management techniques in our conditions.

Climate modeling at local scale in the Waipara winegrowing region in the climate change context

In viticulture, a warming climate can have a very significant impact on grapevine development and therefore on the quality and characteristics of wines across different spatial scales, ranging from global to local. In order to adapt wine-growing to climate change, global climate models can be used to define future scenarios, but only at the scale of major wine regions. Despite the huge progress made over the last ten years in terms of the spatial resolution of climate models (now downscaled to a few square kilometres), they are not yet sufficiently precise to account for the local climate variability associated with such parameters as local topography, in spite of these parameters being decisive for vine and wine characteristics. This study describes a method to downscale future climate scenarios to vineyard scale. Networks of data loggers have been used to collect air temperature at canopy level in the Waipara winegrowing region (New Zealand) over five growing seasons. These measurements allow the creation of fine-scale geostatistical models and maps of temperature (at 100 m resolution) for the growing season. In order to model climate change at pilot site scale, these geostatistical models have been combined with regional climate change predictions for the periods 2031-2050 and 2081-2100 based on the RCP8.5 climate change scenario. The integration of local climate variability with regionalized climate change simulations allows assessment of the impacts of climate change at the vineyard scale. The improved knowledge gained using this methodology results from the increased horizontal resolution that better addresses the concerns of winegrowers. The results provide the local winegrowers with information necessary to understand current processes, as well as historical and future viticulture trends at the scale of their site, thereby facilitating decisions about future response strategies.

Teasing apart terroir: the influence of management style on native yeast communities within Oregon wineries and vineyards

Newer sequencing technologies have allowed for the addition of microbes to the story of terroir. The same environmental factors that influence the phenotypic expression of a crop also shape the composition of the microbial communities found on that crop. For fermented goods, such as wine, that microbial community ultimately influences the organoleptic properties of the final product that is delivered to customers. Recent studies have begun to study the biogeography of wine-associated microbes within different growing regions, finding that communities are distinct across landscapes. Despite this new knowledge, there are still many questions about what factors drive these differences. Our goal was to quantify differences in yeast communities due to management style between seven pairs of conventional and biodynamic vineyards (14 in total) throughout Oregon, USA. We wanted to answer the following questions: 1) are yeast communities distinct between biodynamic vineyards and conventional vineyards? 2) are these differences consistent across a large geographic region? 3) can differences in yeast communities be tied to differences in metabolite profiles of the bottled wine? To collect our data we took soil, bark, leaf, and grape samples from within each vineyard from five different vines of pinot noir. We also collected must and a 10º brix sample from each winery. Using these samples, we performed 18S amplicon sequencing to identify the yeast present. We then used metabolomics to characterize the organoleptic compounds present in the bottled wine from the blocks the year that we sampled. We are actively in the process of analysing our data from this study.