Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
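The first two steps described above (segmentation along the retention time axis, then a decomposition of each segment into per-sample scores) can be illustrated with a minimal sketch. This is not the authors' implementation: the array layout (samples × scans × m/z channels), the function names, the number of segments, and the synthetic data are all assumptions made for illustration, and a plain SVD of the unfolded segment is swapped in as a simple stand-in for the multiway decomposition used in [1, 2]. The resulting feature table is the kind of input a gradient boosted tree classifier would then be trained on.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, mz) array into segments along the scan axis."""
    return np.array_split(data, n_segments, axis=1)


def segment_scores(segment, n_components):
    """Per-sample scores for one segment.

    Assumption: an SVD of the unfolded, column-centred segment is used here
    as a simple stand-in for a true multiway decomposition.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)        # samples x (scans * mz)
    unfolded = unfolded - unfolded.mean(axis=0)      # centre each column
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]    # sample scores


def build_feature_table(data, n_segments=4, n_components=2):
    """Concatenate the scores of all segments into one feature table."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Hypothetical data: 6 samples, 40 scans, 10 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 40, 10))
features = build_feature_table(data)
print(features.shape)
```

Because every sample contributes the same retention time window to each segment, no peak picking or alignment step is needed before the decomposition; small retention time shifts within a segment are left for the decomposition to absorb, which a real multiway model handles better than the SVD stand-in shown here.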

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

Climate change projections to support the transition to climate-smart viticulture

The Earth’s system is undergoing major changes across a wide range of spatial and temporal scales in response to growing anthropogenic radiative forcing, which is pushing the whole system far beyond its natural variability. Sources of greenhouse gases largely exceed their sinks, leading to a strengthened greenhouse effect. More energy is thereby being supplied to the system, with inevitable shifts in climatic patterns and weather regimes. Over the last decades, these modifications have been manifested in the full statistical distributions of atmospheric variables, with dramatic changes in the frequency and intensity of extremes. Natural hazards, such as severe droughts, floods, forest fires, or heatwaves, are being triggered by extreme atmospheric events worldwide, threatening human activities. Viticulture is not only exposed to changing climates but is also highly vulnerable, as grapevine phenology and physiological development are strongly controlled by atmospheric conditions. Therefore, the assessment of climate change projections for a given region is critical for climate change adaptation and risk reduction in viticulture. By adopting timely and suitable measures, the future sustainability and resilience of the sector can be fostered. Climate-grapevine chain modelling is an essential tool for better planning and management. However, the accuracy of the resulting projections is limited by many uncertainties that must be duly taken into account when transferring knowledge to stakeholders and decision-makers. Climate-smart viticulture will comprise ensembles of locally tuned strategies, envisioning both adaptation and mitigation, assisted by emerging technologies and decision-support systems.

Anthocyanin profile is differentially affected by high temperature, elevated CO2 and water deficit in Tempranillo (Vitis vinifera L.) clones

The anthocyanin potential of grape berries is an important quality factor in wine production. Anthocyanin concentration and profile differ among varieties but also depend on environmental conditions, which are expected to be greatly modified by climate change in the future. These modifications may significantly alter the biochemical composition of berries at harvest, and thus wine typicity. Among the diverse approaches proposed to reduce the potential negative effects of climate change on grape quality, genetic diversity among clones can represent a source of candidates for selecting plant material better adapted to future climatic conditions. The effects of individual and combined factors associated with climate change (increase in temperature, rise in air CO2 concentration and water deficit) on the anthocyanin profile of different Tempranillo clones that differ in the length of their reproductive cycle were studied. The aim was to highlight the clones best adapted to maintain specific Tempranillo typicity in the future. Fruit-bearing cuttings were grown in controlled conditions under two temperatures (ambient temperature versus ambient temperature + 4ºC), two CO2 levels (400 ppm versus 700 ppm) and two water regimes (well-watered versus water deficit), either in combination or independently, in order to simulate future climate change scenarios. Elevated temperature increased anthocyanin acylation, whereas elevated CO2 and water deficit favoured the accumulation of malvidin derivatives, as well as the acylation and tri-hydroxylation level of anthocyanins. Although the observed changes in anthocyanin profile followed a common pattern among clones, the impact of environmental conditions was especially noticeable in one of the most widely distributed Tempranillo clones, the accession RJ43.

First step in the preparation of a soil map of the Protected Designation of Origin Valdepeñas (Central, Spain)

This work is a first step towards a map of vineyard soils. The characterization of the soils of the Protected Designation of Origin (D.P.O.) Valdepeñas will allow the studied profiles to be grouped according to their physico-chemical characteristics and the concentrations of the most relevant chemical elements. 90 soil profiles were analysed throughout the territory; the soils were sampled and described according to FAO (2006) and classified according to Soil Taxonomy (2014). All samples were air dried and sieved, and some physico-chemical parameters were determined following standard protocols. Major and trace elements were analysed by X-ray fluorescence. The statistical study was carried out with the SPSS program, and trend maps were made with the ArcGIS program. The studied soils have the following average properties: pH, 8.3; electrical conductivity, 0.20 dS/m (low); clay, 18.8% (medium) and CaCO3, 17.1% (high). The major elements of these soils are Si, followed by Ca and Al, with average contents of 203.7 g/kg, 105.5 g/kg and 74.0 g/kg respectively. In addition, 27 trace elements were studied. Among them, the average values of Ba (361.8 mg/kg), Sr (129.3 mg/kg), Rb (83.4 mg/kg), V (74.2 mg/kg) and Ce (70.6 mg/kg) stand out. Ba, V and Ce values are higher, and Sr and Rb values lower, than those found in the literature. The discriminant analysis shows a grouping percentage of 91%. The content of chemical elements together with the physico-chemical characteristics allows the soils to be grouped into four groups according to their order in the Soil Taxonomy classification; due to the importance of Calcisols in Castilla-La Mancha, they were established as their own group even though they do not appear in the Soil Taxonomy classification.

Making sense of available information for climate change adaptation and building resilience into wine production systems across the world

Effects of climate change on viticulture systems and winemaking processes are being felt across the world. The IPCC 6th Assessment Report concluded that widespread and rapid changes have occurred, the scale of recent changes being unprecedented over many centuries to many thousands of years. These changes will continue under all emission scenarios considered, including increases in the frequency and intensity of hot extremes, heatwaves, heavy precipitation and droughts. Wine companies need tools and models that allow them to peer into the future and identify the moment for intervention and measures for mitigation and/or avoidance. Previously, we presented conceptual guidelines for a 5-stage framework for defining adaptation strategies for wine businesses. That framework allows for direct comparison of different solutions to mitigate perceived climate change risks. Recent global climatic evolution and multiple reports of severe events since then (smoke taint, heatwaves and droughts, frost, hail and floods, rising sea levels) imply urgency in providing effective tools to tackle the multiple perceived risks. A coordinated drive towards a higher level of resilience is therefore required. Recent publications such as the Australian Wine Future Climate Atlas and results from projects such as H2020 MED-GOLD inform on expected climate change impacts on the wine sector, foreseeing the climate to expect at regional and vineyard scale in coming decades. We present examples of practical application of the Climate Change Adaptation Framework (CCAF) to impacts affecting wine production in two wine regions: Barossa (Australia) and Douro (Portugal). We demonstrate the feasibility of the framework for climate adaptation, using available data and tools to estimate historical climate-induced profitability loss, to project it into the future and to identify critical moments when disruptions may occur if timely measures are not implemented. Finally, we discuss adaptation measures and respective timeframes for successful mitigation of disruptive risk while enhancing the resilience of wine systems.

Grapevine yield estimation in a context of climate change: the GraY model

Grapevine yield is a key indicator to assess the impacts of climate change and the relevance of adaptation strategies in a vineyard landscape. At this scale, a yield model should use a number of parameters and input data in relation to the information available and be able to reproduce vineyard management decisions (e.g. soil and canopy management, irrigation). In this study, we used data from six experimental sites in Southern France (cv. Syrah) to calibrate a model of grapevine yield limited by water constraint (GraY). Each yield component (bud fertility, number of berries per bunch, berry weight) was calculated as a function of the soil water availability simulated by the WaLIS water balance model at critical phenological phases. The model was then evaluated in 10 grapegrowers’ plots, covering a diversity of biophysical and technical contexts (soil type, canopy size, irrigation, cover crop). We identified three critical periods for yield formation: after flowering in the previous year for the number of bunches and berries, and around pre-veraison and post-veraison of the same year for mean berry weight. Yields were simulated with a model efficiency (EF) of 0.62 (NRMSE = 0.28). Bud fertility and number of berries per bunch were more accurately simulated (EF = 0.90 and 0.77, NRMSE = 0.06 and 0.10, respectively) than berry weight (EF = -0.31, NRMSE = 0.17). Model efficiency on the on-farm plots reached 0.71 (NRMSE = 0.37), simulating yields from 1 to 8 kg/plant. The GraY model is an original model estimating grapevine yield evolution on the basis of water availability under future climatic conditions. It allows the effects of various adaptation levers, such as planting density, cover crop management, fruit/leaf ratio, shading and irrigation, to be evaluated in various production contexts.
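The two evaluation statistics quoted above can be made concrete with a short sketch. The abstract does not define them, so two assumptions are made here: EF is taken in the common Nash-Sutcliffe form (one minus the ratio of the residual sum of squares to the variance of the observations around their mean), and NRMSE is taken as the root mean square error divided by the mean of the observations. The function names and the yield values are hypothetical.

```python
import math


def model_efficiency(observed, simulated):
    """EF in the Nash-Sutcliffe form (assumed, not stated in the abstract)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def nrmse(observed, simulated):
    """RMSE normalised by the mean observation (assumed normalisation)."""
    n = len(observed)
    mean_obs = sum(observed) / n
    rmse = math.sqrt(sum((o - s) ** 2 for o, s in zip(observed, simulated)) / n)
    return rmse / mean_obs


# Hypothetical yields in kg/plant, for illustration only.
observed = [2.0, 4.0, 6.0, 8.0]
simulated = [2.5, 3.5, 6.5, 7.5]

ef = model_efficiency(observed, simulated)
err = nrmse(observed, simulated)
print(round(ef, 2), round(err, 2))
```

Under this definition, EF equals 1 for a perfect fit and 0 for a model that does no better than predicting the mean observation, so a negative value such as the one reported for berry weight would indicate predictions that are worse than that mean.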