Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Legacy of land-cover changes on soil erosion and microbiology in Burgundian vineyards

Soils in vineyards are recognized as complex agrosystems whose characteristics reflect complex interactions between natural factors (lithology, climate, slope, biodiversity) and human activities. To date, most of the unknown lies in an incomplete understanding of soil ecosystems, and specifically in the microbial biodiversity even though soil microbiota is involved in many key functions, such as nutrient cycling and carbon sequestration. Soil biological properties are indicative of soil quality. Therefore, understanding how soil communities are related to soil ecosystem functioning is becoming an essential issue for soil strategy conservation. Here, we propose to assess the importance of land-cover history on the present-day microbiological and physico-chemical properties. The studied area was selected in the Burgundian vineyards (Pernand-Vergelesses, Burgundy, France) where land occupation has been reconstructed over the last 40 years. Soil samples were collected in five areas reflecting various land cover history (forest, vineyards, shifting from forest to vineyards). For each area, physico-chemical parameters (pH, C, N, P, grain size) were measured and DNA was extracted to characterize the abundance and diversity of microbial communities. The obtained results show significant differences in the five areas suggesting that present-day microbial molecular biomass and bacterial taxonomic is partly inherited from past land occupation. Over longer period of time, such study of land-uses legacies may help to better assess ecosystem recovery and the impact of management practices for a better soil quality and vineyards sustainability.

A blueprint for managing vine physiological balance at different spatial and temporal scales in Champagne

In Champagne, the vine adaptation to different climatic and technical changes during these last 20 years can be seen through physiological balance disruptions. These disruptions emphasize the general grapevine decline. Since the 2000s, among other nitrogen stress indicators, the must nitrogen has been decreasing. The combination of restricted mineral fertilizers and herbicide use, the growing variability of spring rainfall, the increasing thermal stress as well as the soil type heterogeneity are only a few underlying factors that trigger loss of physiological balance in the vineyards. It is important to weigh and quantify the impact of these factors on the vine. In order to do so, the Comité Champagne uses two key-tools: networking and modelization. The use of quantitative and harmonized ecophysiological indicators is necessary, especially in large spatial scales such as the Champagne appellation. A working group with different professional structures of Champagne has been launched by the Comité Champagne in order to create a common ecophysiology protocol and thus monitor the vine physiology, yearly, around 100 plots, with various cultural practices and types of soil. The use of crop modelling to follow the vine physiological balance within different pedoclimatic conditions enables to understand the present balance but also predict the possible disruptions to come in future climatic scenarios. The physiological references created each year through the working group, benefit the calibration of the STICS model used in Champagne. In return, the model delivers ecophysiology indicators, on a daily scale and can be used on very different types of soils. This study will present the bottom-up method used to give accurate information on the impacts of soil, climate and cultural practices on vine physiology.

Rootstock regulation of scion phenotypes: the relationship between rootstock parentage and petiole mineral concentration

Grapevine is grown as a graft since the end of the 19th century. Rootstocks not only provide tolerance to Phylloxera but also ensure the supply of water and mineral nutrients to the scion. Rootstocks are an important mean of adaptation to environmental conditions, because the scion controls the typical features of the grapes and wine. However, among the large diversity of rootstocks worldwide, few of them are commercially used in the vineyard. The aim of this study was to investigate the extent to which rootstocks modify the mineral composition of the petioles of the scion. Vitis vinifera cvs. Cabernet-Sauvignon, Pinot noir, Syrah and Ugni blanc were grafted onto 55 different rootstock genotypes and planted in a vineyard as three replicates of 5 vines. Petioles were collected in the cluster zone with 6 replicates per combination. Petiolar concentrations of 13 mineral elements (N, P, K, S, Mg, Ca, Na, B, Zn, Mn, Fe, Cu, Al) at veraison were determined. Scion, rootstock and the interaction explained the same proportion of the phenotypic variance for most mineral elements. Rootstock genotype showed a significant influence on the petiole mineral element composition. Rootstock effect explained from 7 % for Cu to 25 % for S of the variance. The difference of rootstock conferred mineral status is discussed in relation to vigor and fertility. Rootstocks were also genotyped with 23 microsatellite markers. Data were analysed according to genetic groups in order to determine whether the petiole mineral composition could be related to the genetic parentage of the rootstock. Thanks to a highly powerful design, it is the first time that such a large panel of rootstocks grafted with 4 scions has been studied. These results give the opportunity to better characterize the rootstocks and to enlarge the diversity used in the vineyard.

Delaying irrigation initiation linearly reduces yield with little impact on maturity in Pinot noir

When to initiate irrigation is a critical annual management decision that has cascading effects on grapevine productivity and wine quality in the context of climate change. A multi-site trial was begun in 2021 to optimize irrigation initiation timing using midday stem water potential (ψstem) thresholds characterized as departures from non-stressed baseline ψstemvalues (Δψstem). Plant material, vine and row spacing, and trellising systems were concomitant among sites, while vine age, soil type, and pruning systems varied. Five target Δψstem thresholds were arranged in an RCBD and replicated eight times at each site: 0.2, 0.4, 0.6, 0.8, and 1.0 MPa (T1, T2, T3, T4, and T5, respectively). When thresholds were reached, plots were irrigated weekly at 70% ETc. Yield components and berry composition were quantified at harvest. To better generalize inferences across sites, data were analyzed by ANOVA using a mixed model including site as a random factor. Across sites, irrigation was initiated at Δψstem = 0.24, 0.50, 0.65, 0.93, and 0.98 MPa for T1, T2, T3, T4, and T5, respectively. Consistent significant negative linear trends were found for several key yield and berry composition variables. Yield decreased by 12.9, 15.9, 19.5, and 27.4% for T2, T3, T4, and T5, respectively, compared to T1 (p < 0.0001) across sites that were driven by similarly linear reductions in berry weight (p < 0.0001). Comparatively, berry composition varied little among treatments. Juice total soluble solids decreased linearly from T1 to T5 – though only ranged 0.9 Brix (p = 0.012). Because producers are paid by the ton, and contracts simply stipulate a target maturity level, first-year results suggest that there is no economic incentive to induce moderate water deficits before irrigation initiation, regardless of vineyard site. Subsequent years will further elucidate the carryover effects of delaying irrigation initiation on productivity over the long term.

Modeling island and coastal vineyards potential in the context of climate change

Climate change impacts regional and local climates, which in turn affects the world’s wine regions. In the short term, these modifications rises issues about maintaining quality and style of wine, and in a longer term about the suitability of grape varieties and the sustainability of traditional wine regions. Thus, adaptation to climate change represents a major challenge for viticulture. In this context, island and coastal vineyards could become coveted areas due to their specific climatic conditions. In regions subject to warming, the proximity of the sea can moderate extremes temperatures, which could be an advantage for wine. However, coastal and island areas are particular prized spaces and subject to multiple pressures that make the establishment or extension of viticulture complex.
In this perspective, it seems relevant to assess the potentialities of coastal and island areas for viticulture. This contribution will present a spatial optimization model that tends to characterize most suitable agroclimatic patterns in historical or emerging vineyards according to different scenarios. Thanks to an in-depth bibliography a global inventory of coastal and insular vineyards on a worldwide scale has been realized. Relevant criteria have been identified to describe the specificities of these vineyards. They are used as input data in the optimization process, which will optimize some objectives and spatial aspects. According to a predefined scenario, the objectives are set in three main categories associated with climatic characteristics, vineyards characteristics and management strategies. At the end of this optimization process, a series of maps presents the different spatial configurations that maximize the scenario objectives.