Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Influence of a spontaneous cover crop on the vineyard and soil erosion under Mediterranean climate

Sixty five % of the agricultural area of the Basque Country located in the DO Ca Rioja corresponds to vineyards. More than 40% of it has an average slope greater than 10%, which makes it sensitive to erosive processes. Furthermore, it is foreseeable that extreme weather events (storms, hail, extreme heat and cold, etc.) will be favored due to climate change. Cover cropping can mitigate this risk, and therefore the objective of this work is to evaluate the impact that a vegetable cover has on the agronomic behavior of the vineyard, the quality of the grape and soil erosion. For this, a trial has been carried out with a Graciano variety vineyard with a slope between 10% -20% during the years 2020 and 2021. Conventional tillage management in the area has been compared (4-6 passes per year of tillage machinery) versus spontaneous vegetation cover management in the vineyard. This implies not tilling and allowing the grass of the land to colonize the range between the lines of vines, controlling their height through 1-3 mowing passes per year, always trying to affect the surface of the land as little as possible. The vegetative growth, yield and quality of the grape and wine was measured. Furthermore, erosion has been measured using Gerlasch boxes. The yield was lower in the second year of the trial in the cover crop treatment, but erosion was significantly reduced.

Spatial determination of areas in the Western Balkans region favorable for organic production

In problematic conditions for production of grapes and wine caused by the COVID-19 pandemic and the resulting occurrence of wine surpluses, producers are increasingly turning to the innovative viticulture and winemaking of products that are more appealing to the market and the consumers. On the other hand, consumption of the food safety or organic products, and therefore of organic grapes and wine, is increasingly common in the world, in particular in Europe. The Regional Rural Development Standing Working Group (SWG RRD), as a regional intergovernmental organization gathers actors in the viticulture and winemaking sector from states and territories of the Western Balkans (South-East Europe) in the Expert Working Group for Wine, with the aim of improving viticulture and winemaking in this region through joint activities. In accordance with the aforementioned, the SWG RRD is working on advancing organic production of grapes and wine, and on recognition of specificities of the terroir of wine-growing areas in Western Balkans. In addition, as part of the project “Facilitation of Exchange and Advice on Wine Regulations in Western Balkan Countries” helmed by the German Federal Ministry of Food and Agriculture, in addition to harmonization of relevant legislation with EU regulations, efforts are being invested towards recognition of organic wines. Within activities and project implemented by this organization, expert analyses and scientific research of the terroir of Western Balkans were carried out, and some of the results are presented in this paper.

20-Year-Old data set: scion x rootstock x climate, relationships. Effects on phenology and sugar dynamics

Global warming is one of the biggest environmental, social, and economic threats. In the Douro Valley, change to the climate are expected in the coming years, namely an increase in average temperature and a decrease in annual precipitation. Since vine cultivation is extremely vulnerable and influenced by the climate, these changes are likely to have negative effects on the production and quality of wine.
Adaptation is a major challenge facing the viticulture sector where the choice of plant material plays an important role, particularly the rootstock as it is a driver for adaptation with a wide range of effects, the most important being phylloxera, nematode and salt, tolerance to drought and a complex set of interactions in the grafted plant.
In an experimental vineyard, established in the Douro Region in 1997, with four randomized blocs, with five varieties, Touriga Nacional, Tinta Barroca, Touriga Franca and Tinta Roriz, grafted in four rootstocks, Rupestris du Lot, R110, 196-17C, R99 and 1103P, data was collected consecutively over 20 years (2001-2020). Phenological observations were made two to three times a week, following established criteria, to determine the average dates of budbreak, flowering and veraison. During maturation, weekly berry samples were taken to study the dynamics of sugar accumulation, amongst other parameters. Climate data was collected from a weather station located near the vineyard parcel, with data classified through several climatic indices.
The results achieved show a very low coefficient of variations in the average date of the phenophases and an important contribution from the rootstock in the dynamic of the phenology, allowing a delay in the cycle of up to10-12 days for the different combinations. The Principal Component Analysis performed, evaluating trends in the physical-chemical parameters, highlighted the effect of the climate and rootstock on fruit quality by grape varieties.

Anthocyanin profile is differentially affected by high temperature, elevated CO2 and water deficit in Tempranillo (Vitis vinifera L.) clones

Anthocyanin potential of grape berries is an important quality factor in wine production. Anthocyanin concentration and profile differ among varieties but it also depends on the environmental conditions, which are expected to be greatly modified by climate change in the future. These modifications may significantly modify the biochemical composition of berries at harvest, and thus wine typicity. Among the diverse approaches proposed to reduce the potential negative effects that climate change may have on grape quality, genetic diversity among clones can represent a source of potential candidates to select better adapted plant material for future climatic conditions. The effects of individual and combined factors associated to climate change (increase of temperature, rise of air CO2 concentration and water deficit) on the anthocyanin profile of different clones of Tempranillo that differ in the length of their reproductive cycle were studied. The aim was to highlight those clones more adapted to maintain specific Tempranillo typicity in the future. Fruit-bearing cuttings were grown in controlled conditions under two temperatures (ambient temperature versus ambient temperature + 4ºC), two CO2 levels (400 ppm versus 700 ppm) and two water regimes (well-watered versus water deficit), both in combination or independently, in order to simulate future climate change scenarios. Elevated temperature increased anthocyanin acylation, whereas elevated CO2 and water deficit favoured the accumulation of malvidin derivatives, as well as the acylation and tri-hydroxylation level of anthocyanins. Although the changes in anthocyanin profile observed followed a common pattern among clones, such impact of environmental conditions was especially noticeable in one of the most widely distributed Tempranillo clones, the accession RJ43.

De novo Vitis champinii whole genome assembly allows rootstock-specific identification of potential candidate genes for drought and salt tolerance

Vitis champinii cultivars Ramsey and Dog-ridge are main choices for rootstocks to adapt viticulture in semi-arid and arid regions thanks to their distinctive tolerance to drought and salinity. However, genetic studies on non-vinifera rootstocks have heavily relied on the grapevine (Vitis vinifera) reference genome, which difficulted the assessment of the genetic variation between rootstock species and grapevines. In the present study, this limitation is addressed by introducing a novo phased genome assembly and annotation of Vitis champinii. This new Vitis champinii genome was employed as reference for mapping RNA-seq reads from the same species under drought and salt stresses, and for comparison the same reads were also mapped to the Vitis vinifera PN40024.V4 reference genome. A significant increase in alignment rate was gained when mapping Vitis champinii RNA-seq reads to its own genome, compared to the Vitis vinifera PN40024.V4 reference genome, thus revealing the expression levels of genes specific to Vitis champinii. Moreover, differences in coding sequences were observed in ortholog genes between Vitis champinii and Vitis vinifera, which therefore challenges previous differential expression analyses performed between contrasting Vitis genotypes on the same gene from the Vitis vinifera genome. Genes with possible implications in drought and salt tolerance have been identified across the genome of Vitis champinii, and the same genomic data can potentially guide the discovery of candidate genes specific from Vitis champinii for other traits of interest, therefore becoming a valuable resource for rootstock breeding designs, specially towards increased drought and salinity due to climate change.