Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
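
The first step of this strategy, segmentation along the retention time axis, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `segment_bounds` and `segment_samples`, the fixed window length, the optional overlap, and the requirement that all samples share the same number of scans are assumptions made for illustration only. Each resulting segment is a samples × scans × m/z stack that could then be passed to a multiway decomposition; in practice array libraries would replace the nested lists used here.

```python
from typing import List, Tuple

Scan = List[float]          # intensities over m/z channels for one scan
Chromatogram = List[Scan]   # scans ordered by retention time


def segment_bounds(n_scans: int, seg_len: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Return (start, stop) scan-index windows covering the retention time axis.

    Hypothetical helper: fixed-length windows with optional overlap.
    The last window is shortened if it would run past the final scan.
    """
    if seg_len <= 0 or not 0 <= overlap < seg_len:
        raise ValueError("need seg_len > 0 and 0 <= overlap < seg_len")
    step = seg_len - overlap
    bounds: List[Tuple[int, int]] = []
    start = 0
    while start < n_scans:
        stop = min(start + seg_len, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_samples(samples: List[Chromatogram], seg_len: int,
                    overlap: int = 0) -> List[List[Chromatogram]]:
    """Cut every sample at the same scan indices (no alignment is performed).

    Returns one entry per segment; each entry holds that segment's
    scans x m/z slice for every sample, i.e. a samples x scans x m/z stack.
    """
    n_scans = len(samples[0])
    if any(len(s) != n_scans for s in samples):
        raise ValueError("all samples must have the same number of scans")
    windows = segment_bounds(n_scans, seg_len, overlap)
    return [[s[a:b] for s in samples] for a, b in windows]


# Toy data (assumed, not measured): 2 samples, 10 scans, 3 m/z channels.
toy = [[[float(i + j + k) for k in range(3)] for j in range(10)] for i in range(2)]
segments = segment_samples(toy, seg_len=4, overlap=1)
```

Cutting all samples at identical scan indices is what makes a per-segment three-way stack possible without prior peak detection; small retention time shifts within a segment are then left to the decomposition step rather than corrected beforehand.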

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
