Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
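The segmentation and decomposition steps can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy data, the segment length and the number of components are all hypothetical, and a truncated SVD of the sample-wise unfolded segment is used as a simple stand-in for a true multiway decomposition (for example PARAFAC or PARAFAC2). The sketch assumes the data are arranged as an array of shape (samples, scans, m/z channels).

```python
import numpy as np


def segment_chromatograms(data, segment_length):
    """Split a (samples, scans, m/z) array into windows along the retention time axis.

    The last window is shorter when the number of scans is not a
    multiple of segment_length.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_length, :]
            for start in range(0, n_scans, segment_length)]


def segment_scores(segment, n_components):
    """Stand-in for the multiway decomposition (assumption, not the published method).

    Unfolds the segment to (samples, scans * m/z), mean-centers each column,
    and returns the first n_components SVD scores, one row per sample.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    centered = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]


def build_feature_matrix(data, segment_length, n_components):
    """Concatenate the per-segment scores into one (samples, features) matrix."""
    segments = segment_chromatograms(data, segment_length)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Hypothetical toy data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
toy_data = rng.random((6, 100, 20))
features = build_feature_matrix(toy_data, segment_length=25, n_components=2)
print(features.shape)  # (6, 8): 4 segments x 2 components per sample
```

Because each segment is decomposed on its own, no peak has to be detected or aligned across samples beforehand. The resulting sample-by-feature matrix is the kind of input a gradient boosted tree classifier (for instance scikit-learn's `GradientBoostingClassifier`) could be trained on with known class labels; that step is left out here to keep the sketch dependent on NumPy only.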

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
