Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account. They are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
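The segmentation and decomposition steps can be illustrated with a minimal Python sketch. It is not the published implementation: the function names, the window width and overlap, the synthetic data, and in particular the use of a truncated SVD of the unfolded segment as a stand-in for the multiway decomposition are all assumptions made for illustration.

```python
import numpy as np


def segment_bounds(n_scans, width, overlap=0):
    """Return (start, stop) scan indices of windows along the retention time axis."""
    if width <= 0 or not 0 <= overlap < width:
        raise ValueError("need width > 0 and 0 <= overlap < width")
    step = width - overlap
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_scores(segment, n_components):
    """Stand-in decomposition (assumption): truncated SVD of the sample-mode unfolding.

    segment has shape (samples, scans, m/z channels). The result has shape
    (samples, n_components) and holds one score vector per sample.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)  # samples x (scans * channels)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]


def build_feature_matrix(data, width, overlap=0, n_components=2):
    """Segment the data, decompose each segment, and stack the sample scores."""
    blocks = [
        segment_scores(data[:, start:stop, :], n_components)
        for start, stop in segment_bounds(data.shape[1], width, overlap)
    ]
    return np.hstack(blocks)


# Synthetic stand-in for raw GC-MS data: 6 samples, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))

bounds = segment_bounds(data.shape[1], width=30, overlap=10)
features = build_feature_matrix(data, width=30, overlap=10, n_components=2)
print(bounds)
print(features.shape)
```

The resulting sample-by-feature matrix is the kind of input that a gradient boosted tree classifier (for example scikit-learn's `GradientBoostingClassifier`) could be trained on, given class labels for the samples. Because each segment is decomposed on its own, no peak picking or retention time alignment across samples is needed in this sketch.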

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
