Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

First step in the preparation of a soil map of the Protected Designation of Origin Valdepeñas (Central, Spain)

This work is a first step to make a map of vineyard soils. The characterization of the soils of the Protected Designation of Origin (D.P.O.) Valdepeñas will allow to group the studied profiles according to their physico-chemical characteristics and the concentrations of most relevant chemical elements. 90 soil profiles were analysed throughout the territory and the soils were sampled and described according to FAO (2006) and classified according to and Soil Taxonomy (2014). All samples were air dried, sieved and some physico-chemical parameters were determined following standard protocols. Also, major and trace elements were analysed by X-ray fluorescence. The statistically study was made using the SPSS program. Trend maps were made using the ArcGIS program. The studied soils have the following average properties: pH, 8.3; electrical conductivity, 0,20 dS/m (low); clay, 18.8% (medium) and CaCO3, 17.1% (high). In the study for the major elements. The major elements of these soils are Si, followed by Ca and Al, with an average content of 203.7 g/kg, 105.5 g/kg and 74.0 g/kg respectively. On the other hand, 27 trace elements have been studied. Of all of them, it can be highlighted the average values of Ba (361.8 mg/kg), Sr (129.3 mg/kg), Rb (83.4 mg/kg), V (74.2 mg/kg) and Ce (70.6 mg/kg). Ba, V and Ce values are higher and the values of Sr and Rb are lower to those found in the literature. The discriminant analysis shows a percentage of grouping of 91%. The content of chemical elements together with the physico-chemical characteristics allows grouping the soils in 4 group according to their order in the classification to Soil Taxonomy; due to the importance of the Calcisols in Castilla-La Mancha, it has been decided to establish them as their own group even if they do not appear in Soil Taxonomy classification.

Copper contamination in vineyard soils of Bordeaux: spatial risk assessment for the replanting of vines and crops

Copper (Cu) is widely and historically used in viticulture as a fungicide against mildew. Cu has a strong affinity for soil organic matter and accumulates in topsoil horizons. Thus, Cu may negatively affect soil organisms and plants, consequently reducing soil fertility and productivity. The Bordeaux vineyards have the largest vineyard surfaces (26%) within French controlled appellation and a great proportion of French wine production (around 5 million hl per year). Considering the local context of vineyard surfaces decreasing (vine uprooting) and possible new crop plantation, the issue of Cu potential toxicity rises. Therefore, the aims of this work are firstly to evaluate the Cu contamination in vineyard soils of Bordeaux, secondly to produce a risk assessment map for new vine or crop plantation. We used soil analyses from several local studies to build a database with 4496 soil horizon samples. The database was enhanced by means of pedotransfer functions in order to estimate the bioaccessible (EDTA-extractable) Cu in soils of samples without measurements. From this database, 1797 georeferenced samples with CuEDTA concentrations in the topsoil (0-50 cm depth) were used for kriging interpolation in order to produce the spatial distribution map of CuEDTA in vineyard soils. Then, the spatial distribution of Cu was crossed with vine uprooting surfaces and municipality boundaries. CuEDTAconcentrations ranged from 0.52 to 459 mg/kg and showed clear anomalies. Our results from spatial analysis showed that almost 50% of vineyard soil surfaces have CuEDTA concentrations higher than 30 mg/kg (moderate risk for new plantation) and 20% with concentrations higher than 50 mg/kg (high risk for new plantation). A decision-support map based on municipalities was realised to provide a simple tool to stakeholders concerned by land use management.

Phenolic composition of Tempranillo Blanco grapes changes after foliar application of urea

Our research aimed to determine the effect and efficiency of foliar application of urea on the phenolic composition of Tempranillo Blanco grapes. The field experiment was carried out in 2019 and 2020 seasons and the plot was located in D.O.Ca Rioja (North of Spain). The vineyard was Vitis vinifera L. Tempranillo Blanco and grafted on Richter-110 rootstock. The treatments were control (C), whose plants were sprayed with water and three doses of urea: plants were sprayed with urea 3 kg N/ha (U3), 6 kg N/ha (U6) and 9 kg N/ha (U9). The applications were performed in two phenological stages, pre-veraison (Pre) and veraison (Ver). Also, each of the treatments was repeated one week later. Control and treatments were performed in triplicate and arranged in a randomised block design. Grapes were harvested at optimum ripening stage. High-performance liquid chromatography was used to analyse the phenolic composition of the grapes. Finally, the results obtained from the analytical determinations – flavonols, flavanols and non-flavonoid (hydroxybenzoic acids, hydroxycinnamic acids and stilbenes) – were studied statistically by analysis of variance. The results showed that, in 2019, U6-Pre and U9-Pre treatments increased the hydroxybenzoic acid content in grapes, and also all foliar treatments applied at Pre enhanced the stilbene concentration. Moreover, U3-Ver was the only treatment that rose flavonol and stilbene contents in the Tempranillo Blanco grapes. In 2020, all treatments applied at Pre enhanced the flavonol concentration in grapes. Furthermore, U3-Pre and U9-Pre treatments increased stilbene content in grapes. Nevertheless, the hydroxybenzoic acid content was improved by U6-Ver and U9-Ver and besides, hydroxycinnamic acid concentration in grapes was increased by all treatments applied at Ver. In conclusion, the lower and highest dose of urea (U3 and U9), applied at pre-veraison, were the best treatments to improve the Tempranillo Blanco grape phenolic composition.

The use of rootstock as a lever in the face of climate change and dieback of vineyard

As viticulture faces challenges such as climate change or vineyard dieback, the choice of the variety and rootstock becomes more and more crucial. To study rootstock levers in the Bordeaux region, a parcel of Cabernet Sauvignon (CS) was planted with four rootstocks in 2014. Twenty repetitions of each of the following four rootstocks were set up: 101-14 MGt, Nemadex AB, 420A MGt and Gravesac. The number of bunches, yields and pruning weights of the vine shoots were measured individually on 240 vines from 2017 to 2021. Since 2020, nitrogen status assessed by assimilable nitrogen level, hydric status assessed by δ13C and berry maturity were measured on 80 samples taken from 20 repetitions of the four rootstocks. A lower yield was measured for CS grafted onto Nemadex AB due to the lower number of bunches and the lower weight of berries. The differences between the other three rootstocks are small, but CS grafted onto 420A MGt was the most productive. The CS grafted onto Nemadex AB had the lowest pruning weight while 101-14 MGt had the highest. In 2020, δ13C showed a more moderate water stress with 101-14 MGt and 420A MGt than with Nemadex AB. Surprisingly, the Gravesac was under more stress than the 101-14 MGt. The nitrogen status in the berries was better for Nemadex AB but this was perhaps due to the significantly lower weight of the berries.Rootstock 101-14 MGt attained the highest accumulation of sugars in the berries while 420A MGt allows to preserve higher acidity. The parcel is still young which may explain some of the results. These measures must therefore be continued over the next several years to fully assess the effects of these rootstocks on the development of the vines and the quality of the production under new climatic conditions.

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions leading to grapevine phenological shifts as well altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening revealing altered traits in time to target specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies with forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in DOY to target specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series monitored at ground weather station by means of hourly air temperature and rainfall data were analyzed. Climate statistics were obtained and for each time period (month, bimester, quarter and year) weather anomalies were identified. A linear regression analysis was performed to assess a possible correlation that may exist between ripening and weather anomalies. For each cultivar, ripeness advances or delays expressed in number of days to target the specific ripening threshold were assessed in relation to registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and spring quarter are key to understanding the effect of climate change on sugar ripeness. Minimum temperatures of May-June bimester and maximum temperatures of spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.