Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel automated approach consists of segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
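The segmentation step can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `segment_bounds` and `segment_sample`, the fixed segment width, the overlap, and the toy intensities are all assumptions made for illustration, and the subsequent transformation, multiway decomposition, and classification steps are not shown.

```python
def segment_bounds(n_scans, width, overlap=0):
    """Return (start, stop) scan-index pairs covering scans 0..n_scans.

    width and overlap are hypothetical tuning parameters; the abstract
    does not state how segments are sized.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    if not 0 <= overlap < width:
        raise ValueError("overlap must satisfy 0 <= overlap < width")
    step = width - overlap
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break  # last segment reaches the end of the run
        start += step
    return bounds


def segment_sample(chromatogram, bounds):
    """Cut one sample (a list of scans, each a list of m/z intensities)."""
    return [chromatogram[start:stop] for start, stop in bounds]


# Toy data (made up): 2 samples, 10 scans, 3 m/z channels per scan.
samples = [
    [[float(scan + mz + k) for mz in range(3)] for scan in range(10)]
    for k in range(2)
]

bounds = segment_bounds(n_scans=10, width=4, overlap=1)
per_sample = [segment_sample(s, bounds) for s in samples]

# Regroup so that segments[i][k] is sample k within segment i, i.e. one
# three-way array (samples x scans x m/z) per retention time segment.
segments = [list(group) for group in zip(*per_sample)]
```

Each entry of `segments` is then a small samples × scans × m/z array that could be passed to a multiway decomposition on its own, which is why no peak detection or alignment across the whole run is needed at this stage.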

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
