Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Effect of the commercial inoculum of arbuscular mycorrhiza in the establishment of a commercial vineyard of the cultivar “Manto negro

The favorable effect of symbiosis with arbuscular mycorrhizal fungi (AMF) has been known and studied since the 60s. Nowadays, many companies took the chance to start promoting and selling commercial inoculants of AMF, in order to be used as biofertilizers and encourage sustainable biological agriculture. However, the positive effect of these commercial biofertilizers on plant growth is not always demonstrated, especially under field conditions. In this study, we used a commercial inoculum on newly planted grapevines of a local cultivar grafted on a common rootstock R110. We followed the physiological status of vines, growth and productivity and functional biodiversity of soil bacteria during the first and second years of 20 inoculated with commercial inoculum bases on Rhizophagus irregularis and Funeliformis mosseaeAMF at field planting time and 20 non-inoculated control plants. All the parameters measured showed a neutral to negative effect on plant growth and production. The inoculated plants always presented lower values of photosynthesis, growth and grape production, although in some cases the differences did not reach statistical significance. On the contrary, the inoculation supposed an increase of the bacterial functional diversity, although the differences were not statistically significant either. Several studies show that the effect of inoculation with AMF is context-dependent. The non-favorable effects are probably due to inoculation ineffectiveness under complex field conditions and/or that, under certain conditions, AMF presence may be a parasitic association. This puts into question the effectiveness of its application in the field. Therefore, it is recommended to only resort to this type of biofertilizer when the cultivation conditions require it (e.g., very low previous microbial diversity, foreseeable stress due to drought, salinity, or lack of nutrients) and not as a general fertilization practice.

Modulation of berry composition by different vineyard management practices

High concentration of sugars in grapes and alcohol in wines is one of the consequences of climate change on viticulture production in several wine-growing regions. In order to investigate the possibilities of adaptation of vineyard management practices aimed to reduce the accumulation of sugar during the maturation phase without reducing the accumulation of anthocyanins in grapes, a study with severe shoot trimming, shoot thinning, cluster thinning and date of harvest was conducted on Merlot variety in Istria region (Croatia), under the Mediterranean climate. Four factors which may affect grape maturation and its composition at harvest were investigated in a two-years experiment; severe shoot trimming applied at veraison when >80% of berries changed colour (in comparison to untreated control), shoot thinning (0 and 30%), cluster thinning (0 and 30%), and the date of harvest (early and standard harvest dates). Shoot thinning had no significant impact on berry composition, despite the obtained reduction in yield per vine. Lower Brix in grapes were obtained with earlier harvest date and if no cluster thinning was applied, although at the same time a reduction in the concentration of anthocyanins in berries was observed in these treatments. On the other hand, if severe shoot trimming was applied when >80% of berries changed colour, a reduction of Brix was obtained without a negative impact on berry anthocyanins concentration. We conclude that in cases when undesirably high sugar concentrations at harvest are expected, severe shoot trimming at 80% veraison may effectively be used in order to obtain moderate sugar concentration in berries together with the adequate phenolic composition.

Spatiotemporal patterns of chemical attributes in Vitis vinifera L. cv. Cabernet Sauvignon vineyards in Central California

Spatial variability of vine productivity in winegrapes is important to characterise as both yield and quality are relevant for the production of different wine styles and products. The objectives were to understand how patterns of variability of Cabernet Sauvignon fruit composition changed over time and space, how these patterns could be characterised with indirect measurements, and how spatial patterns of the variation in fruit compositional attributes can aid in improving management. Prior to the 2017 vintage, 125 data vines were distributed across each of four vineyards in the Lodi American Viticultural Area (AVA) of California. Each data vine was sampled at commercial harvest in 2017, 2018, and 2019. Yield components and fruit composition were measured at harvest for each data vine, and maps of yield and fruit composition were produced for eight ‘objective measures of fruit quality’: total anthocyanins, polymeric tannins, quercetin glycosides, malic acid, yeast assimilable nitrogen, β-damascenone, C6 alcohols and aldehydes, and 3-isobutyl-2-methoxypyrazine. Patterns of variation in anthocyanins and phenolic compounds were found to be most stable over time. Given this relative stability, management decisions focused on fruit quality could be based on zonal descriptions of anthocyanins or phenolics to increase profitability in some vineyards. In each vineyard, dormant season pruning weights and soil cores were collected at each location, elevation and soil apparent electrical conductivity surveys were completed, and remotely sensed imagery was captured by fixed wing aircraft and two satellite platforms at major phenological stages. The data collected were used to develop relationships among biophysical data, soil, imagery, and fruit composition. The standardised and aggregated samples from four vineyards over three seasons were included in the estimation of ‘common variograms’ to assess how this technique could aid growers in producing geostatistically rigorous maps of fruit composition variability without cumbersome, single season sampling efforts.

Heatwaves and grapevine yield in the Douro region, crop model simulations

Heatwaves or extreme heat events can be particularly harmful to agriculture. Grapevines grown in the Douro winemaking region are particularly exposed to this threat, due to the specificities of the already warm and dry climatic conditions. Furthermore, climate change simulations point to an increase in the frequency of occurrence of these extreme heat events, therefore posing a major challenge to winegrowers in the Mediterranean type climates. The current study focuses on the application of the STICS crop model to assess the potential impacts of heatwaves in grapevine yields over the Douro valley winemaking region. For this purpose, STICS was applied to grapevines using high-resolution weather, soil and terrain datasets over the Douro. To assess the impact of heatwaves, the weather dataset (1989-2005) was artificially modified, generating periods with anomalously high temperatures (+5 ºC), at certain onset dates and with specific durations (from 5 to 9 days). The model was run with this modified weather dataset and results were compared to the original unmodified runs. The results show that heatwaves can have a very strong impact on grapevine yields, strongly depending on the onset dates and duration of the heatwaves. The highest negative impacts may result in a decrease in the yield by up to -35% in some regions. Despite some uncertainties inherent to the current modelling assessment, the present study highlights the negative impacts of heatwaves on viticultural yields in the Douro region, which is critical information for stakeholders within the winemaking sector for planning suitable adaptation measures.

Metabolomic discrimination of grapevine water status for Chardonnay and Pinot noir

Water status impact in viticulture has been widely explored, as it strongly affects grapevine physiology and grape chemical composition. It is considered as a key component of vitivinicultural terroir. Most of the studies concerning grapevine water status have focused on either physiological traits, or berry compounds, or traits involved in wine quality. Here, the response of grapevine to water availability during the ripening period is assessed through non-targeted metabolomics analysis of grape berries by ultra-high resolution mass spectrometry. The grapevine water status has been assessed during 2 consecutive years (2019 & 2020), through carbon isotope discrimination on juices from berries collected at maturity (21.5 brix approx.) for 2 Vitis vinifera cv. Pinot noir (PN) and Chardonnay (CH). A total of 220 grape juices were collected from 5 countries worldwide (Italy; Argentina; France; Germany; Portugal). Measured δ13C (‰) varied from -28.73 to -22.6 for PN, and from -28.79 to -21.67 for CH. These results also clearly revealed higher water stress for the 2020 vintage. The same grape juices have been analysed by Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FT-ICR-MS) and Liquid Chromatography coupled to Mass Spectrometry (LC-qTOF-MS), leading to the detection of up to 4500 CHONS containing elemental compositions, and thus likely tens of thousands of individual compounds, which include fatty acids, organic acids, peptides, phenolics, also with high levels of glycosylation. Multivariate statistical analysis revealed that up to 160 elemental compositions, covering the whole range of detected masses (100 –1000 m/z), were significantly correlated to the observed gradients of water status. Examples of chemical markers, which are representative of these complex fingerprints, include various derivatives of the known abscisic acid (ABA), such as phaesic acid or abscisic acid glucose ester, which are significantly correlated with higher water stress, regardless of the variety. Cultivar-specific behaviours could also be identified from these fingerprints. Our results provide an unprecedented representation of the metabolic diversity, which is involved in the water status regulation at the grape level, and which could contribute to a better knowledge of the grapevine mitigation strategy in a climate change context.