Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of the chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
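The three steps named above (segmentation, decomposition, classification) can be illustrated with a minimal sketch. It is not the implementation from [1, 2]: the data are synthetic, all function names are hypothetical, and a singular value decomposition of each unfolded segment stands in for the multiway decomposition. Unlike a true multiway model (for example a PARAFAC-type decomposition), this stand-in does not tolerate retention time shifts between runs. The classifier is scikit-learn's `GradientBoostingClassifier`, chosen here only as one readily available gradient boosted tree implementation.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier


def make_synthetic_runs(n_per_class=10, n_scans=40, n_mz=10, seed=0):
    """Hypothetical toy data: (samples, scans, m/z) intensities for two classes."""
    rng = np.random.default_rng(seed)
    scans = np.arange(n_scans)
    peak = np.exp(-0.5 * ((scans - 25) / 2.0) ** 2)  # Gaussian elution profile
    spectrum = rng.random(n_mz)                       # arbitrary mass spectrum
    data = rng.random((2 * n_per_class, n_scans, n_mz)) * 0.1  # noise floor
    labels = np.repeat([0, 1], n_per_class)
    # Only class 1 contains the extra compound.
    data[labels == 1] += 5.0 * np.outer(peak, spectrum)
    return data, labels


def segment_chromatograms(data, n_segments):
    """Split the retention time axis (axis 1) into roughly equal windows."""
    return np.array_split(data, n_segments, axis=1)


def sample_scores(segment, n_components):
    """Stand-in decomposition (assumption): unfold a segment to
    (samples, scans * m/z), mean-center, and keep the leading SVD scores."""
    unfolded = segment.reshape(segment.shape[0], -1)
    unfolded = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_features(data, n_segments=4, n_components=2):
    """Concatenate per-segment sample scores into one feature matrix."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([sample_scores(seg, n_components) for seg in segments])


data, labels = make_synthetic_runs()
features = build_features(data)  # shape: (samples, n_segments * n_components)
model = GradientBoostingClassifier(n_estimators=50, random_state=0)
model.fit(features, labels)
```

No peak picking or alignment step appears anywhere in the sketch: each sample is described only by its scores per retention time window. In practice the classifier would be evaluated with cross-validation on held-out samples rather than on the training data.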

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
