Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Variety and climatic effects on quality scores in the Western US winegrowing regions

Wine quality is strongly linked to climate. Quality scores are often driven by climate variation across different winegrowing regions and years, but also influenced by other aspects of terroir, including variety. While recent work has looked at the relationship between quality scores and climate across many European regions, less work has examined New World winegrowing regions. Here we used scores from three major rating systems (Wine Advocate, Wine Enthusiast and Wine Spectator) combined with daily climate and phenology data to understand what drives variation across wine quality scores in major regions of the Western US, including regions in California, Oregon and Washington. We examined effects of variety, region, and in what phenological period climate was most predictive of quality. As in other studies, we found climate, based mainly on growing degree day (GDD) models, was generally associated with quality—with higher GDD associated with higher scores—but variety and region also had strong effects. Effects of region were generally stronger than variety. Certain varieties received the highest scores in only some areas, while other varieties (e.g., Merlot) generally scored lower across regions. Across phenological stages, GDD during budbreak was often most strongly associated with quality. Our results support other studies that warmer periods generally drive high quality wines, but highlight how much region and variety drive variation in scores outside of climate.

Different soil types and relief influence the quality of Merlot grapes in a relatively small area in the Vipava Valley (Slovenia) in relation to the vine water status

Besides location and microclimatic conditions, soil plays an important role in the quality of grapes and wine. Soil properties influence…

Use of multispectral satellite for monitoring vine water status in mediterranean areas

The development of new generations of multispectral satellites such as Sentinel-2 opens possibilities as to vine water status assessment (Cohen et al., 2019). Based on a three years field campaign, a model of Stem Water Potential (SWP) estimation on vine using four satellite bands in Red, Red-Edge, NIR and SWIR domains was developed (Laroche-Pinel et al., 2021). The model relies on SWP field measures done using a pressure chamber (Scholander et al., 1965), which is a common, robust and precise method to assess vine water status (Acevedo-Opazo et al., 2008). The model was mainly developed from from SWP measures on Syrah N (Laroche Pinel E., 2021).

A large scale monitoring was organized in different vineyards in the Mediterranean region in 2021. 10 varieties amongst the most represented in this area were monitored (Cabernet sauvignon N, Chardonnay B, Cinsault N, Grenache N, Merlot N, Mourvèdre N, Sauvignon B, Syrah N, Vermentino B, Viognier B). The model was used to produce water status maps from Sentinel-2 images, starting from the beginning of June (fruit set) up to September (harvest). The average estimated SWP for each vine was compared to actual field SWP measures done by wine growers or technicians during usual monitoring of irrigation programs. The correlations between mean estimated SWP and mean measured SWP were at the same level than expected by the model. (Laroche Pinel, 2021) The general SWP kinetics were comparable. The estimated SWP would have led to same irrigation decisions concerning the date of first irrigation in comparison with measured SWP.

Acevedo-Opazo, C., Tisseyre, B., Ojeda, H., Ortega-Farias, S., Guillaume, S. (2008). Is it possible to assess the spatial variability of vine water status? OENO One, 42(4), 203.
Cohen, Y., Gogumalla, P., Bahat, I., Netzer, Y., Ben-Gal, A., Lenski, I., … Helman, D. (2019). Can time series of multispectral satellite images be used to estimate stem water potential in vineyards? In Precision agriculture ’19, The Netherlands: Wageningen Academic Publishers, pp. 445–451.
Laroche-Pinel, E., Duthoit, S., Albughdadi, M., Costard, A. D., Rousseau, J., Chéret, V., & Clenet, H. (2021). Towards vine water status monitoring on a large scale using sentinel-2 images. remote sensing, 13(9), 1837.
Laroche-Pinel,E. (2021). Suivi du statut hydrique de la vigne par télédétection hyper et multispectrale. Thèse INP Toulouse, France.
Scholander, P.F., Bradstreet, E.D., Hemmingsen, E.A., & Hammel, H.T. (1965). Sap pressure in vascular plants: Negative hydrostatic pressure can be measured in plants. Science, 148(3668), 339–346.

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldomly thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Lose Rate (YLR). Additionally, we also investigate whether an advanced flowering-veraison phase may help alleviating the water stress with improved yield. A process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is being implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest wine regions with stronger flowering-veraison CWSI tend to have higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tends to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter possesses severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, which are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR. For those of France-Germany-Luxembourg, this can have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because of shifted phase towards a cooler growing season with reduced evaporative demands. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.

Organic recycled mulches in sustainable viticulture: assessment of spontaneous plants communities and weed coverage

In recent years, developing more efficient and sustainable viticulture management has been essential due to the impact of climate change in semiarid regions. For this reason, the use of recycled organic mulching (ROM) in the vineyard has become an interesting strategy to cope with water stress, isolated soil from extreme temperatures and improving soil humidity, control the presence of weeds and therefore reduce the inputs of herbicides and improve soil fertility. This work aimed to analyse the effect of three different organic mulches [straw (S), grape pruning debris (GPD) and spent mushroom compost (SMC)] and two traditional soil management techniques [herbicide (H) and interrow (IN)] on weed coverage and the spontaneous plant communities’ presence. Data sampling was collected throughout the vine vegetative cycle of 2021 in La Rioja, Spain. The different soil management techniques had a clear effect on weed coverage and his development during the vine vegetative cycle. SMC and H were the treatments with the highest and the lowest coverage percentage, respectively. IN had a delayed weed emergence at the beginning of the vine vegetative cycle, but finally it reached maximum values nearby SMC. GPD and S had similar effects on weed emergence, reaching 25-30% of the maximum coverage values. A total of 29 herbaceous species were identified during the vegetative cycle, some of them very isolated and occasional. Principal component analysis (PCAs) showed a good association between spontaneous species and treatments, furthermore, specific species-treatment associations were found. Moreover, three clear groups of herbaceous communities were identified by cluster analysis. This study provides interesting information about the effect of different alternative soil management on herbaceous plant coverage and weed species communities which could contribute to making more sustainable viticulture.