Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate ethnography and wine environmental futures

Globalisation and climate change have radically transformed world wine production upsetting the established order of wine ecologies. Ecological risks and the future of traditional agricultural systems are widely debated in anthropology, but very little is understood of the particular challenges posed by climate change to viticulture which is seen by many as the canary in the coalmine of global agriculture. Moreover, wine as a globalised embedded commodity provides a particularly telling example for the study of climate change having already attracted early scientific attention. Studies of climate change in viticulture have focused primarily on the production of systematic models of adaptation and vulnerability, while the human and cultural factors, which are key to adaptation and sustainable futures, are largely missing. Climate experts have been unanimous in recognising the urgent need for a better understanding of the complex dynamics that shape how climate change is experienced and responded to by human systems. Yet this call has not yet been addressed. Climate ethnography, coined by the anthropologist Susan Crate (2011), aims to bridge this growing disjuncture between climate science and everyday life through the exploration of the social meaning of climate change. It seeks to investigate the confrontation of its social salience in different locations and under different environmental guises (Goodman 2018: 340). By understanding how wine producers make sense of the world (and the environment) and act in it, it proposes to focus on the co-production of interdisciplinary knowledge by identifying and foreshadowing problems (Goodman 2018: 342; Goodman & Marshall 2018). It seeks to offer an original, transformative and contrasted perspective to climate change scenarios by investigating human agency -individual or collective- in all its social, political and cultural diversity. An anthropological approach founded on detailed ethnographies of wine production is ideally placed to address economic, social and cultural disruptions caused by the emergence of these new environmental challenges. Indeed, the community of experts in environmental change have recently called for research that will encompass the human dimension and for more broad-based, integrated through interdisciplinarity, useful knowledge (Castree & al 2014). My paper seeks to engage with climate ethnography and discuss what it brings to the study of wine environmental futures while exploring the limitations of the anthropological environmental approach.

Use of multispectral satellite for monitoring vine water status in mediterranean areas

The development of new generations of multispectral satellites such as Sentinel-2 opens possibilities as to vine water status assessment (Cohen et al., 2019). Based on a three years field campaign, a model of Stem Water Potential (SWP) estimation on vine using four satellite bands in Red, Red-Edge, NIR and SWIR domains was developed (Laroche-Pinel et al., 2021). The model relies on SWP field measures done using a pressure chamber (Scholander et al., 1965), which is a common, robust and precise method to assess vine water status (Acevedo-Opazo et al., 2008). The model was mainly developed from from SWP measures on Syrah N (Laroche Pinel E., 2021).

A large scale monitoring was organized in different vineyards in the Mediterranean region in 2021. 10 varieties amongst the most represented in this area were monitored (Cabernet sauvignon N, Chardonnay B, Cinsault N, Grenache N, Merlot N, Mourvèdre N, Sauvignon B, Syrah N, Vermentino B, Viognier B). The model was used to produce water status maps from Sentinel-2 images, starting from the beginning of June (fruit set) up to September (harvest). The average estimated SWP for each vine was compared to actual field SWP measures done by wine growers or technicians during usual monitoring of irrigation programs. The correlations between mean estimated SWP and mean measured SWP were at the same level than expected by the model. (Laroche Pinel, 2021) The general SWP kinetics were comparable. The estimated SWP would have led to same irrigation decisions concerning the date of first irrigation in comparison with measured SWP.

Acevedo-Opazo, C., Tisseyre, B., Ojeda, H., Ortega-Farias, S., Guillaume, S. (2008). Is it possible to assess the spatial variability of vine water status? OENO One, 42(4), 203.
Cohen, Y., Gogumalla, P., Bahat, I., Netzer, Y., Ben-Gal, A., Lenski, I., … Helman, D. (2019). Can time series of multispectral satellite images be used to estimate stem water potential in vineyards? In Precision agriculture ’19, The Netherlands: Wageningen Academic Publishers, pp. 445–451.
Laroche-Pinel, E., Duthoit, S., Albughdadi, M., Costard, A. D., Rousseau, J., Chéret, V., & Clenet, H. (2021). Towards vine water status monitoring on a large scale using sentinel-2 images. remote sensing, 13(9), 1837.
Laroche-Pinel,E. (2021). Suivi du statut hydrique de la vigne par télédétection hyper et multispectrale. Thèse INP Toulouse, France.
Scholander, P.F., Bradstreet, E.D., Hemmingsen, E.A., & Hammel, H.T. (1965). Sap pressure in vascular plants: Negative hydrostatic pressure can be measured in plants. Science, 148(3668), 339–346.

Long-term drought resilience of traditional red grapevine varieties from a semi-arid region

In recent decades, the scarcity of water resources in agriculture in certain areas has been aggravated by climate change, which has caused an increase in temperatures, changes in rainfall patterns, as well as an increase in the frequency of extreme phenomena such as droughts and heat waves. Although the vine is considered a drought-tolerant specie, it has to satisfy important water requirements to complete its cycle, which coincides with the hottest and driest months. Achieving sustainable viticulture in this scenario requires high levels of efficiency in the use of water, a scarce resource whose use is expected to be severely restricted in the near future. In this regard, the use of drought-tolerant varieties that are able to maintain grape yield and quality could be an effective strategy to face this change. During three consecutive seasons (2018-2020) the behavior in rainfed regime of 13 traditional red grapevine varieties of the Spain central region was studied. These varieties were cultivated in a collection at Centro de Investigación de la Vid y el Vino de Castilla-La Mancha (IVICAM-IRIAF) located in Tomelloso (Castilla-La Mancha, Spain). Yield components (yield, mean bunch and berry weight, pruning weight), physicochemical parameters of the musts (brix degree, total acidity, pH) and some physiological parameters related with water stress during ripening period (δ13C, δ18O) were analysed. The application of different statistical techniques to the results showed the existence of significant differences between varieties in their response to stressful conditions. A few varieties highlighted for their high ability to adapt to drought, being able to maintain high yields due to their efficiency in the use of water. In addition, it was possible quantify to what extent climate can be a determinant in the δ18O of musts under severe water stress conditions.

Variety and climatic effects on quality scores in the Western US winegrowing regions

Wine quality is strongly linked to climate. Quality scores are often driven by climate variation across different winegrowing regions and years, but also influenced by other aspects of terroir, including variety. While recent work has looked at the relationship between quality scores and climate across many European regions, less work has examined New World winegrowing regions. Here we used scores from three major rating systems (Wine Advocate, Wine Enthusiast and Wine Spectator) combined with daily climate and phenology data to understand what drives variation across wine quality scores in major regions of the Western US, including regions in California, Oregon and Washington. We examined effects of variety, region, and in what phenological period climate was most predictive of quality. As in other studies, we found climate, based mainly on growing degree day (GDD) models, was generally associated with quality—with higher GDD associated with higher scores—but variety and region also had strong effects. Effects of region were generally stronger than variety. Certain varieties received the highest scores in only some areas, while other varieties (e.g., Merlot) generally scored lower across regions. Across phenological stages, GDD during budbreak was often most strongly associated with quality. Our results support other studies that warmer periods generally drive high quality wines, but highlight how much region and variety drive variation in scores outside of climate.

Traditional agroforestry vineyards, sources of inspiration for the agroecological transition of viticulture

A unique “terroir” can be found in southern Bolivia, which combines the specific features of climate, topography and altitude of high valleys, with the management of grapevines staked on trees. It is one of the rare remnants of agroforestry viticulture. A survey was carried out among 29 grapegrowers in three valleys, to characterize the structure and management of these vineyards, and identify the services they expect from trees. Farms were small (2.2 ha on average) and 85% of vineyards were less than 1 ha. Viticulture was associated with vegetable, fruit and fodder production, sometimes in the same fields. Molle trees were found in all plots, together with one or two other native tree species. Traditional grapevine varieties such as Negra Criolla, Moscatel de Alejandría and Vicchoqueña were grown with a large range of densities from 1550 to 9500 vines ha-1. From 18 to 30% of them were staked on trees, with 1.2 to 4.9 vines per tree. The management of these vineyards (irrigation, fertilization and grapevine protection) was described, the most particular technical operation being the coordinated pruning of trees and grapevines. Three types of management could be identified in the three valleys. Grapegrowers had a clear idea of the ecosystem services they expected from trees in their vineyards. The main one was protection against climate hazards (hail, frost, flood). Then they expected benefits in terms of pest and disease control, improvement of soil fertility and resulting yield. At last, some producers claimed that tree-staking was quicker and cheaper than conventional trellising. It can be hypothesized then that agroforestry is a promising technique for the agroecological transition of viticulture. Its contribution to the “terroir” of the high valleys of southern Bolivia and its link with the specificities of the wines and spirits produced there remain to be explored.