Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Analysis of Cabernet Sauvignon and Aglianico winegrape (V. vinifera L.) responses to different pedo-climatic environments in southern Italy

Water deficit is one of the most important effects of climate change able to affect agricultural sectors. In general, it determines a reduction in biomass production, and for some plants, as in the case of grapevine, it can endorse fruit quality. The monitoring and management of plant water stress in the vineyard

Use of a new, miniaturized, low-cost spectral sensor to estimate and map the vineyard water status from a mobile 

Optimizing the use of water and improving irrigation strategies has become increasingly important in most winegrowing countries due to the consequences of climate change, which are leading to more frequent droughts, heat waves, or alteration of precipitation patterns. Optimized irrigation scheduling can only be based on a reliable knowledge of the vineyard water status.

In this context, this work aims at the development of a novel methodology, using a contactless, miniaturized, low-cost NIR spectral tool to monitor (on-the-go) the vineyard water status variability. On-the-go spectral measurements were acquired in the vineyard using a NIR micro spectrometer, operating in the 900–1900 nm spectral range, from a ground vehicle moving at 3 km/h. Spectral measurements were collected on the northeast side of the canopy across four different dates (July 8th, 14th, 21st and August 12th) during 2021 season in a commercial vineyard (3 ha). Grapevines of Vitis vinifera L. Graciano planted on a VSP trellis were monitored at solar noon using stem water potential (Ψs) as reference indicators of plant water status. In total, 108 measurements of Ψs were taken (27 vines per date).

Calibration and prediction models were performed using Partial Least Squares (PLS) regression. The best prediction models for grapevine water status yielded a determination coefficient of cross-validation (r2cv) of 0.67 and a root mean square error of cross-validation (RMSEcv) of 0.131 MPa. This predictive model was employed to map the spatial variability of the vineyard water status and provided useful, practical information towards the implementation of appropriate irrigation strategies. The outcomes presented in this work show the great potential of this low-cost methodology to assess the vineyard stem water potential and its spatial variability in a commercial vineyard.

Modeling island and coastal vineyards potential in the context of climate change

Climate change impacts regional and local climates, which in turn affects the world’s wine regions. In the short term, these modifications rises issues about maintaining quality and style of wine, and in a longer term about the suitability of grape varieties and the sustainability of traditional wine regions. Thus, adaptation to climate change represents a major challenge for viticulture. In this context, island and coastal vineyards could become coveted areas due to their specific climatic conditions. In regions subject to warming, the proximity of the sea can moderate extremes temperatures, which could be an advantage for wine. However, coastal and island areas are particular prized spaces and subject to multiple pressures that make the establishment or extension of viticulture complex.
In this perspective, it seems relevant to assess the potentialities of coastal and island areas for viticulture. This contribution will present a spatial optimization model that tends to characterize most suitable agroclimatic patterns in historical or emerging vineyards according to different scenarios. Thanks to an in-depth bibliography a global inventory of coastal and insular vineyards on a worldwide scale has been realized. Relevant criteria have been identified to describe the specificities of these vineyards. They are used as input data in the optimization process, which will optimize some objectives and spatial aspects. According to a predefined scenario, the objectives are set in three main categories associated with climatic characteristics, vineyards characteristics and management strategies. At the end of this optimization process, a series of maps presents the different spatial configurations that maximize the scenario objectives.

VINIoT: Precision viticulture service for SMEs based on IoT sensors network

The main innovation in the VINIoT service is the joint use of two technologies that are currently used separately: vineyard monitoring using multispectral imaging and deployed terrain sensors. One part of the system is based on the development of artificial intelligence algorithms that are feed on the images of the multispectral camera and IoT sensors, high-level information on water stress, grape ripening status and the presence of diseases. In order to obtain algorithms to determine the state of ripening of the grapes and avoid losing information due to the diversity of the grape berries, it was decided to work along the first year 2020 at berry scale in the laboratory, during the second year at the cluster scale and on the last year at plot scale. Different varieties of white and red grapes were used; in the case of Galicia we worked with the white grape variety Treixadura and the red variety Mencía. During the 2020 and 2021 campaigns, multispectral images were taken in the visible and infrared range of: 1) sets of 100 grapes classifying them by means of densimetric baths, 2) individual bunches. The images taken with the laboratory analysis of the ripening stage were correlated. Technological maturity, pH, probable degree, malic acid content, tartaric acid content and parameters for assessing phenolic maturity, IPT, anthocyanin content were determined. It has been calculated for each single image the mean value of each spectral band (only taking into account the pixels of interest) and a correlation study of these values with laboratory data has been carried out. These studies are still provisional and it will be necessary to continue with them, jointly with the training of the machine learning algorithms. Processed data will allow to determine the sensitivity of the multispectral images and select bands of interest in maturation.

Use of multispectral satellite for monitoring vine water status in mediterranean areas

The development of new generations of multispectral satellites such as Sentinel-2 opens possibilities as to vine water status assessment (Cohen et al., 2019). Based on a three years field campaign, a model of Stem Water Potential (SWP) estimation on vine using four satellite bands in Red, Red-Edge, NIR and SWIR domains was developed (Laroche-Pinel et al., 2021). The model relies on SWP field measures done using a pressure chamber (Scholander et al., 1965), which is a common, robust and precise method to assess vine water status (Acevedo-Opazo et al., 2008). The model was mainly developed from from SWP measures on Syrah N (Laroche Pinel E., 2021).

A large scale monitoring was organized in different vineyards in the Mediterranean region in 2021. 10 varieties amongst the most represented in this area were monitored (Cabernet sauvignon N, Chardonnay B, Cinsault N, Grenache N, Merlot N, Mourvèdre N, Sauvignon B, Syrah N, Vermentino B, Viognier B). The model was used to produce water status maps from Sentinel-2 images, starting from the beginning of June (fruit set) up to September (harvest). The average estimated SWP for each vine was compared to actual field SWP measures done by wine growers or technicians during usual monitoring of irrigation programs. The correlations between mean estimated SWP and mean measured SWP were at the same level than expected by the model. (Laroche Pinel, 2021) The general SWP kinetics were comparable. The estimated SWP would have led to same irrigation decisions concerning the date of first irrigation in comparison with measured SWP.

Acevedo-Opazo, C., Tisseyre, B., Ojeda, H., Ortega-Farias, S., Guillaume, S. (2008). Is it possible to assess the spatial variability of vine water status? OENO One, 42(4), 203.
Cohen, Y., Gogumalla, P., Bahat, I., Netzer, Y., Ben-Gal, A., Lenski, I., … Helman, D. (2019). Can time series of multispectral satellite images be used to estimate stem water potential in vineyards? In Precision agriculture ’19, The Netherlands: Wageningen Academic Publishers, pp. 445–451.
Laroche-Pinel, E., Duthoit, S., Albughdadi, M., Costard, A. D., Rousseau, J., Chéret, V., & Clenet, H. (2021). Towards vine water status monitoring on a large scale using sentinel-2 images. remote sensing, 13(9), 1837.
Laroche-Pinel,E. (2021). Suivi du statut hydrique de la vigne par télédétection hyper et multispectrale. Thèse INP Toulouse, France.
Scholander, P.F., Bradstreet, E.D., Hemmingsen, E.A., & Hammel, H.T. (1965). Sap pressure in vascular plants: Negative hydrostatic pressure can be measured in plants. Science, 148(3668), 339–346.