Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Soil, vine, climate change – what is observed – what is expected

To evaluate the current and future impact of climate change on Viticulture requires an integrated view on a complex interacting system within the soil-plant-atmospheric continuum under continuous change. Aside of the globally observed increase in temperature in basically all viticulture regions for at least four decades, we observe several clear trends at the regional level in the ratio of precipitation to potential evapotranspiration. Additionally the recently published 6th assessment report of the IPCC (The physical science basis) shows case-dependent further expected shifts in climate patterns which will have substantial impacts on the way we will conduct viticulture in the decades to come.
Looking beyond climate developments, we observe rising temperatures in the upper soil layers which will have an impact on the distribution of microbial populations, the decay rate of organic matter or the storage capacity for carbon, thus affecting the emission of greenhouse gases (GHGs) and the viscosity of water in the soil-plant pathway, altering the transport of water. If the upper soil layers dry out faster due to less rainfall and/or increased evapotranspiration driven by higher temperatures, the spectral reflection properties of bare soil change and the transport of latent heat into the fruiting zone is increased putting a higher temperature load on the fruit. Interactions between micro-organisms in the rhizosphere and the grapevine root system are poorly understood but respond to environmental factors (such as increased soil temperatures) and the plant material (rootstock for instance), respectively the cultivation system (for example bio-organic versus conventional). This adds to an extremely complex system to manage in terms of increased resilience, adaptation to and even mitigation of climate change. Nevertheless, taken as a whole, effects on the individual expressions of wines with a given origin, seem highly likely to become more apparent.

VINIoT – Precision viticulture service

The project VINIoT pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system is based on spectral images and an IoT architecture that allows assessing parameters of interest viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard) will be designed. In France, three subjects were specifically developed: evaluation of maturity, of water stress, and detection of flavescence dorée. For the evaluation of maturity, it has been decided first to work at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the spectral hyperstal image as well as the reference analyzes to measure the maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022), to improve prediction model performance was applied on spectra after the acquirement of hyperspectral images. Regarding the evaluation of water stress, to work with a significant variability in terms of water status, it has been worked first with potted plants under 2 different water regimes. The facilities have allowed the supervision of irrigation and micro-climatic conditions. The regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan has consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyper-spectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy proved the potential towards the discrimination of healthy and infected leaves by flavescence dorée based on the use of hyperspectral images (Mas Garcia et al., 2021).

Impact of climate variability and change on grape yield in Italy

Viticulture is entangled with weather and climate. Therefore, areas currently suitable for grape production can be challenged by climate change. Winegrowers in Italy already experiences the effect of climate change, especially in the form of warmer growing season, more frequent drought periods, and increased frequency of weather extremes.
The aim of this study is to investigate the impact of climate variability and change on grape yield in Italy to provide winegrowers the information needed to make their business more sustainable and resilient to climate change. We computed a specific range of bioclimatic indices, selected by the International Organisation of Vine and Wine (OIV), and correlated them to grape yield data. We have worked in collaboration with some wine consortiums in northern and central Italy, which provided grape yield data for our analysis.
Using climate variables from the E-OBS dataset we investigate how the bioclimatic indices changed in the past, and the impact of this change on grape productivity in the study areas. The climate impact on productivity is also investigated by using high-resolution convection-permitting models (CPMs – 2.2 horizontal resolution), with the purpose of estimating productivity in future emission scenarios. The CPMs are likely the best available option for this kind of impact studies since they allow a better representation of small-scale processes and features, explicitly resolve deep convection, and show an improved representation of extremes. In our study, we also compare CPMs with regional climate models (RCMs – 12 km horizontal resolution) to assess the added value of high-resolution models for impact studies. Further development of our study will lead to assessing the future suitability for vine cultivation and could lead to the construction of a statistical model for future projection of grape yield.

Local adaptation tools to ensure the viticultural sustainability in a changing climate

[lwp_divi_breadcrumbs home_text="IVES" use_before_icon="on" before_icon="||divi||400" module_id="publication-ariane" _builder_version="4.19.4" _module_preset="default" module_text_align="center" module_font_size="16px" text_orientation="center"...

Ecophysiological performance of Vitis rootstocks under water stress

The use of rootstocks tolerant to soil water deficit is an interesting strategy to cope with limited water availability. Currently, several nurseries are breeding new genotypes, but the physiological basis of its responses under water stress are largely unknown. To this end, an ecophysiological assessment of the conventional 110-Richter (110R) and SO4, and the new M1 and M4 rootstocks was carried out in potted ungrafted plants. During one season, these Vitis genotypes were grown under greenhouse conditions and subjected to two water regimes, well-watered and water deficit. Water potentials of plants under water deficit down to < -1.4 MPa, and net photosynthesis (AN) <5 μmol m-2 s-1 did not cause leaf oxidative stress damage compared to well-watered conditions in any of the genotypes. The antioxidant capacity was sufficient to neutralize the mild oxidative stress suffered. Under both treatments, gravimetric differences in daily water use were observed among genotypes, leading to differences in the biomass of root, shoot and leaf. Under well-watered conditions, SO4 and 110R were the most vigorous and M1 and M4 the least. However, under water stress, SO4 exhibited the greatest reduction in biomass while M4 showed the lowest. Remarkably, under these conditions, SO4 reached the least negative stem water potential (Ψstem), while M1 reduced stomatal conductance (gs) and AN the most. In addition, SO4 and M1 genotypes also showed the highest and lowest hydraulic conductance values, respectively. Our results suggest that there are differences in water use regulation among genotypes, not only attributed to differences in stomatal regulation or intrinsic water use efficiency at the leaf level. Therefore, because no differences in canopy-to-root ratio were achieved, it is hypothesized that xylem vessel anatomical differences may be driving the reported differences among rootstocks performance. Results demonstrate that each Vitis rootstock differs in its ecophysiological responses under water stress.