Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
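
As an illustration of the first step only, the following is a minimal sketch of segmenting chromatograms along the retention time axis. It is not the authors' implementation: the function name `segment_chromatograms`, the fixed window length `segment_len` and the toy data are assumptions made for this example, and the abstract does not state how segment boundaries are chosen. Each resulting segment is a small three-way array (samples x scans x m/z channels), which is the kind of input a multiway decomposition would then take; the decomposition and the gradient boosted tree classification are not shown.

```python
from typing import List

# One sample: a list of scans, each scan a list of intensities per m/z channel.
Chromatogram = List[List[float]]


def segment_chromatograms(
    samples: List[Chromatogram], segment_len: int
) -> List[List[Chromatogram]]:
    """Cut all chromatograms into consecutive retention-time windows.

    Assumption for this sketch: windows have a fixed number of scans
    (segment_len); the last window may be shorter.
    Returns a list of segments, each shaped samples x scans x m/z channels.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    if not samples:
        return []
    n_scans = len(samples[0])
    if any(len(sample) != n_scans for sample in samples):
        raise ValueError("all samples must have the same number of scans")

    segments = []
    for start in range(0, n_scans, segment_len):
        stop = min(start + segment_len, n_scans)
        # Same scan window from every sample; no peak picking or alignment.
        segments.append([sample[start:stop] for sample in samples])
    return segments


# Hypothetical toy data: 2 samples, 5 scans, 3 m/z channels.
samples = [
    [[float(scan + 10 * k + ch) for ch in range(3)] for scan in range(5)]
    for k in range(2)
]
segments = segment_chromatograms(samples, segment_len=2)
shapes = [(len(seg), len(seg[0]), len(seg[0][0])) for seg in segments]
print(shapes)  # [(2, 2, 3), (2, 2, 3), (2, 1, 3)]
```

Because every segment keeps the same scan window for all samples, no peaks have to be detected or matched across samples at this stage, which is in keeping with the stated aim of avoiding feature detection and retention time alignment.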

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

Modulation of berry composition by different vineyard management practices

High concentrations of sugars in grapes and of alcohol in wines are among the consequences of climate change for viticultural production in several wine-growing regions. To investigate how vineyard management practices can be adapted to reduce sugar accumulation during the maturation phase without reducing the accumulation of anthocyanins in grapes, a study with severe shoot trimming, shoot thinning, cluster thinning and date of harvest was conducted on the Merlot variety in the Istria region (Croatia), under a Mediterranean climate. Four factors that may affect grape maturation and composition at harvest were investigated in a two-year experiment: severe shoot trimming applied at veraison when >80% of berries had changed colour (in comparison to an untreated control), shoot thinning (0 and 30%), cluster thinning (0 and 30%), and the date of harvest (early and standard harvest dates). Shoot thinning had no significant impact on berry composition, despite the resulting reduction in yield per vine. Lower Brix in grapes was obtained with the earlier harvest date and when no cluster thinning was applied, although a reduction in the concentration of anthocyanins in berries was also observed in these treatments. In contrast, when severe shoot trimming was applied after >80% of berries had changed colour, a reduction in Brix was obtained without a negative impact on berry anthocyanin concentration. We conclude that when undesirably high sugar concentrations at harvest are expected, severe shoot trimming at 80% veraison may be used effectively to obtain a moderate sugar concentration in berries together with an adequate phenolic composition.

Variations of soil attributes in vineyards influence their reflectance spectra

Knowledge of the reflectance spectrum of soil is potentially useful, since it carries information on soil chemical composition that can be used for planning agricultural practices. Compared with conventional chemical analysis, reflectance measurement provides non-destructive, economical, near real-time data. This paper reports results from reflectance measurements performed by spectroradiometry on soils from two vineyards in south Brazil. The vineyards are close to each other and lie on different geological formations, but were subjected to the same management. The objective was to detect spectral differences between the two areas and to correlate these differences with variations in their chemical composition, in order to assess the technique’s potential to predict soil attributes from reflectance data. To that end, soil samples were collected from ten selected vine parcels. Chemical analysis yielded data on the concentration of twenty-one soil attributes, and spectroradiometry was performed on the samples. Chemical differences between the two studied areas that were significant at a 95% confidence level were found for six soil attributes, and the average reflectance spectra were separated at this same level along most of the observed spectral domain. Correlations between soil reflectance and concentrations of soil attributes were sought, and for ten soil traits it was possible to define wavelength domains where reflectance and concentrations are correlated at confidence levels from 95% to 99%. Partial Least Squares Regression (PLSR) analyses were performed comparing measured and predicted concentrations, and for fifteen out of 21 soil traits we found Pearson correlation coefficients r > 0.8. These preliminary results, which have to be validated, suggest that variations of concentration in the investigated soil attributes induce differences in reflectance that can be detected by spectroradiometry.
Applications of these observations include the assessment of the chemical content of soils by spectroradiometry as a fast, low-cost alternative to chemical analytical methods.

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldom thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using the Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Loss Rate (YLR). Additionally, we investigate whether an advanced flowering-veraison phase may help alleviate water stress and improve yield. The process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest that wine regions with stronger flowering-veraison CWSI tend to have a higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tend to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter show severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, which are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR.
For those of France-Germany-Luxembourg, this may have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because the phase shifted towards a cooler growing season with reduced evaporative demand. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.

Analysis of Cabernet Sauvignon and Aglianico winegrape (V. vinifera L.) responses to different pedo-climatic environments in southern Italy

Water deficit is one of the most important effects of climate change affecting agricultural sectors. In general, it reduces biomass production, and for some plants, as in the case of grapevine, it can enhance fruit quality. The monitoring and management of plant water stress in the vineyard

Bioclimatic shifts and land use options for Viticulture in Portugal

Land use plays a relevant role in the climatic system. It provides the means for agricultural practices, thus contributing to the food supply. Since climate and land are closely intertwined through multiple interface processes, climate change may lead to significant impacts on land use. In this study, 1-km observational gridded datasets are used to assess changes in the Köppen–Geiger and Worldwide Bioclimatic (WBCS)