Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be error-prone, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification applied to the decomposed tensor [1, 2].
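
The segmentation and decomposition steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the synthetic data, the equal-width retention time windows and the segment and component counts are all assumptions made for illustration. In particular, a plain SVD of the unfolded segment stands in for the multiway decomposition, which the abstract does not specify.

```python
import numpy as np

def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z channels) array into retention time windows."""
    # axis=1 is the retention time (scan) axis in this assumed layout
    return np.array_split(data, n_segments, axis=1)

def segment_scores(segment, n_components):
    """Stand-in for a multiway decomposition (assumption): unfold each
    sample of the segment into a row vector and keep the leading SVD scores."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    centered = unfolded - unfolded.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    k = min(n_components, len(s))
    return u[:, :k] * s[:k]

def build_feature_table(data, n_segments=4, n_components=2):
    """Concatenate per-segment scores into one sample-by-feature table."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])

# Synthetic data only: 6 samples, 40 scans, 5 m/z channels
rng = np.random.default_rng(0)
toy = rng.random((6, 40, 5))
features = build_feature_table(toy)
print(features.shape)  # (6, 8)
```

Because each segment is decomposed on its own, no peak has to be detected or matched across samples. A table like `features` could then be passed, together with class labels, to a gradient boosted tree classifier such as scikit-learn's `GradientBoostingClassifier`.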

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
