Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The rootstock, the neglected player in the scion transpiration even during the night

Water is the main limiting factor for yield in viticulture. Improving drought adaptation in viticulture will be an increasingly important issue under climate change. Genetic variability of water deficit responses in grapevine partly results from the rootstocks, making them an attractive and relevant mean to achieve adaptation without changing the scion genotype. The objective of this work was to characterize the rootstock effect on the diurnal regulation of scion transpiration. A large panel of 55 commercial genotypes were grafted onto Cabernet Sauvignon. Three biological repetitions per genotype were analyzed. Potted plants were phenotyped on a greenhouse balance platform capable of assessing real-time water use and maintaining a targeted water deficit intensity. After a 10 days well-watered baseline period, an increasing water deficit was applied for 10 days, followed by a stable water deficit stress for 7 days. Pruning weight, root and aerial dry weight and transpiration were recorded and the experiment was repeated during two years. Transpiration efficiency (ratio between aerial biomass and transpiration) was calculated and δ13C was measured in leaves for the baseline and stable water deficit periods. A large genetic variability was observed within the panel. The rootstock had a significant impact on nocturnal transpiration which was also strongly and positively correlated with maximum daytime transpiration. The correlations with growth and water use efficiency related traits will be discussed. Transpiration data were also related with VPD and soil water content demonstrating the influence of environmental conditions on transpiration. These results highlighted the role of the rootstock in modulating water deficit responses and give insights for rootstock breeding programs aimed at identifying drought tolerant rootstocks. It was also helpful to better define the mechanisms on which the drought tolerance in grapevine rootstocks is based on.

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

Local adaptation tools to ensure the viticultural sustainability in a changing climate

[lwp_divi_breadcrumbs home_text="IVES" use_before_icon="on" before_icon="||divi||400" module_id="publication-ariane" _builder_version="4.19.4" _module_preset="default" module_text_align="center" module_font_size="16px" text_orientation="center"...

Geospatial trends of bioclimatic indexes in the topographically complex region of Barolo DOCG

Barolo DOCG is an economically important wine producing region in Northwest Italy. It is a small region of approximately 70 km2 gross area. The topography is very complex with steep sloped hills ranging in elevation from below 200 m to 550 m. Barolo DOCG wine is made exclusively from the Nebbiolo grape. Bioclimatic indexes are often used in viticulture to gain a better understanding of broader climate trends which can be compared temporally and geographically. These indexes are also used for identifying potential phenological timing, growing region suitability, and potential risks associated with expected climatic changes. Understanding how topography influences bioclimatic indexes can help with understanding of mesoscale climate behaviour leading to improved decision making and risk management strategies. The average monthly maximum and minimum temperatures, the Cool Night Index, the Huglin Index, and the monthly diurnal range (from July to October) were calculated using data from 45 weather stations within a 40 km radius of the Barolo DOCG growing area between the years 1996 and 2019. Linear and multiple regression models were developed using independent variables (elevation, aspect, slope) extracted from a digital elevation model to identify significant relationships. Bioclimatic indexes were then kriged with external drift using independent variables that showed significant relationships with the bioclimatic index using a 100 m resolution grid. The maximum monthly temperatures and the Huglin Index showed consistent significant negative relationships with elevation in all years. The minimum monthly temperatures showed no relationship with elevation but in some months a small but significant relationship was observed with aspect. Due to the lack of a relationship between minimum monthly temperatures and elevation compared to the significant relationship between maximum monthly temperatures and elevation, monthly diurnal range had a negative relationship with elevation.

Effects of organic mulches on the soil environment and yield of grapevine

Farming management practices aiming at conserving soil moisture have been developed in arid and semiarid-areas facing water scarcity problems. Organic mulching is an effective method to manipulate the crop-growing microclimate increasing crop yield by controlling soil temperature, and retaining soil moisture by reducing soil evaporation. In this sense, the effectiveness of different organic mulching materials (straw mulch and grapevine pruning debris) applied within the row of a vineyard was evaluated on the soil and on the vine in a Tempranillo vineyard located in La Rioja (Spain). Organic mulches were compared with a traditional bare soil management technique (based on the use of herbicides to avoid weed incidence). Mulching coverages favourably influenced the soil water retention throughout all the grapevine vegetative cycle. However, the soil-moisture variation was not the same under different mulching materials, being the straw mulch (SM) the one that retained more water in comparison with grapevine pruning debris (GPD) based-cover. The changes of soil moisture in the upper surface layer (0–10 cm) were highly dynamic, probably due to water vapour fluxes across the soil-atmospheric interface. However, both, SM and GPD reduced these fluctuations as compared with bare soils. A similar trend occurred with soil temperature. Both organic mulches altered soil temperature in comparison with bare soil by reducing soil temperature in summer and raising it in winter. Moreover, the same buffering effect for the temperature on the covered soil also remains in the deeper layers. To conclude, we could see that organic mulching had a positive impact on soil-moisture storage and soil temperature and the extent of this effect depends on the type of mulching materials. These changes led to higher rates of photosynthesis and stomatal conductivity compared to bare soils, also favouring crop growth and grape yields.