Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
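As a rough illustration of the first two steps, the following Python sketch splits a stack of chromatograms into windows along the retention time (scan) axis and reduces each window to a few per-sample scores. It is a minimal sketch under stated assumptions, not the authors' implementation: the data are synthetic, the function names and the segment and component counts are hypothetical, and a truncated SVD of the unfolded segment stands in for the multiway decomposition, whose exact model and preceding transformation are not specified here.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into windows along the scan axis."""
    return np.array_split(data, n_segments, axis=1)


def segment_scores(segment, n_components):
    """Simplified stand-in for a multiway decomposition (assumption):
    truncated SVD of the segment unfolded to (samples, scans * m/z)."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    unfolded = unfolded - unfolded.mean(axis=0)  # center each variable
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]  # per-sample scores


def build_feature_matrix(data, n_segments=4, n_components=2):
    """Concatenate the per-segment scores into one feature matrix."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Synthetic example: 6 samples, 40 scans, 5 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 40, 5))
features = build_feature_matrix(data)
print(features.shape)  # (6, 8): 4 segments x 2 components per sample
```

Because every segment is decomposed on its own, no peak has to be matched across samples, which is the reason feature detection and alignment can be skipped. The resulting feature matrix, together with class labels, could then be passed to a gradient boosted tree classifier such as scikit-learn's GradientBoostingClassifier.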

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
