Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
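
The first two steps named above, segmentation along the retention time axis and decomposition of each segment, can be illustrated with a minimal Python sketch. It is not the published implementation: the function names `segment_bounds` and `segment_scores`, the segment width, the overlap, and the synthetic data are assumptions made for illustration, and a truncated SVD of the unfolded segment is used here as a simple stand-in for the multiway decomposition described in [1, 2].

```python
import numpy as np


def segment_bounds(n_scans, width, overlap=0):
    """Return (start, stop) scan-index pairs covering the retention time axis.

    Hypothetical helper: width and overlap are illustrative parameters.
    """
    step = width - overlap
    if step <= 0:
        raise ValueError("overlap must be smaller than width")
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_scores(data, bounds, n_components=2):
    """Compute per-sample scores for each retention time segment.

    data has shape (samples, scans, m/z channels). Each segment is unfolded
    to a samples x (scans * channels) matrix, mean-centred, and reduced by
    truncated SVD. This is a stand-in for a multiway decomposition, not the
    decomposition used by the authors.
    """
    n_samples = data.shape[0]
    blocks = []
    for start, stop in bounds:
        unfolded = data[:, start:stop, :].reshape(n_samples, -1)
        unfolded = unfolded - unfolded.mean(axis=0)
        u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
        blocks.append(u[:, :n_components] * s[:n_components])
    # One row per sample, n_components columns per segment.
    return np.hstack(blocks)


# Synthetic example: 6 samples, 100 scans, 20 m/z channels (made-up data).
rng = np.random.default_rng(0)
data = rng.random((6, 100, 20))
bounds = segment_bounds(100, width=30, overlap=5)
features = segment_scores(data, bounds, n_components=2)
```

No peak picking or alignment is involved: every sample is cut at the same scan indices, and the resulting `features` matrix has one row per sample. Such a matrix could then be passed to a gradient boosted tree classifier, for example scikit-learn's `GradientBoostingClassifier`, together with class labels for the samples.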

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application was built with the open-source Python packages Bokeh and HoloViews and will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
