Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Assessing the climate change vulnerability of European winegrowing regions by combining exposure, sensitivity and adaptive capacity indicators

Winegrowing regions recognized as protected designations of origin (PDOs) are closely tied to well defined geographic locations with a specific set of pedoclimatic attributes and strictly regulated by legal specifications. However, climate change is increasingly threatening these regions by changing local conditions and altering winegrowing processes. The vulnerability to these changes is largely heterogenous across different winegrowing regions because it is determined by individual characteristics of each region, including the capacity to adapt to new climatic conditions and the sensitivity to climate change, which depend not only on natural, but also socioeconomic and legal factors. Accurate vulnerability assessments therefore need to combine information about adaptive capacity and climate change sensitivity with projected exposure to new climatic conditions. However, most existing studies focus on specific impacts neglecting important interactions between the different factors that determine climate change vulnerability. Here, we present the first comprehensive vulnerability assessment of European wine PDOs that spatially combines multiple indicators of adaptive capacity and climate change sensitivity with high-resolution climate projections. We found that the climate change vulnerability of PDO areas largely depends on the complex interactions between physical and socioeconomic factors. Homogenous topographic conditions and a narrow varietal spectrum increase climate change vulnerability, while the skills and education of farmers, together with a good economic situation, decrease their vulnerability. Assessments of climate change consequences therefore need to consider multiple variables as well as their interrelations to provide a comprehensive understanding of the expected impacts of climate change on European PDOs. Our results provide the first vulnerability assessment for European winegrowing regions at high spatiotemporal resolution that includes multiple factors related to climate exposure, sensitivity, and adaptive capacity on the level of single winegrowing regions. They will therefore help to identify hot spots of climate change vulnerability among European PDOs and efficiently direct adaptation strategies.

Simulating climate change impact on viticultural systems in historical and emergent vineyards

Global climate change affects regional climates and hold implications for wine growing regions worldwide. Although winegrowers are constantly adapting to internal and external factors, it seems relevant to develop tools, which will allow them to better define actual and future agro-climatic potentials. Within this context, we develop a modelling approach, able to simulate the impact of environmental conditions and constraints on vine behaviour and to highlight potential adaptation strategies according to different climate change scenarios. Our modeling approach, named SEVE (Simulating Environmental impacts on Viticultural Ecosystems), provides a generic modeling framework for simulating grapevine growth and berry ripening under different conditions and constraints (slope, aspect, soil type, climate variability…) as well as production strategies and adaptation rules according to climate change scenarios. Each activity is represented by an autonomous agent able to react and adapt its reaction to the variability of environmental constraints. Using this model, we have recently analyzed the evolution of vineyards’ exposure to climatic risks (frost, pathogen risk, heat wave) and the adaptation strategies potentially implemented by the winegrowers. This approach, implemented for two climate change scenarios, has been initiated in France on traditional (Loire Valley) and emerging (Brittany) vineyards. The objective is to identify the time horizons of adaptations and new opportunities in these two regions. Carried out in collaboration with wine growers, this approach aims to better understand the variability of climate change impacts at local scale in the medium and long term.

The potential of multispectral/hyperspectral technologies for early detection of “flavescence dorée” in a Portuguese vineyard

“Flavescence dorée” (FD) is a grapevine quarantine disease associated with phytoplasmas and transmitted to healthy plants by insect vectors, mainly Scaphoideus titanus. Infected plants usually develop symptoms of stunted growth, unripe cane wood, leaf rolling, leaf yellowing or reddening, and shrivelled berries. Since plants can remain symptomless up to four years, they may act as reservoirs of FD contributing to the spread of the disease. So far, conventional management strategies rely mainly on the insecticide treatments, uprooting of infected plants and use of phytoplasma-free propagation material. However, these strategies are costly and could have undesirable environmental impacts. Thus, the development of sustainable and noninvasive approaches for early detection of FD and its management are of great importance to reduce disease spread and select the best cultural practices and treatments. The present study aimed to evaluate if multispectral/hyperspectral technologies can be used to detect FD before the appearance of the first symptoms and if infected grapevines display a spectral imaging fingerprint. To that end, physiological parameters (leaf area, chlorophyll content and photosynthetic rate) were collected in concomitance to the measurements of plant reflectance (using both a portable apparatus and a remote sensing drone). Measurements were performed in two leaves of 8 healthy and 8 FD-infected grapevines, at four timepoints: before the development of disease symptoms (21st June); and after symptoms appearance (ii) at veraison (2nd August); at post-veraison (11th September); and at harvest (25th September). At all timepoints, FD infected plants revealed a significant decrease in the studied physiological parameters, with a positive correlation with drone imaging data and portable apparatus analyses. Moreover, spectra of either drone imaging and portable apparatus showed clear differences between healthy and FD-infected grapevines, validating multispectral/ hyperspectral technology as a potential tool for the early detection of FD or other grapevine-associated diseases.

Copper contamination in vineyard soils of Bordeaux: spatial risk assessment for the replanting of vines and crops

Copper (Cu) is widely and historically used in viticulture as a fungicide against mildew. Cu has a strong affinity for soil organic matter and accumulates in topsoil horizons. Thus, Cu may negatively affect soil organisms and plants, consequently reducing soil fertility and productivity. The Bordeaux vineyards have the largest vineyard surfaces (26%) within French controlled appellation and a great proportion of French wine production (around 5 million hl per year). Considering the local context of vineyard surfaces decreasing (vine uprooting) and possible new crop plantation, the issue of Cu potential toxicity rises. Therefore, the aims of this work are firstly to evaluate the Cu contamination in vineyard soils of Bordeaux, secondly to produce a risk assessment map for new vine or crop plantation. We used soil analyses from several local studies to build a database with 4496 soil horizon samples. The database was enhanced by means of pedotransfer functions in order to estimate the bioaccessible (EDTA-extractable) Cu in soils of samples without measurements. From this database, 1797 georeferenced samples with CuEDTA concentrations in the topsoil (0-50 cm depth) were used for kriging interpolation in order to produce the spatial distribution map of CuEDTA in vineyard soils. Then, the spatial distribution of Cu was crossed with vine uprooting surfaces and municipality boundaries. CuEDTAconcentrations ranged from 0.52 to 459 mg/kg and showed clear anomalies. Our results from spatial analysis showed that almost 50% of vineyard soil surfaces have CuEDTA concentrations higher than 30 mg/kg (moderate risk for new plantation) and 20% with concentrations higher than 50 mg/kg (high risk for new plantation). A decision-support map based on municipalities was realised to provide a simple tool to stakeholders concerned by land use management.

Grape must quality and mesoclimatic variability in Fruška Gora wine-growing region, Serbia

The Fruška Gora mountain is a traditional wine-growing region in Serbia situated in the Pannonian Basin. Due to such a position, the vicinity of the Danube River and the presence of concave configuration, it is suitable for grape production. This paper provides analyses of spatial variations in meteorological parameters and grape juice quality within Fruška Gora wine region over three consecutive vintages (2018-2020). The examined period can be defined as warm with cool nights during September (AVG 18,9°C; GDD 1918°C; CI 12°CF) and with the presence of mesoclimatic variability. The East part of the study area was somewhat drier and hotter compared to other parts of the region. The analyses of grape must samples (190 in total) of five cultivars (Cabernet-Sauvignon, Merlot, Chardonnay, Sauvignon blanc and Grašac (Welschriesling)) commonly grown across the region (19 sites), were performed using Fourier Transform Infrared Technology (FTIR). Among all cultivars, Sauvignon blanc was harvested first in the East area (DOY=246±5, GDD at harvest=1552±74, 22.2±0.7 °Brix), while the latest harvest was recorded for Cabernet-Sauvignon in the West (DOY=283±5, GDD at harvest=1936±187, 23.4±1.0 °Brix ). Both the red and white cultivars had higher acidity and YAN in the grape must if the vines were grown in the North and East compared to South and West areas. According to PCA analysis, Grašac showed the lowest variation in grape must chemical composition. Thus, the results confirm that Grašac is the most stable cultivar in Fruška Gora. All monitored cultivars reached technological fruit ripeness by the end of the growing season. However, it was difficult to reach full ripeness of red cultivars, mostly beacuse of uncoupling of technolocical and phenolic ripeness. Thus, Cabernet-Sauvignon had higher variations in GDD sums at harvest compared to other cultivars, which probably increased variations in grape must quality.