Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
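The segmentation step can be illustrated with a minimal sketch. It is not the authors' implementation: the fixed window width, the truncation to the shortest chromatogram, and the names `segment_chromatogram` and `stack_segments` are assumptions made for illustration only. The sketch shows how cutting each chromatogram into windows along the retention time axis yields one three-way array (samples x scans x m/z) per window, which is the kind of input a multiway decomposition operates on. The transformation, decomposition and classification steps are not shown.

```python
from typing import List

Scan = List[float]          # intensities over the m/z channels of one scan
Chromatogram = List[Scan]   # scans ordered by retention time


def segment_chromatogram(chrom: Chromatogram, width: int) -> List[Chromatogram]:
    """Split one chromatogram into consecutive windows of `width` scans.

    Trailing scans that do not fill a whole window are dropped (an assumption
    made here so that all segments have the same size).
    """
    if width <= 0:
        raise ValueError("width must be positive")
    n_full = len(chrom) // width
    return [chrom[i * width:(i + 1) * width] for i in range(n_full)]


def stack_segments(chroms: List[Chromatogram], width: int) -> List[List[Chromatogram]]:
    """Return tensors[segment][sample][scan][mz] for a set of samples.

    All chromatograms are truncated to the shortest one so that every sample
    contributes the same number of segments (hypothetical simplification).
    """
    n_scans = min(len(c) for c in chroms)
    per_sample = [segment_chromatogram(c[:n_scans], width) for c in chroms]
    n_segments = n_scans // width
    return [[sample[s] for sample in per_sample] for s in range(n_segments)]


# Toy data: 3 samples, 10 scans each, 4 m/z channels per scan.
samples = [
    [[float(sample + scan + mz) for mz in range(4)] for scan in range(10)]
    for sample in range(3)
]
tensors = stack_segments(samples, width=4)
# Each entry of `tensors` is one samples x scans x m/z array for one window.
```

Because every window is handled independently, peaks only need to fall within the same window across samples, which is one way to see why an explicit retention time alignment step may become unnecessary.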

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application was built with the open-source Python packages Bokeh and HoloViews and will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

Elevational range shifts of mountain vineyards: Recent dynamics in response to a warming climate

Increasing temperatures worldwide are expected to cause a change in the spatial distribution of plant species along elevational gradients, and shifts to higher elevations as a consequence of climate change are already observable for many species. Not only naturally growing plants but also agricultural cultivations are subject to the effects of climate change, as the type of cultivation and its economic viability depend largely on the prevailing climatic conditions. A shift to higher elevations therefore represents a viable adaptation strategy to climate change, as higher elevations are characterized by lower temperatures. This is especially important in the case of viticulture because a certain wine style can only be achieved under very specific climatic conditions. Although several studies have investigated climatic suitability within winegrowing regions or longitudinal shifts of winegrowing areas, little is known about how fast vineyards move to higher elevations, which may represent a viable strategy for winegrowers to maintain growing conditions, and thus wine style, despite the effects of climate change. We therefore investigated the change in the spatial distribution of vineyards along an elevational gradient over the past 20 years in the mountainous wine-growing region of Alto Adige (Italy). A dataset containing information about the location and planting year of more than 26000 vineyard parcels and 30 varieties was used to perform this analysis. Preliminary results suggest that there has been a shift to higher elevations for vineyards in general (from formerly 700 m to currently 850 m a.s.l., with extreme sites reaching 1200 m a.s.l.), but also that this development has not been uniform across different varieties and products (i.e. Vitis vinifera vs. hybrid varieties and still vs. sparkling wines). This is important for climate change adaptation as well as for rural development.
Mountain areas, especially at mid to high elevations, are often characterized by severe land abandonment which can be avoided to some degree if economically viable and sustainable land management strategies are available.

20-Year-Old data set: scion x rootstock x climate, relationships. Effects on phenology and sugar dynamics

Global warming is one of the biggest environmental, social, and economic threats. In the Douro Valley, changes to the climate are expected in the coming years, namely an increase in average temperature and a decrease in annual precipitation. Since vine cultivation is extremely vulnerable to and influenced by the climate, these changes are likely to have negative effects on the production and quality of wine.
Adaptation is a major challenge facing the viticulture sector, and the choice of plant material plays an important role. This applies particularly to the rootstock, which is a driver for adaptation with a wide range of effects, the most important being tolerance to phylloxera, nematodes, salt and drought, as well as a complex set of interactions in the grafted plant.
In an experimental vineyard established in the Douro Region in 1997, with four randomized blocks and the varieties Touriga Nacional, Tinta Barroca, Touriga Franca and Tinta Roriz grafted onto the rootstocks Rupestris du Lot, R110, 196-17C, R99 and 1103P, data were collected consecutively over 20 years (2001-2020). Phenological observations were made two to three times a week, following established criteria, to determine the average dates of budbreak, flowering and veraison. During maturation, weekly berry samples were taken to study the dynamics of sugar accumulation, amongst other parameters. Climate data were collected from a weather station located near the vineyard parcel and classified using several climatic indices.
The results show a very low coefficient of variation in the average dates of the phenophases and an important contribution of the rootstock to the dynamics of phenology, allowing a delay in the cycle of up to 10-12 days for the different combinations. The Principal Component Analysis, which evaluated trends in the physical-chemical parameters, highlighted the effect of climate and rootstock on fruit quality for each grape variety.

Rapid damage assessment and grapevine recovery after fire

There is increasing scientific consensus that climate change is the underlying cause of the prolonged dry and hot conditions that have increased the risk of extreme fire weather in many countries around the world. In December 2019, a bushfire event occurred in the Adelaide Hills, South Australia, where 25,000 hectares were burnt and various degrees of scorching and infrastructure damage occurred in vineyards and surrounding areas. The ability to coordinate and plan recovery after a fire event relies on robust and timely data. The current practice for measuring the scale and distribution of fire damage is to walk or drive the vineyard and score individual vines based on visual observation. The process is time-consuming and subjective, or semi-quantitative at best. After the December 2019 fires, it took many months to access properties and estimate the area of vineyard damaged. This study compares the rapid assessment and mapping of fire damage using high-resolution satellite imagery with more traditional ground-based measures. Satellite imagery tracking vineyard recovery in the season following the bushfire is being correlated to field assessments of vineyard productivity such as canopy health and development, fertility and carbohydrate storage. Canopy health in the seasons following the fires correlated with the severity of the initial fire damage. Severely damaged vines had reduced canopy growth, were infertile or had very low fertility, and had lower carbohydrate levels in buds and canes during dormancy, which reduced productivity in the seasons following the bushfire event. In contrast, vines that received minor damage were able to recover within 1-2 years. Tools that rapidly and affordably capture the extent and severity of damage over large vineyard areas will allow producers, government and industry bodies to manage decisions in relation to fire recovery planning, coordination and delivery, improving the efficiency and effectiveness of their response.

Metabolomic discrimination of grapevine water status for Chardonnay and Pinot noir

The impact of water status in viticulture has been widely explored, as it strongly affects grapevine physiology and grape chemical composition. It is considered a key component of vitivinicultural terroir. Most studies concerning grapevine water status have focused on either physiological traits, berry compounds, or traits involved in wine quality. Here, the response of grapevine to water availability during the ripening period is assessed through non-targeted metabolomics analysis of grape berries by ultra-high resolution mass spectrometry. The grapevine water status was assessed during two consecutive years (2019 and 2020) through carbon isotope discrimination on juices from berries collected at maturity (approx. 21.5 °Brix) for two Vitis vinifera cultivars, Pinot noir (PN) and Chardonnay (CH). A total of 220 grape juices were collected from 5 countries worldwide (Italy, Argentina, France, Germany and Portugal). Measured δ13C (‰) varied from -28.73 to -22.6 for PN, and from -28.79 to -21.67 for CH. These results also clearly revealed higher water stress for the 2020 vintage. The same grape juices were analysed by Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FT-ICR-MS) and Liquid Chromatography coupled to Mass Spectrometry (LC-qTOF-MS), leading to the detection of up to 4500 CHONS-containing elemental compositions, and thus likely tens of thousands of individual compounds, which include fatty acids, organic acids, peptides and phenolics, also with high levels of glycosylation. Multivariate statistical analysis revealed that up to 160 elemental compositions, covering the whole range of detected masses (100-1000 m/z), were significantly correlated with the observed gradients of water status.
Examples of chemical markers that are representative of these complex fingerprints include various derivatives of the known abscisic acid (ABA), such as phaseic acid or abscisic acid glucose ester, which are significantly correlated with higher water stress, regardless of the variety. Cultivar-specific behaviours could also be identified from these fingerprints. Our results provide an unprecedented representation of the metabolic diversity involved in water status regulation at the grape level, which could contribute to a better knowledge of the grapevine mitigation strategy in a climate change context.

Water deficit differentially impacts the performances and the accumulation of grape metabolites of new varieties tolerant to fungi

The use of resistant varieties is a long-term but promising solution to reduce chemical input in viticulture. Several important breeding programs in Europe and abroad are now releasing a range of new hybrids that perform well regarding fungal susceptibility and produce good quality wines. Unfortunately, breeders pay insufficient attention to the adaptation of these varieties to climatic changes, notably to increased climatic demand and water deficit (WD). Thus, prior to the adoption of such varieties by the wine industry in Mediterranean regions, there is a need to consider their suitability to WD. This study aimed to characterize the different drought strategies adopted by 6 new resistant varieties selected by INRAE in comparison to Syrah. To allow the assessment of long-term impacts of WD, field-grown vines were exposed to contrasting WD levels from 2018 to 2021 under a semi-arid Mediterranean climate. A gradient of WD was applied in the field and controlled through plant measurements at the single plant level. Grape development was non-destructively monitored to determine the arrest of berry phloem unloading. The impacts of WD on berry composition, including water, primary metabolites (sugars, organic acids), secondary metabolites (anthocyanins, thiol precursors) and main cation contents, were assessed at this specific stage. Results showed different varietal responses during the year and inter-annual acclimation in terms of plant water use efficiency, biomass accumulation, yield components and berry composition. WD differentially reduced the accumulation of primary metabolites at plant and berry levels, but it changed their concentrations in the fruits at the ripe stage only slightly. Moreover, WD differentially impacted the accumulation of secondary metabolites and major cations between the varieties.
In the talk, we will present the main results regarding the impacts of WD on fruit metabolites and broaden the discussion of the practical assessment of grapevine acclimation to WD.