Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

A blueprint for managing vine physiological balance at different spatial and temporal scales in Champagne

In Champagne, the vine adaptation to different climatic and technical changes during these last 20 years can be seen through physiological balance disruptions. These disruptions emphasize the general grapevine decline. Since the 2000s, among other nitrogen stress indicators, the must nitrogen has been decreasing. The combination of restricted mineral fertilizers and herbicide use, the growing variability of spring rainfall, the increasing thermal stress as well as the soil type heterogeneity are only a few underlying factors that trigger loss of physiological balance in the vineyards. It is important to weigh and quantify the impact of these factors on the vine. In order to do so, the Comité Champagne uses two key-tools: networking and modelization. The use of quantitative and harmonized ecophysiological indicators is necessary, especially in large spatial scales such as the Champagne appellation. A working group with different professional structures of Champagne has been launched by the Comité Champagne in order to create a common ecophysiology protocol and thus monitor the vine physiology, yearly, around 100 plots, with various cultural practices and types of soil. The use of crop modelling to follow the vine physiological balance within different pedoclimatic conditions enables to understand the present balance but also predict the possible disruptions to come in future climatic scenarios. The physiological references created each year through the working group, benefit the calibration of the STICS model used in Champagne. In return, the model delivers ecophysiology indicators, on a daily scale and can be used on very different types of soils. This study will present the bottom-up method used to give accurate information on the impacts of soil, climate and cultural practices on vine physiology.

Updating the Winkler index: An analysis of Cabernet sauvignon in Napa Valley’s varied and changing climate

This study aims to create an updated, agile viticultural climate index (similar to the Winkler Index) by performing in-depth analyses of current and historical data from industry partners in several major winegrowing regions. The Winkler Index was developed in the early twentieth century based on analysis of various grape-growing regions in California. The index uses heat accumulation (i.e. Growing Degree Days) throughout the growing season to determine which grape varieties are best suited to each region. As viticultural regions are increasingly subject to the complexity and uncertainty of a changing climate, a more rigorous, agile model is needed to aid grape growers in determining which cultivars to plant where. For the first phase of this study, 21 industry partners throughout Napa Valley shared historical phenology, harvest, viticultural practice, and weather data related to their Cabernet sauvignon vineyard blocks. To complement this data, berry samples were collected throughout the 2021 growing season from 50 vineyard blocks located throughout 16 American Viticultural Areas that were then analyzed for basic berry chemistry and phenolics. These blocks have been mapped using a Geographic Information System (GIS), enabling analysis of altitude, vineyard row orientation, slope, and remotely sensed climate data. Sampling sites were also chosen based on their proximity to a weather station. By analyzing historical data from industry partners and data specifically collected for this study, it is possible to identify key parameters for further analysis. Initial results indicate extreme variability at a high spatial resolution not currently accounted for in modern viticultural climate indices and suggest that viticultural practices play a major role. Using the structure of data collection and analyses developed for the first phase, this project will soon be expanded to other wine regions globally, while continuing data collection in Napa Valley.

Understanding graft union formation by using metabolomic and transcriptomic approaches during the first days after grafting in grapevine

Since the arrival of Phyloxera (Daktulosphaira vitifolia) in Europe at the end of the 19th century, grafting has become essential to cultivate Vitis vinifera. Today, grafting provides not only resistance to this aphid, but it used to adapt the cultivars according to the type of soil, environment, or grape production requirements by using a panel of rootstocks. As part of vineyard decline, it is often mentioned the importance of producing quality grafted grapevine to improve vineyard longevity, but, to our knowledge, no study has been able to demonstrate that grafting has a role in this context. However, some scion/rootstock combinations are considered as incompatible due to poor graft union formation and subsequently high plant mortality soon after grafting. In a context of climate change where the creation of new cultivars and rootstocks is at the centre of research, the ability of new cultivars to be grafted is therefore essential. The early identification of graft incompatibility could allow the selection of non-viable plants before planting and would have a beneficial impact on research and development in the nursery sector. For this reason, our studies have focused on the identification of metabolic and transcriptomic markers of poor grafting success during the first days/week after grafting; we have identified some correlations between some specialized metabolites, especially stilbenes, and grafting success, as well as an accumulation of some amino acids in the incompatible combination. The study of the metabolome and the transcriptome allowed us to understand and characterise the processes involved during graft union formation.

Effect of fertigation strategies to adapt PGI Côtes de Gascogne production to hot vintage

The development of fertigation could be a possible solution to adapt PGI Côtes de Gascogne (south-western France) wine production to climate change. The goal would be to limit the negative effects of water stress on yield performance expectation (around 15 tons per hectare) and to make the use of fertilizers more efficient. This study aimed to compare the effects of three strategies of water and minerals supply on grapes and wines qualities. Two fertigation practices were compared to a rainfed control which is the current standard of the local grape growing production. The fertilizers (nitrogen and potassium) were (i) fully brought by irrigation pipe during the season, (ii) partially brought by irrigation pipe and partially on the soil or (iii) fully brought on the soil at the beginning of the season for the non-irrigated control (local standard). The trial was run on cv. Colombard trained on spur pruned with vertical shoot positioning system on a sandy-silty-clay soil over the 2020 vintage which was particularly hot for the region. Moderate to strong water deficit appeared during the growing period of the berries and held on after veraison. Irrigation strategies allowed for maintaining grapevine without water deficit and being significantly different from the control water status. Grapevine with fully or partial fertigation strategies produced 25% more yield mainly due to the increase of the bunch weight. Also, the fully fertigation showed the best ratio between yield and maturity and brought 30% less of fertilizers (both nitrogen and potassium) than the two other strategies. Finally, the analysis of aromatic compounds in Colombard wines, varietal thiols family, showed the same level of concentrations for the 3 treatments, confirming that the yield performance did not impact the aromatic potential in this trial.

Mesoclimate impact on Tannat in the Atlantic terroir of Uruguay

The study of climate is relevant as an element conditioning the typicity of a product, its quality and sustainability over the years. The grapevine development and growth and the final grape and wine composition are closely related to temperature, while climate components vary at mesoscale according to topography and/or proximity to large bodies of water. The objective of this work is to assess the mesoclimate of the Atlantic region of Uruguay and to determine the effect of topography and the ocean on temperature and consequently on Tannat grapevine behavior.