Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate projections over France wine-growing region and its potential impact on phenology

Climate change represents a major challenge for the French wine industry. Climatic conditions in French vineyards have already changed and will continue to evolve. One of the notable effects on grapevine is the advancing growing season. The aim of this study is to characterise the evolution of agroclimatic indicators (Huglin index, number of hot days, mean temperature, cumulative rainfall and number of rainy days during the growing season) at French wine-growing regions scale between 1980 and 2019 using gridded data (8 km resolution, SAFRAN) and for the middle of the 21th century (2046-2065) with 21 GCMs statistically debiased and downscaled at 8 km. A set of three phenological models were used to simulate the budburst (BRIN, Smoothed-Utah), flowering, veraison and theoretical maturity (GFV and GSR) stages for two grape varieties (Chardonnay and Cabernet-Sauvignon) over the whole period studied. All the French wine-growing regions show an increase in both temperatures during the growing season and Huglin index. This increase is accompanied by an advance in the simulated flowering (+3 to +9 days), veraison (+6 to +13 days) and theoretical maturity (+6 to +16 days) stages, which are more noticeable in the north-eastern part of France. The climate projections unanimously show, for all the GCMs considered, a clear increase in the Huglin index (+662 to 771 °C.days compared to the 1980-1999 period) and in the number of hot days (+5.6 to 22.6 days) in all the wine regions studied. Regarding rainfall, the expected evolution remains very uncertain due to the heterogeneity of the climates simulated by the 21 models. Only 4 regions out of 21 have a significant decrease in the number of rainy days during the growing season. The two budburst models show a strong divergence in the evolution of this stage with an average difference of 18 days between the two models on all grapevine regions. The theoretical maturity is the most impacted stage with a potential advance between 40 and 23 days according to wine-growing regions.

Phenological characterization of a wide range of Vitis Vinifera varieties

In order to study the impact of climate change on Bordeaux grape varieties and to assess the adaptation capacities of candidates to the grape varieties of this wine region to the new climatic conditions, an experimental block design composed of 52 grape varieties was set up in 2009 at the INRAE Bordeaux Aquitaine center. Among the many parameters studied, the three main phenological stages of the vine (budburst, flowering and veraison) have been closely monitored since 2012. Observations for each year, stage and variety were carried out on four independent replicates. Precocity indices have been calculated from the data obtained over the 2012-2021 period (Barbeau et al. 1998). This work allowed to group the phenological behaviour of the grapevine varieties, not only based on the timing of the subsequent developmental stages, but also on the overall precocity of the cycle and the total length of the cycle between budburst and veraison. Results regarding the variability observed among the different grape varieties for these phenological stages are presented as heat maps.

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne” representing 37 organizations from the different wine regions, have been working increasingly closely over the last 2 decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid 1980’s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. INRAe Vassal collection plays a key role here as it holds the largest diversity available, along with a rich bibliography and herbariums, offering us the opportunity to document and double check the identity of a cultivar, consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region, through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, IFV selection center takes over to initiate the process of selection and propagation. A specific focus within regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif, are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these have been neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered as positive qualities, not only in mitigation of the effects of climate change, but also as an opportunity for restoring diversity…

Amino nitrogen content in grapes: the impact of crop limitation

As an essential element for grapevine development and yield, nitrogen is also involved in the winemaking process and largely affects wine composition. Grape must amino nitrogen deficiency affects the alcoholic fermentation kinetics and alters the development of wine aroma precursors. It is therefore essential to control and optimize nitrogen use efficiency by the plant to guarantee suitable grape nitrogen composition at harvest. Understanding the impact of environmental conditions and cultural practices on the plant nitrogen metabolism would allow us to better orientate our technical choices with the objective of quality and sustainability (less inputs, higher efficiency). This trial focuses on the impact of crop limitation – that is a common practice in European viticulture – on nitrogen distribution in the plant and particularly on grape nitrogen composition. A wide gradient of crop load was set up in a homogeneous plot of Chasselas (Vitis vinifera) in the experimental vineyard of Agroscope, Switzerland. Dry weight and nitrogen dynamics were monitored in the roots, trunk, canopy and grapes, during two consecutive years, using a 15N-labeling method. Grape amino nitrogen content was assessed in both years, at veraison and at harvest. The close relationship between fruits and roots in the maintenance of plant nitrogen balance was highlighted. Interestingly, grape nitrogen concentration remained unchanged regardless of crop load to the detriment of the growth and nitrogen content of the roots. Meanwhile, the size and the nitrogen concentration of the canopy were not affected. Leaf gas exchange rates were reduced in response to lower yield conditions, reducing carbon and nitrogen assimilation and increasing intrinsic water use efficiency. The must amino nitrogen profiles could be discriminated as a function of crop load. These findings demonstrate the impact of plant balance on grape nitrogen composition and contribute to the improvement of predictive models and sustainable cultural practices in perennial crops.

VINIoT – Precision viticulture service

The project VINIoT pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system is based on spectral images and an IoT architecture that allows assessing parameters of interest viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard) will be designed. In France, three subjects were specifically developed: evaluation of maturity, of water stress, and detection of flavescence dorée. For the evaluation of maturity, it has been decided first to work at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the spectral hyperstal image as well as the reference analyzes to measure the maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022), to improve prediction model performance was applied on spectra after the acquirement of hyperspectral images. Regarding the evaluation of water stress, to work with a significant variability in terms of water status, it has been worked first with potted plants under 2 different water regimes. The facilities have allowed the supervision of irrigation and micro-climatic conditions. The regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan has consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyper-spectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy proved the potential towards the discrimination of healthy and infected leaves by flavescence dorée based on the use of hyperspectral images (Mas Garcia et al., 2021).