Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
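The first two steps can be illustrated with a minimal sketch in Python using NumPy only. It is not the authors' implementation: the abstract names neither the decomposition model nor the transformation applied to the segments, so a plain CP (PARAFAC) model fitted by alternating least squares is assumed here as a stand-in. The function names `segment_tensor`, `khatri_rao` and `cp_als`, the segment width, the rank and the synthetic data are all hypothetical.

```python
import numpy as np


def segment_tensor(data, width):
    """Split a (samples x scans x m/z) array into windows along the scan axis."""
    n_scans = data.shape[1]
    return [data[:, start:start + width, :] for start in range(0, n_scans, width)]


def khatri_rao(a, b):
    """Column-wise Kronecker product of a (I x R) and b (J x R), shape (I*J x R)."""
    rank = a.shape[1]
    return (a[:, None, :] * b[None, :, :]).reshape(-1, rank)


def cp_als(tensor, rank, n_iter=200, seed=0):
    """Assumed stand-in model: CP (PARAFAC) fitted by alternating least squares."""
    rng = np.random.default_rng(seed)
    i, j, k = tensor.shape
    a = rng.random((i, rank))
    b = rng.random((j, rank))
    c = rng.random((k, rank))
    # Mode unfoldings whose column order matches khatri_rao above
    x0 = tensor.reshape(i, j * k)
    x1 = np.moveaxis(tensor, 1, 0).reshape(j, i * k)
    x2 = np.moveaxis(tensor, 2, 0).reshape(k, i * j)
    for _ in range(n_iter):
        # Update one factor at a time while the other two are held fixed
        a = x0 @ np.linalg.pinv(khatri_rao(b, c)).T
        b = x1 @ np.linalg.pinv(khatri_rao(a, c)).T
        c = x2 @ np.linalg.pinv(khatri_rao(a, b)).T
    return a, b, c


# Hypothetical synthetic data: 6 samples x 40 scans x 10 m/z channels,
# built from four compounds with Gaussian elution profiles.
rng = np.random.default_rng(1)
scans = np.arange(40)
centers = [6, 12, 26, 32]
elution = np.stack(
    [np.exp(-0.5 * ((scans - mu) / 2.0) ** 2) for mu in centers], axis=1
)
spectra = rng.random((10, 4))
conc = rng.random((6, 4)) + 0.1
data = np.einsum("ir,jr,kr->ijk", conc, elution, spectra)

# Decompose each retention-time segment separately and collect the
# sample-mode loadings as features; no peak picking or alignment is involved.
segments = segment_tensor(data, width=20)
features = np.hstack([cp_als(seg, rank=2)[0] for seg in segments])
print(features.shape)  # (6, 4)
```

In such a setup the `features` matrix (one row per sample) would be the input to the supervised step, for example a gradient boosted tree classifier such as scikit-learn's `GradientBoostingClassifier`; that step is not shown here.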

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
