Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Leaf vine content in nutrients and trace elements in La Mancha (Spain) soils: influence of the rootstock

The use of rootstock of American origin has been the classic method of fighting against Phylloxera for more than 100 years. For this reason, it is interesting to establish if different rootstock modifies nutrient composition as well as trace elements content that could be important for determining the traceability of the vine products. A survey of four classic rootstocks (110-Richter, SO4, FERCAL and 1103-Paulsen) and four new ones (M1, M2, M3 and M4) provided by Agromillora Iberia. S.L.U., all of them grafted with the Tempranillo variety, has been carried out during 2019. The eight rootstocks were planted in pots of 500 cc, on three soils with very different characteristics from Castilla-La Mancha (Spain). In the month of July, the leaves were collected and dried in a forced air oven for seven days at 40ºC. Then, the samples were prepared for the analysis determination, carried out by X-Ray fluorescence spectrometry. The results obtained showed that in the case of content in mineral elements in leaf, separated by soil type, we can report the importance of few elements such as Si, Fe, Pb and, especially, Sr. The rootstock does not influence the composition of the vine leaf for the studied elements that are the most important in determining the geochemical footprint of the soil. The influence of the soil can be discriminated according to some elements such as Fe, Pb, Si and, especially, Sr.

Aromatic maturity is a cornerstone of terroir expression in red wine

Harvesting grapes at adequate maturity is key to the production of high-quality red wines. Enologists and wine makers define several types of maturity, including technical maturity, phenolic maturity and aromatic maturity. Technical maturity and phenolic maturity are relatively well documented in the scientific literature, while articles on aromatic maturity are scarcer. This is surprising, because aromatic maturity is, without a doubt, the most important of the three in determining wine quality and typicity (including terroir expression). Optimal terroir expression can be obtained when the different types of maturity are reached at the same time, or within a short time frame. This is more likely to occur when the ripening takes place under mild temperatures, neither too cool, nor too hot. Aromatic expression in wine can be driven, from low to high maturity, by green, herbal, fresh fruit, ripe fruit, jammy fruit, candied fruit or cooked fruit aromas. Green and cooked fruit aromas are not desirable in red wines, while the levels of other aromatic compounds contribute to the typicity of the wine in relation to its origin. Wines produced in cool climates, or on cool soils in temperate climates, are likely to express herbal or fresh fruit aromas; while wines produced under warm climates, or on warm soils in temperate climates, may express ripe fruit, jammy fruit or candied fruit aromas. Growers can optimize terroir expression through their choice of grapevine variety. Early ripening varieties perform better in cool climates and late ripening varieties in warm climates. Additionally, maturity can be advanced or delayed by different canopy management practices or training systems.

δ13C : A still underused indicator in precision viticulture  

The first demonstration of the interest of carbon isotope composition of sugars in grapevine, as an integrated indicator of vineyard water status, dates back to 2000 (Gaudillère et al., 1999; Van Leeuwen et al., 2001). Thanks to the isotopic discrimination of Carbon that takes place during plant photosynthesis, under hydric stress conditions, it is possible to accurately estimate the photosynthetic activity. Ever since, δ13C has been widely applied with success to zonation, terroir studies and vine physiology research, but is still not widely used by viticulturists. This is quite astonishing by considering the impact of global warming on viticulture and the need to improve water management, that would justify a widespread use of δ13C.
The lack of private laboratories proposing the analysis, the cost of the technology, as well as the long analytical delays, have been detrimental to its development. Some laboratories tried to overcome the analytical difficulties of isotopic analysis by using fourier transformed infrared spectroscopy, as a fast and cheap alternative to the official OIV method (IRMS). These claimed FTIR models have never been published or peer reviewed and cannot be considered robust. In this work, thanks to the recent acquisition of IRMS technology, new modern and robust applications of δ13C for viticulture are proposed. This includes the use of the analysis to make parcel separations at harvesting, the possibility to increase the precision of hydric stress cartography and the potential cost reduction when compared with Scholander pressure bomb analysis.

Adaptation to soil and climate through the choice of plant material

Choosing the rootstock, the scion variety and the training system best suited to the local soil and climate are the key elements for an economically sustainable production of wine. The choice of the rootstock/scion variety best adapted to the characteristics of the soil is essential but, by changing climatic conditions, ongoing climate change disrupts the fine-tuned local equilibrium. Higher temperatures induce shifts in developmental stages, with on the one hand increasing fears of spring frost damages and, on the other hand, ripening during the warmest periods in summer. Expected higher water demand and longer and more frequent drought events are also major concerns. The genetic control of the phenotypes, by genomic information but also by the epigenetic control of gene expression, offers a lot of opportunities for adapting the plant material to the future. For complex traits, genomic selection is also a promising method for predicting phenotypes. However, ecophysiological modelling is necessary to better anticipate the phenotypes in unexplored climatic conditions Genetic approaches applied on parameters of ecophysiological models rather than raw observed data are more than ever the basis for finding, or building, the ideal varieties of the future.

Co-design and evaluation of spatially explicit strategies of adaptation to climate change in a Mediterranean watershed

Climate change challenges differently wine growing systems, depending on their biophysical, sociological and economic features. Therefore, there is a need to locally design and evaluate adaptation strategies combining several technical options, and considering the local opportunities and constraints (e.g. water access, wine typicity). The case study took place in a typical and heterogeneous Mediterranean vineyard of 1,500 ha in the South of France. We developed a participatory modeling approach to (1) conceptualize local climate change issues and design spatially explicit adaptation strategies with stakeholders, (2) numerically evaluate their effects on phenology, yield and irrigation needs under the high-emissions climate change scenario RCP 8.5, and (3) collectively discuss simulation results. We organized five sets of workshops, with in-between modeling phases. A process-based model was developed that allowed to evaluate the effects of six technical options (late varieties, irrigation, water saving by reducing canopy size, adjusting cover cropping, reducing density, and shading) with various distributions in the watershed, as well as vineyard relocation. Overall, we co-designed three adaptation strategies. Delay harvest strategy with late varieties showed little effects on decreasing air temperature during ripening. Water constraint limitation strategy would compensate for production losses if disruptive adaptations (e.g. reduced density) were adopted, and more land got access to irrigation. Relocation strategy would foster high premium wine production in the constrained mountainous areas where grapevine is less impacted by climate change. This research shows that a spatial distribution of technical changes gives room for adaptation to climate change, and that the collaboration with local stakeholders is a key to the identification of relevant adaptation. Further research should explore the potential of adaptation strategies based on soil quality improvement and on water stress tolerant varieties.