Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Influence of agronomic practices in soil water content in mid-mountain vineyards

In the context of LIFE project MIDMACC (LIFE18 CCA/ES/001099), several pilots have been installed in vineyards in mid mountain areas of Catalonia (NE Spain) to test well stablished agronomic practices to increase the adaptation of Mediterranean mid mountain to climate change. Soil water content (SWC) at three different depths (15, 30 and 45cm) was measured in continuum from August 2020. One pilot (WC) included a well-established green cover (GC), a new GC (NC) and a conventional soil management (CM, tilling+herbicides). NC presented an intermediate state between WC and CM, responding similarly to CM in autumn but quickly reaching similar SWC to WC, then following the same evolution till next spring, with CM presenting lower values along autumn and winter. Then vegetation activation decreased SWC in all plots, (much slower in CM, lacking GC). Sensibility to spring rains is again intermediate for NC, which joins SWC evolution of CM by the end of spring till next autumn. It is expected that NC will resemble WC more and more as its GC develops. In the pilot combining vine training (VSP vs Gobelet) and hillside management (slope vs terrace), no clear pattern could be related with these conditions. However, both terraces seem to be more sensitive to spring rains. A third pilot included new vineyards (7 and 1 year old). In the new vineyard (N), higher canopy development, a spontaneous green cover and row straw resulted in a slower SWC dynamic, not so sensitive to rains but conserving more soil water in spring and most of summer, even with presumably a higher water extraction by vines. In the newest vineyard (VN) the deepest sensor is still sensitive to rain events all over the year and SWC is always highest at this depth, revealing small water capture by vines.

Traditional agroforestry vineyards, sources of inspiration for the agroecological transition of viticulture

A unique “terroir” can be found in southern Bolivia, which combines the specific features of climate, topography and altitude of high valleys, with the management of grapevines staked on trees. It is one of the rare remnants of agroforestry viticulture. A survey was carried out among 29 grapegrowers in three valleys, to characterize the structure and management of these vineyards, and identify the services they expect from trees. Farms were small (2.2 ha on average) and 85% of vineyards were less than 1 ha. Viticulture was associated with vegetable, fruit and fodder production, sometimes in the same fields. Molle trees were found in all plots, together with one or two other native tree species. Traditional grapevine varieties such as Negra Criolla, Moscatel de Alejandría and Vicchoqueña were grown with a large range of densities from 1550 to 9500 vines ha-1. From 18 to 30% of them were staked on trees, with 1.2 to 4.9 vines per tree. The management of these vineyards (irrigation, fertilization and grapevine protection) was described, the most particular technical operation being the coordinated pruning of trees and grapevines. Three types of management could be identified in the three valleys. Grapegrowers had a clear idea of the ecosystem services they expected from trees in their vineyards. The main one was protection against climate hazards (hail, frost, flood). Then they expected benefits in terms of pest and disease control, improvement of soil fertility and resulting yield. At last, some producers claimed that tree-staking was quicker and cheaper than conventional trellising. It can be hypothesized then that agroforestry is a promising technique for the agroecological transition of viticulture. Its contribution to the “terroir” of the high valleys of southern Bolivia and its link with the specificities of the wines and spirits produced there remain to be explored.

Adaptability of grapevines to climate change: characterization of phenology and sugar accumulation of 50 varieties, under hot climate conditions

Climate is the major factor influencing the dynamics of the vegetative cycle and can determine the timing of phenological periods. Knowledge of the phenology of varieties, their chronological duration, and thermal requirements, allows not only for the better management of interventions in the vineyard, but also to predict the varieties’ behaviour in a scenario of climate change, giving the wine producer the possibility of selecting the grape varieties that are best adapted to the climatic conditions of a certain terroir. In 2014, Symington Family Estates, Vinhos, established two grape variety libraries in two different places with distinctive climate conditions (Douro Superior, and Cima Corgo), with the commitment of contributing to a deeper agronomic and oenological understanding of some grape varieties, in hot climate conditions. In these research vineyards are represented local varieties that are important in the regional and national viticulture, but also others that have over time been forgotten — as well as five international reference cultivars. From 2017 to 2021, phenological observations have been made three times a week, following a defined protocol, to determine the average dates of budbreak, flowering and veraison. With the climate data of each location, the thermal requirements of each variety and the chronological duration of each phase have been calculated. During maturation, berry samples have been gathered weekly to study the dynamics of sugar accumulation, between other parameters. The data was analysed applying phenological and sugar accumulation models available in literature. The results obtained show significant differences between the varieties over several parameters, from the chronological duration and thermal requirements to complete the various stages of development, to the differences between the two locations, confirming the influence of the climate on phenology and the stages of maturation, in these specific conditions.

Genotypic variability in root architectural traits and putative implications for water uptake in grafted grapevine

Root system architecture (RSA) is important for soil exploration and edaphic resources acquisition by the plant, and thus contributes largely to its productivity and adaptation to environmental stresses, particularly soil water deficit. In grafted grapevine, while the degree of drought tolerance induced by the rootstock has been well documented in the vineyard, information about the underlying physiological processes, particularly at the root level, is scarce, due to the inherent difficulties in observing large root systems in situ. The objectives of this study were to determine genetic differences in the root architectural traits and their relationships to water uptake in two Vitis rootstocks genotypes (RGM, 140Ru) differing in their adaptation to drought. Young rootstocks grafted upon the Riesling variety were transplanted into cylindrical tubes and in 2D rhizotrons under two conditions, well watered and moderate water stress. Root traits were analyzed by digital imaging and the amount of transpired water was measured gravimetrically twice a week. Root phenotyping after 30 days reveal substantial variation in RSA traits between genotypes despite similar total root mass; the drought-tolerant 140Ru showed higher root length density in the deep layer, while the drought-sensitive RGM was characterised by shallow-angled root system development with more basal roots and a larger proportion of fine roots in the upper half of the tube. Water deficit affected canopy size and shoot mass to a greater extent than root development and architectural-related traits for both 140Ru and RGM, suggesting vertical distribution of roots was controlled by genotype rather than plasticity to soil water regime. The deeper root system of 140Ru as compared to RGM correlated with greater daily water uptake and sustained stomata opening under water-limited conditions but had little effect on above-ground growth. Our results highlight that grapevine rootstocks have constitutively distinct RSA phenotypes and that, in the context of climate change, those that develop an extensive root network at depth may provide a desirable advantage to the plant in coping with reduced water resources.

Influence of grapevine rootstock/scion combination on rhizosphere and root endophytic microbiomes

Soil is a reservoir of microorganisms playing important roles in biogeochemical cycles and interacting with plants whether in the rhizosphere or in the root endosphere. The composition of the microbial communities thus impacts the plant health. Rhizodeposits (such as sugar, organic and amino acids, secondary metabolites, dead root cells …) are released by the roots and influence the communities of rhizospheric microorganisms, acting as signaling compounds or carbon sources for microbes. The composition of root exudates varies depending on several factors including genotypes. As most of the cultivated grapevines worldwide are grafted plants, the aim of this study was to explore the influence of rootstock and scion genotypes on the microbial communities of the rhizosphere and the root endosphere. The work was conducted in the GreffAdapt plot (55 rootstocks x 5 scions), in which the 275 combinations have been planted into 3 blocks designed according to the soil resistivity. Samples of roots and rhizosphere of 10 scion x rootstock combinations were first collected in May among the blocks 2 and 3. The quantities of bacteria, fungi and archaea have been assessed in the rhizosphere by quantitative PCR, and by cultivable methods for bacteria and fungi. The communities of bacteria, fungi and arbuscular mycorrhizal fungi (AMF) was analyzed by Illumina sequencing of 16S rRNA gene, ITS and 28S rRNA gene, respectively. The level of mycorrhization was also evaluated using black ink coloration of newly formed roots harvested in October. The level of bacteria, fungi and archaea was dependent on rootstock and scion genotypes. A block effect was observed, suggesting that the soil characteristics strongly influenced the microorganisms from the rhizosphere and root endosphere. High-throughput sequencing of the different target genes showed different communities of bacteria, fungi and AMF associated with the scion x rootstock combinations. Finally, all the combinations were naturally mycorrhized. The root mycorrhization intensity was influenced by the rootstock genotype, but not by the scion one. Altogether, these results suggest that both rootstock and scion genotypes influence the rhizosphere and root endophytic microbiomes. It would be interesting to analyze the biochemical composition of the rhizodeposition of these genotypes for a better understanding of the processes involved in the modulation of these microbiomes. Moreover, crossing our data with the plant agronomic characteristics could provide insights into their roles on plant fitness.