Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Low-cost sensors as a support tool to monitor soil-plant heat exchanges in a Mediterranean vineyard

Mediterranean viticulture is increasingly exposed to more frequent extreme conditions such as heat waves. These extreme events co-occur with low soil water content, high air vapor pressure deficit and high solar radiant energy fluxes and result in leaf and berry sunburn, lower yield, and berry quality, which is a major constraint for the sustainability of the sector. Grape growers must find ways to proper and effectively manage heat waves and extreme canopy and berry temperatures. Irrigation to keep soil moisture levels and enable adequate plant turgor, and convective and evaporative cooling emerged as a key tool to overcome this major challenge. The effects of irrigation on soil and plant water status are easily quantifiable but the impact of irrigation on soil and canopy temperature and on heat convection from soil to cluster zone remain less characterized. Therefore, a more detailed quantification of vineyard heat fluxes is highly relevant to better understand and implement strategies to limit the effects of extreme weather events on grapevine leaf and berry physiology and vineyards performance. Low-cost sensor technologies emerge as an opportunity to improve monitoring and support decision making in viticulture. However, validation of low-cost sensors is mandatory for practical applicability. A two-year study was carried in a vineyard in Alentejo, south of Portugal, using low-cost thermal cameras (FLIR One, 80×60 pixels and FLIR C5, 160×120 pixels, 8-14 µm, FLIR systems, USA) and pocket thermohygrometers (Extech RHT30, EXTECH instruments, USA) to monitor grapevine and soil temperatures. Preliminary results show that low-cost cameras can detect severe water stress and support the evaluation of vertical canopy temperature variability, providing information on soil surface temperature. All these thermal parameters can be relevant for soil and crop management and be used in decision support systems.

Photoselective shade films affect grapevine berry secondary metabolism and wine composition

Grapevine physiology and production are challenged by forecasted increases in temperature and water deficits. Within this scenario, photoselective overhead shade films are promising tools in warm viticulture areas to overcome climate change related factors. The aim of this study was to evaluate the vulnerability of ‘Cabernet Sauvignon’ grape berry to solar radiation overexposure and optimize shade film use for berry integrity. A randomized complete block design field study was conducted across two years (2020-2021) in Oakville, Napa Valley, CA, with four shade films (D1, D3, D4, D5) differing in the percent of radiation spectra transmitted and compared to an uncovered control (C0). Integrals for gas exchange parameters and mid-day stem water potential were unaffected by the shade films in 2020 and 2021. By harvest, berries from uncovered and shaded vines did not differ in their size or primary metabolism in either year. Despite precipitation exclusion during the dormant season in the shaded treatments, yield did not differ between them and the control in either season. In 2020, total skin anthocyanins (mg/g fresh mass) in the shaded treatments was greater than C0 during berry ripening and at harvest. Conversely, flavonol concentrations in 2020 were reduced in shaded vines compared to C0. The 2020 growing season highlighted the impact of heat degradation on flavonoids. Flavonoid concentrations in 2021 increased until harvest while flavonoid degradation was apparent from veraison to harvest in 2020 across shaded and control vines. Wine analyses highlighted the importance of light spectra to modify wine composition. Wine color intensity, tonality and anthocyanin values were enhanced in D4 whereas antioxidant properties were enhanced in C0 and D5 wines. Altogether, our results highlighted the need of new approaches in warm viticulture areas given the impact that composition of light has on berry and wine quality.

Upscaling the integrated terroir zoning through digital soil mapping: a case study in the Designation of Origin Campo de Borja

homogeneous zones by intersecting several partial zonings of major factors that influence vineyard growth. Each of them follows specific process from their corresponding disciplines. Soil zoning specifically refers to a Soil Resource Inventory map that has traditionally been generated by conventional soil mapping methods. These methods have shortcomings in reaching fine cartographic and categorical details and involve significant expenses, which undermines their applicability. A new framework named Digital Soil Mapping has introduced quantitative models by statistical techniques to establish soil-landscape relationships and is able to provide intensive scale cartography.

In the present study, a microzoning at 1:10.000 scale is generated from an initial zoning, where the conventional soil map with polytaxic map units is replaced by a new one from digital techniques that disaggregates them. The comparison between the zonings considers a quantitative evaluation of capability for each Homogeneous Terroir Unit by means of the Viticultural Quality Index and its categorization based on its distribution by map. The spatial intersection of both maps gives rise to a confusion matrix in which the flows of class variations after the substitution are assessed.

The results show a five-fold increase in the number of Homogeneous Terroir Units identified and a larger differentiation among them, evidenced by a wider range in the capability index distribution. Both elements are accompanied by an increase in the detection of areas of higher potential within previously undervalued uniform zones.These features are a direct effect of the improvements brought by Digital Soil Mapping techniques and would verify the advantages of their implementation in the Integrated Terroir zoning. Eventually, such new highly detailed terroir units would benefit precision viticulture and sustainable management practices.

Impact of geographical location on the phenolic profile of minority varieties grown in Spain. II: red grapevines

Because terroir and cultivar are drivers of wine quality, is essential to investigate theirs effects on polyphenolic profile before promoting the implantation of a red minority variety in a specific area. This work, included in MINORVIN project, focuses in the polyphenolic profile of 7 red grapevines minority varieties of Vitis vinifera L. (Morate, Sanguina, Santafe, Terriza Tinta Jeromo Tortozona Tinta) and Tempranillo) from six typical viticulture Spanish areas: Aragón (A1), Cataluña (A2), Castilla la Mancha (A3), Castilla –León (A4), Madrid (A5) and Navarra (A6) of 2020 season. Polyphenolic substances were extracted from grapes. 35 compounds were identified and quantified (mg subtance/kg fresh berry) by HPLC and grouped in anthocyanins (ANT) flavanols (FLAVA), flavonols (FLAVO), hydroxycinnamic (AH), benzoic (BA) acids and stilbenes (ST). Antioxidant activity (AA, mmol TE /g fresh berry) was determined by DPPH method. The results were submitted to a two-way ANOVA to investigate the influence of variety, area and their interaction for each polyphenolic family and cluster analysis was used to construct hierarchical dendrograms, searching the natural groupings among the samples. Sanguina (A3) had the most of total polyphenols while Tempranillo (A5) those of ANT. Sanguina (A2) and (A3) reached the highest values of FLAVO, FLAVA and AA. These two last samples had also the maximum of AA. The effect cultivar and area were significant for all polyphenolic families analyzed. A high variability due to variety (>50%) was observed in FLAVA and the maximum value of variability due to growing area was detected in AA (86.41%), ANT and FLAVO (51%); the interaction variety*zone was significant only for ANT, FLAVO, EST and AA. Finally, dendrograms presented five cluster: i) Sanguina (A2); ii) Sanguina (A3); iii) Tempranillo (A5); iv) Tempranillo (A3); Terriza (A3,A5), Morate (A5,A6); v) Santafé (A1,A6); Tortozona tinta (A1,A3,A6); Tinta Jeromo (A3,A4).

Sustaining wine identity through intra-varietal diversification

With contemporary climate change, cultivated Vitis vinifera L. is at risk as climate is a critical component in defining ecologically fitted plant materiel. While winegrowers can draw on the rich diversity among grapevine varieties to limit expected impacts (Morales-Castilla et al., 2020), replacing a signature variety that has created a sense of local distinctiveness may lead to several challenges. In order to sustain wine identity in uncertain climate outcomes, the study of intra-varietal diversity is important to reflect the adaptive and evolutionary potential of current cultivated varieties. The aim of this ongoing study is to understand to what extent can intra-varietal diversity be a climate change adaptation solution. With a focus on early (Sauvignon blanc, Riesling, Grolleau, Pinot noir) to moderate late (Chenin, Petit Verdot, Cabernet franc) ripening varieties, data was collected for flowering and veraison for the various studied accessions (from conservatory plots) and clones. For these phenological growing stages, heat requirements were established using nearby weather stations (adapted from the GFV model, Parker et al., 2013) and model performances were verified. Climate change projections were then integrated to predict the future behaviour of the intra-varietal diversity. Study findings highlight the strong phenotypic diversity of studied varieties and the importance of diversification to enhance climate change resilience. While model performances may require improvements, this study is the first step towards quantifying heat requirements of different clones and how they can provide adaptation solutions for winegrowers to sustain local wine identity in a global changing climate. As genetic diversity is an ongoing process through point mutations and epigenetic adaptations, perspective work is to explore clonal data from a wide variety of geographic locations.