Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
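The first two steps of this strategy can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`segment_tensor`, `khatri_rao`, `cp_als`), the fixed segment width, the random synthetic data, and the choice of a plain CP (PARAFAC-type) model fitted by alternating least squares are all assumptions made for illustration. The abstract only states that "multiway decomposition" is applied to transformed segments, and the transformation itself is omitted here.

```python
import numpy as np


def segment_tensor(data, segment_width):
    """Split a (samples, scans, m/z) array into windows along the scan axis."""
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def khatri_rao(a, b):
    """Column-wise Kronecker product of a (I, R) and b (J, R) -> (I*J, R)."""
    return (a[:, None, :] * b[None, :, :]).reshape(-1, a.shape[1])


def cp_als(tensor, rank, n_iter=200, seed=0):
    """CP decomposition of a 3-way array by alternating least squares.

    Returns one factor matrix per mode (samples, scans, m/z).
    Illustrative stand-in for the multiway model; not the published method.
    """
    rng = np.random.default_rng(seed)
    factors = [rng.random((dim, rank)) for dim in tensor.shape]
    for _ in range(n_iter):
        for mode in range(3):
            others = [factors[m] for m in range(3) if m != mode]
            kr = khatri_rao(others[0], others[1])
            # Unfold so that rows index `mode` and columns the other two modes.
            unfolded = np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)
            gram = (others[0].T @ others[0]) * (others[1].T @ others[1])
            # Least-squares update of this mode with the other two held fixed.
            factors[mode] = unfolded @ kr @ np.linalg.pinv(gram)
    return factors


# Synthetic stand-in for real data: 6 samples x 40 scans x 5 m/z channels.
rng = np.random.default_rng(1)
data = rng.random((6, 40, 5))
segments = segment_tensor(data, segment_width=16)

# One feature block per segment: the sample-mode scores of its decomposition.
features = np.hstack([cp_als(seg, rank=2, n_iter=50)[0] for seg in segments])
print(features.shape)
```

Because each segment is decomposed on its own, no peak picking or alignment across samples is needed in this sketch. The resulting sample-by-component matrix is the kind of input that could then be passed to a gradient boosted tree classifier, for example scikit-learn's `GradientBoostingClassifier`, together with the class labels of the samples.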

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
