Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate and the evolving mix of grape varieties in Australia’s wine regions

The purpose of this study is to examine the changing mix of winegrape varieties in Australia so as to address the question: In the light of key climate indicators and predictions of further climate change, how appropriate are the grape varieties currently planted in Australia’s wine regions? To achieve this, regions are classified into zones according to each region’s climate variables, particularly average growing season temperature (GST), leaving aside within-region variations in climates. Five different climatic classifications are reported. Using projections of GSTs for the mid- and late 21st century, the extent to which each region is projected to move from its current zone classification to a warmer one is reported. Also shown is the changing proportion of each of 21 key varieties grown in a GST zone considered to be optimal for premium winegrape production. Together these indicators strengthen earlier suggestions that the mix of varieties may be currently less than ideal in many Australian wine regions, and would become even less so in coming decades if that mix was not altered in the anticipation of climate change. That is, grape varieties in many (especially the warmest) regions will have to keep changing, or wineries will have to seek fruit from higher latitudes or elevations if they wish to retain their current mix of varieties and wine styles.

Aromatic maturity is a cornerstone of terroir expression in red wine

Harvesting grapes at adequate maturity is key to the production of high-quality red wines. Enologists and wine makers define several types of maturity, including technical maturity, phenolic maturity and aromatic maturity. Technical maturity and phenolic maturity are relatively well documented in the scientific literature, while articles on aromatic maturity are scarcer. This is surprising, because aromatic maturity is, without a doubt, the most important of the three in determining wine quality and typicity (including terroir expression). Optimal terroir expression can be obtained when the different types of maturity are reached at the same time, or within a short time frame. This is more likely to occur when the ripening takes place under mild temperatures, neither too cool, nor too hot. Aromatic expression in wine can be driven, from low to high maturity, by green, herbal, fresh fruit, ripe fruit, jammy fruit, candied fruit or cooked fruit aromas. Green and cooked fruit aromas are not desirable in red wines, while the levels of other aromatic compounds contribute to the typicity of the wine in relation to its origin. Wines produced in cool climates, or on cool soils in temperate climates, are likely to express herbal or fresh fruit aromas; while wines produced under warm climates, or on warm soils in temperate climates, may express ripe fruit, jammy fruit or candied fruit aromas. Growers can optimize terroir expression through their choice of grapevine variety. Early ripening varieties perform better in cool climates and late ripening varieties in warm climates. Additionally, maturity can be advanced or delayed by different canopy management practices or training systems.

Variety and climatic effects on quality scores in the Western US winegrowing regions

Wine quality is strongly linked to climate. Quality scores are often driven by climate variation across different winegrowing regions and years, but also influenced by other aspects of terroir, including variety. While recent work has looked at the relationship between quality scores and climate across many European regions, less work has examined New World winegrowing regions. Here we used scores from three major rating systems (Wine Advocate, Wine Enthusiast and Wine Spectator) combined with daily climate and phenology data to understand what drives variation across wine quality scores in major regions of the Western US, including regions in California, Oregon and Washington. We examined effects of variety, region, and in what phenological period climate was most predictive of quality. As in other studies, we found climate, based mainly on growing degree day (GDD) models, was generally associated with quality—with higher GDD associated with higher scores—but variety and region also had strong effects. Effects of region were generally stronger than variety. Certain varieties received the highest scores in only some areas, while other varieties (e.g., Merlot) generally scored lower across regions. Across phenological stages, GDD during budbreak was often most strongly associated with quality. Our results support other studies that warmer periods generally drive high quality wines, but highlight how much region and variety drive variation in scores outside of climate.

Optimizing stomatal traits for future climates

Stomatal traits determine grapevine water use, carbon supply, and water stress, which directly impact yield and berry chemistry. Breeding for stomatal traits has the strong potential to improve grapevine performance under future, drier conditions, but the trait values that breeders should target are unknown. We used a functional-structural plant model developed for grapevine (HydroShoot) to determine how stomatal traits impact canopy gas exchange, water potential, and temperature under historical and future conditions in high-quality and hot-climate California wine regions (Napa and the Central Valley). Historical climate (1990-2010) was collected from weather stations and future climate (2079-99) was projected from 4 representative climate models for California, assuming medium- and high-emissions (RCP 4.5 and 8.5). Five trait parameterizations, representing mean and extreme values for the maximum stomatal conductance (gmax) and leaf water potential threshold for stomatal closure (Ψsc), were defined from meta-analyses. Compared to mean trait values, the water-spending extremes (highest gmax or most negative Ysc) had negligible benefits for carbon gain and canopy cooling, but exacerbated vine water use and stress, for both sites and climate scenarios. These traits increased cumulative transpiration by 8 – 17%, changed cumulative carbon gain by -4 – 3%, and reduced minimum water potentials by 10 – 18%. Conversely, the water-saving extremes (lowest gmax or least negative Ψsc) strongly reduced water use and stress, but potentially compromised the carbon supply for ripening. Under RCP 8.5 conditions, these traits reduced transpiration by 22 – 35% and carbon gain by 9 – 16% and increased minimum water potentials by 20 – 28%, compared to mean values. Overall, selecting for more water-saving stomatal traits could improve water-use efficiency and avoid the detrimental effects of highly negative canopy water potentials on yield and quality, but more work is needed to evaluate whether these benefits outweigh the consequences of minor declines in carbon gain for fruit production.

Extreme canopy management for vineyard adaptation to climate change: is it a good idea?

Climate change constitutes an enormous challenge for humankind and for all human activities, viticulture not being an exception. Long-term strategic changes are probably needed the most, but growers also need to deal with short-term changes: summers that are getting progressively warmer, earlier harvest dates and higher pH in musts and wines. In the last 10-15 years, a relevant corpus of research is being developed worldwide in order to evaluate to which extent extreme canopy management operations, aimed at reducing leaf area and, thus, limiting the source to sink ratio, could be useful to delay ripening. Although extreme canopy management can result in relevant delays in harvest dates, longer term studies, as well as detailed analysis of their implications on carbohydrate reserves, bud fertility and future yield are desirable before these practices can be recommended.