Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
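The segmentation step can be illustrated with a minimal sketch. It is not the published implementation: the function names, the fixed window width, and the optional overlap are assumptions chosen for illustration, and the subsequent transformation, multiway decomposition, and classification steps are not shown. Each sample is assumed to be a scans × m/z intensity matrix, with all samples sharing the same number of scans.

```python
from typing import List, Tuple

# One sample: rows are scans (retention time axis), columns are m/z channels.
Chromatogram = List[List[float]]


def segment_bounds(n_scans: int, width: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Return (start, stop) scan indices of windows along the retention time axis.

    `width` and `overlap` are hypothetical parameters for this sketch.
    The last window is shortened if the scans do not divide evenly.
    """
    if width <= 0 or not 0 <= overlap < width:
        raise ValueError("need width > 0 and 0 <= overlap < width")
    step = width - overlap
    bounds: List[Tuple[int, int]] = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_samples(
    samples: List[Chromatogram], width: int, overlap: int = 0
) -> List[List[Chromatogram]]:
    """Cut every sample at the same scan indices.

    Returns one entry per segment; each entry is a three-way array
    (samples x scans x m/z) stored as nested lists.
    """
    if not samples:
        return []
    n_scans = len(samples[0])
    if any(len(sample) != n_scans for sample in samples):
        raise ValueError("all samples must have the same number of scans")
    return [
        [sample[start:stop] for sample in samples]
        for start, stop in segment_bounds(n_scans, width, overlap)
    ]


# Toy data: 2 samples, 5 scans, 2 m/z channels.
toy_samples = [
    [[float(scan + channel) for channel in range(2)] for scan in range(5)]
    for _ in range(2)
]
toy_segments = segment_samples(toy_samples, width=2)
```

Because every sample is cut at identical scan indices, each segment forms a small three-way array that could be passed to a multiway decomposition on its own, which is how the approach sidesteps peak-level feature detection and alignment across the whole chromatogram.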

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
