Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The concept of terroir: what place for microbiota?

Microbes play key roles on crop nutrient availability via biogeochemical cycles, rhizosphere interactions with roots as well as on plant growth and health. Recent advances in technologies, such as High Throughput Sequencing Techniques, allowed to gain deeper insight on the structure of bacterial and fungal communities associated with soil, rhizosphere and plant phyllosphere. Over the past 10 years, numerous scientific studies have been carried out on the microbial component of the vineyard. Whether the soil or grape compartments have been taken into account, many studies agree on the evidence of regional delineations of microbial communities, that may contribute to regional wine characteristics and typicity. Some authors proposed the term “microbial terroir” including “yeast terroir” for grapes to describe the connection between microbial biogeography and regional wine characteristics. Many factors are involved in terroir including climate, soil, cultivar and human practices as well as their interactions. Studies considering “microbial terroir” greatly contributed to improve our knowledge on factors that shape the vineyard microbial structure and diversity. However, the potential impact of “microbial terroir” on wine composition has yet not received strong scientific evidence and many questions remain to be addressed, related to the functional characterization of the microbial community and its impact on plant physiology and grape composition, the origins and interannual stability of vineyard microbiota, as well as their impact on wine sensorial attributes. The presentation will give an overview on the role of microbiota as a terroir component and will highlight future perspectives and challenges on this key subject for the wine industry.

Assessing the relationship between cordon strangulation, dieback, and fungal trunk disease symptom expression

Grapevine trunk diseases including Eutypa dieback are a major factor in the decline of vineyards and may lead to loss of productivity, reduced income, and premature reworking or replanting. Several studies have yielded results indicating that vines may be more likely to express symptoms of vascular disease if their health is already compromised by stress. In Australia and many other wine-growing regions it is a common practice for canes to be wrapped tightly around the cordon wire during the establishment of permanent cordon arms. It is likely that this practice may have a negative effect on health and longevity, as older cordons that have been trained in this manner often display signs of decay and dieback, with the wire often visibly embedded within the wood of the cordon. It is possible that adopting a training method which avoids constriction of the vasculature of the cordon may help to limit the onset of vascular disease symptom expression. A survey was conducted during the spring of two consecutive growing seasons on vineyards in South Australia displaying symptoms of Eutypa lata infection when symptomless shoots were 50–100 cm long. Vines were assessed as follows: (i) the proportion of cordon exhibiting dieback was rated using a 0–100% scale; (ii) the proportion of canopy exhibiting foliar symptoms of Eutypa dieback was rated using a 0–100% scale; (iii) the severity of strangulation was rated using a 0–4 point scale. Images were also taken of each vine for the purpose of measuring plant area index (PAI) using the VitiCanopy App. The goal of the survey was to determine if and to what extent any correlation exists between severity of strangulation and cordon dieback, in addition to Eutypa dieback foliar symptom expression.

Updating the Winkler index: An analysis of Cabernet sauvignon in Napa Valley’s varied and changing climate

This study aims to create an updated, agile viticultural climate index (similar to the Winkler Index) by performing in-depth analyses of current and historical data from industry partners in several major winegrowing regions. The Winkler Index was developed in the early twentieth century based on analysis of various grape-growing regions in California. The index uses heat accumulation (i.e. Growing Degree Days) throughout the growing season to determine which grape varieties are best suited to each region. As viticultural regions are increasingly subject to the complexity and uncertainty of a changing climate, a more rigorous, agile model is needed to aid grape growers in determining which cultivars to plant where. For the first phase of this study, 21 industry partners throughout Napa Valley shared historical phenology, harvest, viticultural practice, and weather data related to their Cabernet sauvignon vineyard blocks. To complement this data, berry samples were collected throughout the 2021 growing season from 50 vineyard blocks located throughout 16 American Viticultural Areas that were then analyzed for basic berry chemistry and phenolics. These blocks have been mapped using a Geographic Information System (GIS), enabling analysis of altitude, vineyard row orientation, slope, and remotely sensed climate data. Sampling sites were also chosen based on their proximity to a weather station. By analyzing historical data from industry partners and data specifically collected for this study, it is possible to identify key parameters for further analysis. Initial results indicate extreme variability at a high spatial resolution not currently accounted for in modern viticultural climate indices and suggest that viticultural practices play a major role. Using the structure of data collection and analyses developed for the first phase, this project will soon be expanded to other wine regions globally, while continuing data collection in Napa Valley.

Revealing the Barossa zone sub-divisions through sensory and chemical analysis of Shiraz wine

The Barossa zone is arguably one of the most well-recognised wine producing regions in Australia and internationally; known mainly for the production of its distinct Shiraz wines. However, within the broad Barossa geographical delimitation, a variation in terroir can be perceived and is expressed as sensorial and chemical profile differences between wines. This study aimed to explore the sub-division classification across the Barossa region using chemical and sensory measurements. Shiraz grapes from 4 different vintages and different vineyards across the Barossa (2018, n = 69; 2019, n = 72; 2020, n = 79; 2021, n = 64) were harvested and made using a standardised small lot winemaking procedure. The analysis involved a sensory descriptive analysis with a highly trained panel and chemical measurement including basic chemistry (e.g. pH, TA, alcohol content, total SO2), phenolic composition, volatile compounds, metals, proline, and polysaccharides. The datasets were combined and analysed through an unsupervised, clustering analysis. Firstly, each vintage was considered separately to investigate any vintage to vintage variation. The datasets were then combined and analysed as a whole. The number of sub-divisions based on the measurements were identified and characterised with their sensory and chemical profile and some consistencies were seen between the vintages. Preliminary analysis of the sensory results showed that in most vintages, two major groups could be identified characterised with one group showing a fruit-forward profile and another displaying savoury and cooked vegetables characters. The exploration of distinct profiles arising from the Barossa wine producing region will provide producers with valuable information about the regional potential of their wine assisting with tools to increase their target market and reputation. This study will also provide a robust and comprehensive basis to determine the distinctive terroir characteristics which exist within the Barossa wine producing region.

Modeling the suitability of Pinot Noir in Oregon’s Willamette Valley in a changing climate

Air temperature is the key driver of grapevine phenology and a significant environmental factor impacting yield and quality for a winegrape growing region. In this study the optimal downscaled CMIP5 ensemble for computing thegrowing season average temperature (GST) viticulture climate classification index was determined to spatially compute on a decadal basis predictions of the GST climate index and the grapevine sugar ripeness (GSR) model for Pinot Noir throughout the Willamette Valley (WV) American Viticultural Area (AVA). Forecasts for average temperature and a 220 g/L target sugar concentration level were computed using daily Localized Constructed Analogs (LOCA) downscaled CMIP5 historic and Representative Concentration Pathways (RCP) future climate projections of minimum and maximum daily temperature. We explore spatiotemporal trends of the GST climate classification index and Pinot Noir specific applications of the GSR phenology model for the WV AVA. Spatiotemporal computations of the GST climate index and Pinot Noir specific applications of the GSR model enable the opportunity to explore relationships between their computed values with one intent being to provide updated GST ranges that better align with current temperature-based modeling understanding of Pinot Noir grapevine phenology and the viticultural application of LOCA CMIP5 climate projections for the WV AVA. The Pinot Noir specific applications of the GSR model or the GST index with updated bounds indicate that the percent of the WV AVA area suitable for Pinot Noir production is currently at or near its peak value in the upper 80s to lower 90s of this century.