Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

austrianvineyards.com: online viewer of all designations of Austrian wine

To digitally record and present all the origins of Austrian wines in the same perfect and clear way was the motivation for the Austrian Wine Marketing Board (Austrian Wine) to start with the project in 2018. In June 2021 the results were presented to the public in an online viewer showing all the designations of Austrian wine, available at https://austrianvineyards.com in a largely barrier-free manner. The online viewer provides tailored individual maps fitted to the respective zoom level. The smallest unit of wine-origins in Austria is called Ried and is displayed in a plot-specific manner highlighting areas under vine. Information on the Ried include administrative district, winegrowing municipality, cadastral municipality, large collective vineyard site, specific winegrowing region, generic winegrowing region, winegrowing area and, in many cases, an illustrative picture. Complementary data on the size, elevation (minimum-maximum), orientation (in 8 sectors plus flat) and gradient (minimum, maximum, average) are based on the area under vine according to the EU’s Integrated Administration and Control System. Additional information covers climate data. The diagrams are taken from the monthly breakdown of data in the annals of the Central Institute for Meteorology and Geodynamics, Austria provide a display of values for air temperature, precipitation, and sunshine hours for the reference year and the long-term average. Seasonal aggregated data on temperature, precipitation, and sunshine hours complete the display. Short descriptions with emphasis on geology and soil, field name in historical maps, etymology of the denomination, and main planted variety complements the available information for the main designations in the online viewer. These descriptions are compiled by winegrowers, geologists, historians, and journalists. All the information and data can be extracted to a pdf-file. Printed vineyard maps are also available. Missing content regarding wine origins in Styria will be completed in winter 2021/22.

The plantation frame as a measure of adaptation to climate change

The mechanization of vineyard work originally led to a reduction in planting densities due to the lack of machinery adapted to the vineyard. The current availability of specific machinery makes it possible to establish higher planting densities. In this work, three planting densities (1.40×0.80 m, 1.80×1 m and 2.20×1.20 m, corresponding to 8928, 5555 and 3787 plants/ha respectively) were studied with four varieties autochthonous of Galicia (northwestern Spain): Albariño and Treixadura (white), Sousón and Mencía (red). The vines were trained in a vertical shoot positioning system using a single Royat cordon, and pruned to spurs with two buds each. Agronomic data (yield, pruning wood weight, Ravaz index) and oenological data in must were collected. The higher planting density (1.40×0.80 m) had no significant effect on grape yield per vine in white varieties, although production per hectare was much higher due to the greater number of plants. In red varieties, this planting density resulted in a significantly lower production per vine, compensated by the greater number of plants. In addition, it significantly reduced the Brix degree in the must of the Albariño, Treixadura and Sousón varieties, and increased the total acidity in the latter two and Mencía. It also caused an increase in extractable and total anthocyanins and IPT in red grapes. The effects of high planting density on grapes are of great interest for the adaptation of varieties in the context of climate change. In the future, it could be advisable to modify the limits imposed by the appellations of origin on the planting density of these varieties in order to obtain more balanced wines.

Exploring resilience and competitiveness of wine estates in Languedoc-Roussillon in the recent past: a multi-level perspective

The Languedoc-Roussillon wineries are facing a decline in wine yields particularly PGI yields due to many factors. Climate change is just ones, but is expected to increase in the future. There is also structurally a large heterogeneity of yield profiles among terroirs, varieties and strategies. This work investigates the link between yield, competitiveness and resilience to explore how resilient winegrowers have been in the recent past. To this end two approaches have been combined; (i) an accountancy database analysis at estate scale and (ii) municipality level competitiveness analysis. A new resilience indicator that characterizes the capacity of an estate to absorb yield variation is also defined. The FADN database between 2000 and 2018 of ex-Languedoc-Roussillon (France) and other data are used to analyse the current situation and the past evolution of competitiveness and resilience by type of estate (type of farm: PGI and/or PDO & type of commercialization: bulk and/or bottles). The net margin, which defines competitiveness, is not correlated to yield for all types but depends on the type of commercialization and the level of specialisation. The resilience indicator shows that the net margin of estates specialized in PGI is particularly sensitive to yield declines. We also show that price evolutions seem to compensate the effect of yield losses for the majority of types. Municipality scale analysis shows the links between local pedoclimate, yield, commercialization strategies and price. Overlapping a PDO with a PGI does not always increase a municipality’s PGI competitiveness. It is difficult to make links between causes and effects due to the complexity of the wine production system. Production diversification may be a solution. Resorting to the two level of analysis helps resolving the data gap that is necessary to explore the links between yield and economic performance of the wine estates in the long term.

Revealing the Barossa zone sub-divisions through sensory and chemical analysis of Shiraz wine

The Barossa zone is arguably one of the most well-recognised wine producing regions in Australia and internationally; known mainly for the production of its distinct Shiraz wines. However, within the broad Barossa geographical delimitation, a variation in terroir can be perceived and is expressed as sensorial and chemical profile differences between wines. This study aimed to explore the sub-division classification across the Barossa region using chemical and sensory measurements. Shiraz grapes from 4 different vintages and different vineyards across the Barossa (2018, n = 69; 2019, n = 72; 2020, n = 79; 2021, n = 64) were harvested and made using a standardised small lot winemaking procedure. The analysis involved a sensory descriptive analysis with a highly trained panel and chemical measurement including basic chemistry (e.g. pH, TA, alcohol content, total SO2), phenolic composition, volatile compounds, metals, proline, and polysaccharides. The datasets were combined and analysed through an unsupervised, clustering analysis. Firstly, each vintage was considered separately to investigate any vintage to vintage variation. The datasets were then combined and analysed as a whole. The number of sub-divisions based on the measurements were identified and characterised with their sensory and chemical profile and some consistencies were seen between the vintages. Preliminary analysis of the sensory results showed that in most vintages, two major groups could be identified characterised with one group showing a fruit-forward profile and another displaying savoury and cooked vegetables characters. The exploration of distinct profiles arising from the Barossa wine producing region will provide producers with valuable information about the regional potential of their wine assisting with tools to increase their target market and reputation. This study will also provide a robust and comprehensive basis to determine the distinctive terroir characteristics which exist within the Barossa wine producing region.

Postveraison shoot trimming in Tannat and Merlot: preliminary results on yield components, plant balance and berry composition

There is currently a trend towards the production of wines with low alcohol content. To achieve this, grapes with low sugar content must be used. There are techniques at the vineyard level that can delay ripening and avoid excessive sugar accumulation without, a priori, affecting the final polyphenol content. Postveraison shoot trimming (PVST) is experimentally evaluated for these purposes, but its impact under Uruguayan climatic conditions with high interannual variability is not known. The aim of this work is to assess the PVST in Tannat and Merlot cultivars and their impact on yield components, plant balance and berry primary composition. In this study, two commercial vineyards of 10 years old Tannat and Merlot (grafted on SO4) at Canelones Department were selected. During the 2020-201 growing season, grapevines were submitted to PVST when grapes reached 15º Brix. In a randomized block, trimmed (T) and control (C) plants were evaluated with three repetitions each cultivar. Evaluation of the evolution of primary berry composition during ripening, measurement of yield components and plant balance were performed. For both cultivars, PVST did not affect yield components. Merlot reached 5.4 kg per plant and Tannat 7.1 kg, with not statistical significance between treatments. However, statistical differences were observed in terms of plant balance. In Merlot Ravaz Index reached a difference of 5.3 (12.0 in T and 6.7 in C) meanwhile Tannat reached 3.5 of statistical difference (13.7 in T and 10.2 in C). The tendency to imbalance for the treated plants had an impact on the final grape composition. Merlot grapes showed statistical difference in final total acidity (0.3 g of difference between treatments) while treatments impact final sugar content on Tannat grapes (10.0 g of difference between treatments). Further studies are needed to assess the impact of different canopy management techniques in our conditions.