Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Spatial determination of areas in the Western Balkans region favorable for organic production

In problematic conditions for production of grapes and wine caused by the COVID-19 pandemic and the resulting occurrence of wine surpluses, producers are increasingly turning to the innovative viticulture and winemaking of products that are more appealing to the market and the consumers. On the other hand, consumption of the food safety or organic products, and therefore of organic grapes and wine, is increasingly common in the world, in particular in Europe. The Regional Rural Development Standing Working Group (SWG RRD), as a regional intergovernmental organization gathers actors in the viticulture and winemaking sector from states and territories of the Western Balkans (South-East Europe) in the Expert Working Group for Wine, with the aim of improving viticulture and winemaking in this region through joint activities. In accordance with the aforementioned, the SWG RRD is working on advancing organic production of grapes and wine, and on recognition of specificities of the terroir of wine-growing areas in Western Balkans. In addition, as part of the project “Facilitation of Exchange and Advice on Wine Regulations in Western Balkan Countries” helmed by the German Federal Ministry of Food and Agriculture, in addition to harmonization of relevant legislation with EU regulations, efforts are being invested towards recognition of organic wines. Within activities and project implemented by this organization, expert analyses and scientific research of the terroir of Western Balkans were carried out, and some of the results are presented in this paper.

Different soil types and relief influence the quality of Merlot grapes in a relatively small area in the Vipava Valley (Slovenia) in relation to the vine water status

Besides location and microclimatic conditions, soil plays an important role in the quality of grapes and wine. Soil properties influence…

Exploring resilience and competitiveness of wine estates in Languedoc-Roussillon in the recent past: a multi-level perspective

The Languedoc-Roussillon wineries are facing a decline in wine yields particularly PGI yields due to many factors. Climate change is just ones, but is expected to increase in the future. There is also structurally a large heterogeneity of yield profiles among terroirs, varieties and strategies. This work investigates the link between yield, competitiveness and resilience to explore how resilient winegrowers have been in the recent past. To this end two approaches have been combined; (i) an accountancy database analysis at estate scale and (ii) municipality level competitiveness analysis. A new resilience indicator that characterizes the capacity of an estate to absorb yield variation is also defined. The FADN database between 2000 and 2018 of ex-Languedoc-Roussillon (France) and other data are used to analyse the current situation and the past evolution of competitiveness and resilience by type of estate (type of farm: PGI and/or PDO & type of commercialization: bulk and/or bottles). The net margin, which defines competitiveness, is not correlated to yield for all types but depends on the type of commercialization and the level of specialisation. The resilience indicator shows that the net margin of estates specialized in PGI is particularly sensitive to yield declines. We also show that price evolutions seem to compensate the effect of yield losses for the majority of types. Municipality scale analysis shows the links between local pedoclimate, yield, commercialization strategies and price. Overlapping a PDO with a PGI does not always increase a municipality’s PGI competitiveness. It is difficult to make links between causes and effects due to the complexity of the wine production system. Production diversification may be a solution. Resorting to the two level of analysis helps resolving the data gap that is necessary to explore the links between yield and economic performance of the wine estates in the long term.

Under-vine management effects on grapevine production, soil properties and plant communities in South Australia

Under-vine (UV) management has traditionally consisted of synthetic herbicide use to limit competition between weeds and grapevines. With growing global interest towards non-synthetic chemical use, this study aimed to capture the effects of alternative UV management at two commercial Shiraz vineyards in South Australia, where the sole management variables were UV management since 2016. In adjacent treatment blocks, cultivation (CU) was compared to spontaneous vegetation (SV) in McLaren Vale (MV), and herbicide was compared to SV in Eden Valley (EV). Soil water infiltration rates were slower and grapevine stem water potential was lower in CU compared to SV in MV, with the latter having a plant community dominated by soursob (Oxalis pes-caprae) during winter; while in EV, there was little separation between the treatments. Yields were affected at both sites, with SV being higher in MV and HE being higher in EV. In MV, the only effect on grape must was a lower 13C:12C isotope ratio in CU, indicating greater grapevine water stress. In the grape must at EV, SV had higher total soluble solids, total phenolics, anthocyanins, and yeast available nitrogen; and lower pH and titratable acidity. Pruning weights were not affected by the treatments in MV, while they were higher in HE at EV. Assessments revealed that the differing soil types at the two sites were likely the main determinants of the opposing production outcomes associated with UV management. In the silty loam soil of MV, the higher yields in SV were likely due to more plant-available water, as a potential result of the continuous soil bio-pores formed by winter UV vegetation. Conversely, in the loamy sand soils of EV with a lower cation exchange capacity, the lower yields and pruning weights in SV suggest the UV vegetation competed significantly with the grapevines for available water and nutrients.

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions leading to grapevine phenological shifts as well altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening revealing altered traits in time to target specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies with forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in DOY to target specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series monitored at ground weather station by means of hourly air temperature and rainfall data were analyzed. Climate statistics were obtained and for each time period (month, bimester, quarter and year) weather anomalies were identified. A linear regression analysis was performed to assess a possible correlation that may exist between ripening and weather anomalies. For each cultivar, ripeness advances or delays expressed in number of days to target the specific ripening threshold were assessed in relation to registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and spring quarter are key to understanding the effect of climate change on sugar ripeness. Minimum temperatures of May-June bimester and maximum temperatures of spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.