Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Rootstock effect on Cabernet Sauvignon aromatic and chemical composition

Grape quality potential for wine production is strongly influenced by environmental parameters and agronomic factors. Several studies underline the rootstock effect on scions vegetative growth and berry composition [1] with an impact on wine quality. Rootstocks are promising agronomic tools for climate change adaptation and in most grape-growing regions the potential diversity of rootstocks is not fully used and only a few genotypes are planted. Moreover, little is known about the effect of rootstock genetic variability on the aromatic composition in wines.

The start of Croatian grapevine breeding program

Modern viticulture in Croatia and the world is mainly based on the grapevine varieties susceptible to various diseases and pests, which leads to unsustainable use of large amounts of pesticides. The sustainable development of viticulture in the future will only be possible by increasing the resistance of the grapevine through the development of new resistant varieties. Breeding programs have been launched in the leading wine-growing countries with the aim of developing resistant varieties possessing high quality level. Coratia is rich in in native grapevine varieties that are the basis of wine production, and are not included in the breeding programs of other countries.

Cumulative effect (6 years) of deficit irrigation in two important cultivars of Douro region, Portugal

Numerous studies have demonstrated the importance of irrigation in improving the grape yield and quality in areas with arid and semiarid climates, particularly in the context of ongoing climate changes. However, the introduction of irrigation in vineyards of the Mediterranean basin is a matter of debate, in particular in those of the Douro Demarcated Region (DDR), due to the limited number of available studies in this region. The present study aimed to evaluate how different irrigation deficits for 6 years would influence production and must quality in Touriga Francesa (TF) and Touriga Nacional (TN) varieties.

Soil survey and chemical parameters evaluation in viticultural zoning

The most recent methodological developments in soil survey and land evaluation, that can be taken as reference in the viticultural field, go over usage of the GIS and database. These informatic tools, which begin to be widely utilised, consent to realise evaluations at different geographic scale and with different data quality and quantity in entrance.

How to improve the success of dead vine replacement: insights into the impacts of young plant‘s environment 

Grapevine faces multiple biotic and/or abiotic stresses, which are interrelated. Depending on their incidence, they can have a negative impact on the development and production of the plant, but also on its longevity, leading to vine dieback. One of the consequences of vine dieback on production is the increased replacement rate of dead or missing vines within a parcel.