Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The evolution of italian vine nursery production over the past 30 years

Italy has a long history of viticulture and has become one of the world’s leading producers of vine propagation material. The Italian vine nursery industry is today highly qualified and has become highly competitive on a global scale. The quality of the material is guaranteed by compliance with European Union regulations, which have been in force since the second half of the 20th century and have subsequently been supplemented and updated.

Sensory changes in wines associated with the ripening of Grenache grapes from vineyards in different climatic zones

Climate change is introducing a high variability on grape ripening, causing uncertainty, excessive spending on pesticides and eventually frustrating results in terms of the quality of the vintage, with the increasingly frequent appearance of aromatic problems associated with overripeness, raisining and greenness, which sometimes only appear in bottled wines.

Somatic embryogenesis and polyploidy in grapevine: morphological shoot and leaf traits variations

Somatic embryogenesis (SE) has been used in a variety of biotechnology applications such as virus elimination, cryopreservation, induced mutagenesis and genetic transformation. The SE induction process may cause DNA alterations and ploidy changes, which may provide a source of genetic variability useful for the improvement of agronomic characteristics of plants. This research aims at investigating the spontaneous alterations of the genome in grapevine plants regenerated through SE. Regenerants obtained from different embryogenic events from three different grapevine genotypes (Catarratto, Frappato and Nero d’Avola) were analysed.

Effect of drought on grapevine wood fungal pathogen communities using a metatranscriptomics approach

Crops are facing increasing biotic and abiotic stress pressures due to global changes. However, trade-off mechanisms between these stresses and the underlying physiological processes are still poorly understood, especially in perennial crop species. To better understand these trade-offs, we studied the effect of drought on grapevine (Vitis vinifera) physiology and esca-related wood fungal communities. Esca is a vascular disease caused by a community of wood-infecting pathogenic fungi, and characterized by trunk necrosis, leaf scorch symptoms, yield losses, and mortality.

Vineyard soil mapping to optimise wine quality: from ‘terroir’ characterisation to vineyard management

In this study, a soil mapping methodology at subplot level (scale 1:5000) for vineyard soils was developed. The aim of this mapping method was to establish mapping units, which could be used as basic units for ‘terroir’ characterisation and vineyard management (precision viticulture).