Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account. They are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
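The three stages named above (segmentation along the retention time axis, decomposition of each segment, gradient boosted tree classification) can be illustrated with a minimal Python sketch. It is not the implementation described in [1, 2]: the data are synthetic, the function names, segment width and number of components are hypothetical, the segment transformation is omitted because the abstract does not specify it, and a truncated SVD of the unfolded segment is swapped in as a simple stand-in for the multiway decomposition. The classifier is scikit-learn's `GradientBoostingClassifier`.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier


def segment_chromatograms(data, segment_width):
    """Split a (samples, scans, m/z) array into windows along the scan axis."""
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def sample_scores(segment, n_components):
    """Stand-in for a multiway decomposition (assumption, not the authors' method):
    truncated SVD of the segment unfolded to (samples, scans * m/z)."""
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)
    unfolded = unfolded - unfolded.mean(axis=0)  # column-centre before SVD
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_table(data, segment_width=50, n_components=2):
    """Concatenate the per-segment sample scores into one feature table."""
    segments = segment_chromatograms(data, segment_width)
    return np.hstack([sample_scores(seg, n_components) for seg in segments])


# Synthetic example data: 20 samples, 200 scans, 30 m/z channels (all hypothetical).
rng = np.random.default_rng(0)
n_samples, n_scans, n_mz = 20, 200, 30
data = rng.random((n_samples, n_scans, n_mz))
labels = np.array([0, 1] * (n_samples // 2))

# Add a class-dependent Gaussian peak around scan 70 in m/z channel 5.
peak = np.exp(-0.5 * ((np.arange(n_scans) - 70) / 4.0) ** 2)
data[labels == 1, :, 5] += 5.0 * peak

# No peak picking and no retention time alignment: features come from segments only.
features = build_feature_table(data)
model = GradientBoostingClassifier(n_estimators=50, random_state=0)
model.fit(features, labels)
```

In this sketch each sample is described by a few scores per retention time window, so the classifier never sees individually detected or aligned peaks. With real data the model would be evaluated on held-out samples rather than on the training set.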

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
