Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Grapevine abiotic stress induce tolerance to bunch rot

Context. Botrytis bunch rot occurrence is the most important limitation for the wine industry in humid climate viticulture.

Grapevine varietal diversity as mitigation tool for climate change: Agronomic and oenologic potential of 14 foreign varieties grown in Languedoc region (France)

Climate change effects in Languedoc include an expected rise in temperatures, increased evapotranspiration as well as more severe and frequent climatic hazards, such as frost, drought periods and heat waves. For winegrowers theses phenomena impact both yield and quality, resulting in more frequent unbalanced wines. Research on identified mitigation tools for vineyard management is necessary to improve resilience of grapevine agrosystems. Varietal assortment is one of them. This study focuses on agronomic and oenologic potential of 14 foreign varieties grown in Languedoc French region. Fourteen grapevine varieties were monitored during 2021 from June until harvest on eight different sites, some of which occurring on more than one site adding up to 21 different modalities: 7 white varieties Alvarinho B, Assyrtiko B (2), Malvasia Istriana B, Parellada B, Verdejo B, Verdelho B, Xarello B, and 7 black varieties Saperavi N (2), Touriga nacional N, Baga N, Aleatico N, Montepulciano N (2), Primitivo N (3), Calabrese N (3). Varietals were compared through the following parameters: phenology was assessed by using the information collected in the Database Network of French Vine Conservatories (INRAE-SupAgro-IFV, 2005-2015). The number of inflorescences for shoots from secondary buds and bourillons and suckers were observed to assess post-bud break frost tolerance potential. Grapevine water status was studied through stem water potential measurement, observation of foliage symptoms of drought, and 𝛿13C on must. Frequencies and intensities of downy mildew, powdery mildew, and black rot attacks were estimated before harvest on leaves and clusters and botrytis at harvest to assess disease susceptibilities. Berry composition was monitored from end of veraison until harvest. Yield and mean bunch weight were also calculated. Varieties were then ranked on a 1-4 scale for each parameter and compared through PCA. Forty two stations of the Mediterranean basin were compared by PCA with the Multicriteria Climatic Classification indicators in order to confront the collected information during 2021 campaign to the hypothesis that plants coming from dry and hot regions are genetically adapted to such climatic conditions.

Study of the sensory dimension of the wine typicality related to a terroir and crossing with their viticultural and oenological characteristics

The typicality of a product can be characterized by properties of similarity in relation to a type, but also by the properties of distinction.

Viticultura protegida: uso de mallas sombreadoras fotoselectivas como una herramienta para enfrentar la crisis climática en uva de mesa en el norte de Chile

The production of table grapes in Chile is of great importance, being one of the main established fruit crops with over 43,000 hectares distributed across a diverse climate range, from the southern limit of the Atacama desert to the mediterranean zone. Chile is also one of the leading exporters of table grapes. producers must confront the challenges posed by the climate crisis, such as decreased rainfall, increased heatwaves, and extreme temperature events during the growing season, mainly associated with desertification in northern Chile (Atacama and Coquimbo regions).

Managing soil health in vineyards: knowns and unknowns 

The use of soil conservation practices in wine grape production is becoming common throughout the world in response to an increased awareness of the value of soil health to maintain crop productivity and environmental quality. However, little information is available on the meaning of soil health within a viticultural context, and what soil properties should be targeted to achieve both the agronomic and environmental goals of wine grape producers. Conservation practices lead to increases in soil organic matter which may improve soil water retention, and increase soil C content therefore constituting a potential avenue to adapt to droughts and sequester C. Well-known management practices such as the use of cover crops, compost or no-till, although effective, seem to result in highly variable outcomes in soil organic matter and other soil health indicators. This variability is likely associated to the application of the practices in different soils and climates. Thus, integration of soil health building practices needs a thorough understanding of their efficacy under different conditions. Furthermore, additions of soil organic matter could trigger emissions of CO2 and N2O, a potent greenhouse gas that could represent a potential tradeoff of soil conservation practices. Finally, nutrient and water availability may be affected by the increase in soil organic matter having consequences for vine balance and grape quality.