Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
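
The segmentation step can be illustrated with a minimal sketch. It assumes the raw data are already arranged as a three-way array of shape (samples, scans, m/z channels). The function name segment_chromatograms, the synthetic data, and the choice of eight equal-width segments are hypothetical and are not taken from the published pipeline [1, 2]. Each resulting segment is itself a three-way array that could then be passed to a multiway decomposition.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into segments along the scan axis.

    Hypothetical helper for illustration only; the segment count and the
    equal-width split are assumptions, not the authors' settings.
    """
    if data.ndim != 3:
        raise ValueError("expected an array of shape (samples, scans, mz)")
    # np.array_split also handles scan counts not divisible by n_segments;
    # the leading segments then receive one extra scan each.
    return np.array_split(data, n_segments, axis=1)


# Synthetic stand-in for 6 samples, 1000 scans, and 50 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 1000, 50))

segments = segment_chromatograms(data, n_segments=8)
```

Because every segment keeps all samples and all m/z channels, no peak has to be detected or aligned across samples before the decomposition step.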

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
