Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach consists of segmentation of the chromatograms along the retention time axis, multiway decomposition of the transformed segments, and a supervised machine learning pipeline based on gradient boosted tree classification applied to the decomposed tensor [1, 2].
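The segmentation step can be illustrated with a minimal sketch. It is not the authors' implementation: the fixed, non-overlapping window width, the function names `segment_bounds` and `segment_samples`, and the toy data are assumptions made for illustration only, and the decomposition and classification steps are left out. The sketch cuts every sample at the same scan indices, so that each segment forms a small three-way array (samples x scans x m/z) of the kind a multiway decomposition could take as input.

```python
from typing import List, Tuple

# One chromatogram: a list of scans, each scan a list of m/z channel intensities.
Chromatogram = List[List[float]]


def segment_bounds(n_scans: int, width: int) -> List[Tuple[int, int]]:
    """Return (start, stop) scan indices of consecutive, non-overlapping segments.

    The last segment is shorter when n_scans is not a multiple of width.
    The fixed width is an assumption of this sketch.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    return [(start, min(start + width, n_scans))
            for start in range(0, n_scans, width)]


def segment_samples(samples: List[Chromatogram],
                    width: int) -> List[List[Chromatogram]]:
    """Cut every sample at the same scan indices.

    Returns one entry per segment; each entry holds that segment's
    scans x m/z slice for every sample (samples x scans x m/z).
    """
    n_scans = len(samples[0])
    if any(len(sample) != n_scans for sample in samples):
        raise ValueError("all samples must have the same number of scans")
    return [[sample[a:b] for sample in samples]
            for a, b in segment_bounds(n_scans, width)]


# Toy data: 2 samples, 5 scans, 3 m/z channels (arbitrary values).
samples = [
    [[float(10 * k + i + j) for j in range(3)] for i in range(5)]
    for k in range(2)
]
segments = segment_samples(samples, width=2)
```

Because all samples are cut at identical scan indices, no peak-wise retention time alignment is needed at this stage; small retention time shifts stay inside a segment and would have to be handled by the subsequent decomposition.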

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. The application will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
