Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

The impact of leaf canopy management on eco-physiology, wood chemical properties and microbial communities in root, trunk and cordon of Riesling grapevines (Vitis vinifera L.)

In the last decades, climate change required already adaptation of vineyard management. Increase in temperature and unexpected weather events cause changes in all phenological stages requiring new management tools. For example, defoliation can be a useful tool to reduce the sugar content in the berries creating differences in the wine profiles. In a ten-year field experiment using Riesling (Vitis vinifera L, planted 1986, Geisenheim, Germany), various mechanical defoliation strategies and different intensities were trialed until 2016 before the vineyard was uprooted. Wood was sampled from the plant compartments root, trunk, cordon and shoot for analyses of physicochemical properties (e.g. lignin and element content, pH, diameter), nonstructural carbohydrates and the microbial communities. The aim of the study was to investigate the influence of reduced canopy leaf area on the sink-source allocation into different compartments and potential changes of the fungal and prokaryotic wood-inhabiting community using a metabarcoding approach. Severe summer pruning (SSP) of the canopy and mechanical defoliation (MDC) above the bunch zone decreased the leaf area by 50% compared to control (C). SSP reduced the photosynthetic capacity, which resulted in an altered source-sink allocation and carbohydrate storage. With lower leaf area, less carbohydrates are allocated. This for example resulted in a decreased trunk diameter. Further, it affected the composition of the grapevine wood microbiota. SSP and MDC management changed significantly the prokaryotic community composition in wood of the root samples, but had no effect in other compartments. In general, this study found strong compartment and less management effects of the microbial community composition and associated physicochemical properties. The highest microbial diversities were identified in the wood of the trunk, and several species were recorded the first time in grapevine.

Photoselective shade films affect grapevine berry secondary metabolism and wine composition

Grapevine physiology and production are challenged by forecasted increases in temperature and water deficits. Within this scenario, photoselective overhead shade films are promising tools in warm viticulture areas to overcome climate change related factors. The aim of this study was to evaluate the vulnerability of ‘Cabernet Sauvignon’ grape berry to solar radiation overexposure and optimize shade film use for berry integrity. A randomized complete block design field study was conducted across two years (2020-2021) in Oakville, Napa Valley, CA, with four shade films (D1, D3, D4, D5) differing in the percent of radiation spectra transmitted and compared to an uncovered control (C0). Integrals for gas exchange parameters and mid-day stem water potential were unaffected by the shade films in 2020 and 2021. By harvest, berries from uncovered and shaded vines did not differ in their size or primary metabolism in either year. Despite precipitation exclusion during the dormant season in the shaded treatments, yield did not differ between them and the control in either season. In 2020, total skin anthocyanins (mg/g fresh mass) in the shaded treatments was greater than C0 during berry ripening and at harvest. Conversely, flavonol concentrations in 2020 were reduced in shaded vines compared to C0. The 2020 growing season highlighted the impact of heat degradation on flavonoids. Flavonoid concentrations in 2021 increased until harvest while flavonoid degradation was apparent from veraison to harvest in 2020 across shaded and control vines. Wine analyses highlighted the importance of light spectra to modify wine composition. Wine color intensity, tonality and anthocyanin values were enhanced in D4 whereas antioxidant properties were enhanced in C0 and D5 wines. Altogether, our results highlighted the need of new approaches in warm viticulture areas given the impact that composition of light has on berry and wine quality.

Exploring resilience and competitiveness of wine estates in Languedoc-Roussillon in the recent past: a multi-level perspective

The Languedoc-Roussillon wineries are facing a decline in wine yields particularly PGI yields due to many factors. Climate change is just ones, but is expected to increase in the future. There is also structurally a large heterogeneity of yield profiles among terroirs, varieties and strategies. This work investigates the link between yield, competitiveness and resilience to explore how resilient winegrowers have been in the recent past. To this end two approaches have been combined; (i) an accountancy database analysis at estate scale and (ii) municipality level competitiveness analysis. A new resilience indicator that characterizes the capacity of an estate to absorb yield variation is also defined. The FADN database between 2000 and 2018 of ex-Languedoc-Roussillon (France) and other data are used to analyse the current situation and the past evolution of competitiveness and resilience by type of estate (type of farm: PGI and/or PDO & type of commercialization: bulk and/or bottles). The net margin, which defines competitiveness, is not correlated to yield for all types but depends on the type of commercialization and the level of specialisation. The resilience indicator shows that the net margin of estates specialized in PGI is particularly sensitive to yield declines. We also show that price evolutions seem to compensate the effect of yield losses for the majority of types. Municipality scale analysis shows the links between local pedoclimate, yield, commercialization strategies and price. Overlapping a PDO with a PGI does not always increase a municipality’s PGI competitiveness. It is difficult to make links between causes and effects due to the complexity of the wine production system. Production diversification may be a solution. Resorting to the two level of analysis helps resolving the data gap that is necessary to explore the links between yield and economic performance of the wine estates in the long term.

Assessing the climate change vulnerability of European winegrowing regions by combining exposure, sensitivity and adaptive capacity indicators

Winegrowing regions recognized as protected designations of origin (PDOs) are closely tied to well defined geographic locations with a specific set of pedoclimatic attributes and strictly regulated by legal specifications. However, climate change is increasingly threatening these regions by changing local conditions and altering winegrowing processes. The vulnerability to these changes is largely heterogenous across different winegrowing regions because it is determined by individual characteristics of each region, including the capacity to adapt to new climatic conditions and the sensitivity to climate change, which depend not only on natural, but also socioeconomic and legal factors. Accurate vulnerability assessments therefore need to combine information about adaptive capacity and climate change sensitivity with projected exposure to new climatic conditions. However, most existing studies focus on specific impacts neglecting important interactions between the different factors that determine climate change vulnerability. Here, we present the first comprehensive vulnerability assessment of European wine PDOs that spatially combines multiple indicators of adaptive capacity and climate change sensitivity with high-resolution climate projections. We found that the climate change vulnerability of PDO areas largely depends on the complex interactions between physical and socioeconomic factors. Homogenous topographic conditions and a narrow varietal spectrum increase climate change vulnerability, while the skills and education of farmers, together with a good economic situation, decrease their vulnerability. Assessments of climate change consequences therefore need to consider multiple variables as well as their interrelations to provide a comprehensive understanding of the expected impacts of climate change on European PDOs. Our results provide the first vulnerability assessment for European winegrowing regions at high spatiotemporal resolution that includes multiple factors related to climate exposure, sensitivity, and adaptive capacity on the level of single winegrowing regions. They will therefore help to identify hot spots of climate change vulnerability among European PDOs and efficiently direct adaptation strategies.

Vineyards and clay minerals: multi-technique analytical approach and correlations with soil properties

Purpose of this research is to quantitatively assess the mineral component of vineyard soils, with particular attention to the mineralogical analysis of clays, which represent an element of high importance in the vineyard culture as well as in general agriculture. An X-ray diffraction (XRD) / thermogravimetric (TG) multi-technique analytical approach was developed, tested on soil samples taken from vineyards around the world. This codified analytical procedure was necessary to obtain precise qualitative and quantitative mineralogical data, globally comparable to distinguish the geopedological identity of the vineyards. Soil samples from vineyards of various locations were analysed, in very different geological conditions. The bulk-rock quantitative phase analysis (QPA) was obtained by the Rietveld method while the detailed composition of the clay-sized fraction was determined by modelling of the oriented X-ray diffraction patterns. The research provided a precise classification of the mineral component of soils, distinguishing the mineral phases of the clays and the so-called mixed-layer clay minerals. We found that the content in mixed layers can be directly correlated with the water retention and the cation exchange capacity ​​of the soil, while the presence of other clayey minerals and phyllosilicates in this research did not affect this CEC parameter, which codes the fertility level of the soils. The study demonstrates that terroir, in particular soils formed in complex or very different geological conditions, can only be effectively interpreted by properly analysing its mineral phases, in particular the mixed-layer clay component. These are characteristic abiotic ecological indicators, which may have specific eco-physiological influences on the plant.