Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Downscaling of remote sensing time series: thermal zone classification approach in Gironde region

In viticulture, the challenges of local climate modelling are multiple: taking into account the local environment, fine temporal and spatial scales, reliable time series of climate data, ease of implementation and reproducibility of the method. At the local scale, recent studies have demonstrated the contribution of spatialization methods for ground-based climate observation data considering topographic factors such as altitude, slope, aspect, and geographic coordinates (Le Roux et al, 2017; De Rességuier et al, 2020). However, these studies have shown questions in terms of the reproducibility and sustainability of this type of climate study. In this context, we evaluated the potential of MODIS thermal satellite images validated with ground-based climate data (Morin et al, 2020). Previous studies have been encouraging, but questions remain to be explored at the regional scale, particularly in the dynamics of the massive use of bioclimatic indices to classify the climate of wine regions. The results at the local scale were encouraging, but this approach was tested in the current study at the regional scale. Several objectives were set: 1) to evaluate the downscaling method for land surface temperature time series, 2) to identify regional thermal structure variations. We used weekly minimum and maximum surface temperature time series acquired by MODIS satellites at a spatial resolution of 1000 m and downscaled at 500 m using topographical variables. Two types of analyses were performed:

Under-vine management effects on grapevine production, soil properties and plant communities in South Australia

Under-vine (UV) management has traditionally consisted of synthetic herbicide use to limit competition between weeds and grapevines. With growing global interest towards non-synthetic chemical use, this study aimed to capture the effects of alternative UV management at two commercial Shiraz vineyards in South Australia, where the sole management variables were UV management since 2016. In adjacent treatment blocks, cultivation (CU) was compared to spontaneous vegetation (SV) in McLaren Vale (MV), and herbicide was compared to SV in Eden Valley (EV). Soil water infiltration rates were slower and grapevine stem water potential was lower in CU compared to SV in MV, with the latter having a plant community dominated by soursob (Oxalis pes-caprae) during winter; while in EV, there was little separation between the treatments. Yields were affected at both sites, with SV being higher in MV and HE being higher in EV. In MV, the only effect on grape must was a lower 13C:12C isotope ratio in CU, indicating greater grapevine water stress. In the grape must at EV, SV had higher total soluble solids, total phenolics, anthocyanins, and yeast available nitrogen; and lower pH and titratable acidity. Pruning weights were not affected by the treatments in MV, while they were higher in HE at EV. Assessments revealed that the differing soil types at the two sites were likely the main determinants of the opposing production outcomes associated with UV management. In the silty loam soil of MV, the higher yields in SV were likely due to more plant-available water, as a potential result of the continuous soil bio-pores formed by winter UV vegetation. Conversely, in the loamy sand soils of EV with a lower cation exchange capacity, the lower yields and pruning weights in SV suggest the UV vegetation competed significantly with the grapevines for available water and nutrients.

From a local to an international scale: sensory benchmarking of PDO wines. Quincy and Reuilly PDO wines (Sauvignon blanc) as a case study (France)

In a collective marketing strategy, the Protected Designation of Origin (PDO) can be used as a quality indicator. To highlight terroir specificities, it is useful to know how the wines are positioned on the local, national or international market from a sensory point of view. This is especially true for a comparison of varietal wines (e.g. Sauvignon blanc). We focus on the case of two closed Loire Valley PDO (France): Quincy and Reuilly. Three distinct tastings were organized. Firstly, at the local level comparing the 2 PDO (11 and 9 wines, 17 professional assessors); secondly at a regional level adding 3 closed PDO: Menetou-Salon, Sancerre and Pouilly-Fumé (3 wines per PDO, 16 assessors) and thirdly at an international level comparing these 5 PDO with Sauvignon Blanc wines coming from South Africa, New Zealand and Chile (1 to 3 wines per PDO, 19 assessors). All the wines were from the 2019 vintage and were considered to have a traditional elaboration process without contact with oak. A sensory descriptive analysis was performed using an aroma wheel allowing to combine a Check-All-That-Apply methodology, often used in sensory benchmarking, with a hierarchical structuration of the attributes. The aim is to facilitate data acquisition in a professional context without common training, to consider the hierarchical relationships among the attributes during the data analysis and to be able to characterize wines with a large range of sensorial variability. We use univariate, multivariate and clustering analyses. Similarities and differences between Quincy and Reuilly PDO wines and other Sauvignon blanc wines were identified. Specific attributes can distinguish the two PDO and different proximities exist with other local PDO, while clear differences were observed compared to international wines. Our study contributes to propose and discuss a method to do a wine sensory benchmarking highlighting sensory specificities linked to origin.

The impact of leaf canopy management on eco-physiology, wood chemical properties and microbial communities in root, trunk and cordon of Riesling grapevines (Vitis vinifera L.)

In the last decades, climate change required already adaptation of vineyard management. Increase in temperature and unexpected weather events cause changes in all phenological stages requiring new management tools. For example, defoliation can be a useful tool to reduce the sugar content in the berries creating differences in the wine profiles. In a ten-year field experiment using Riesling (Vitis vinifera L, planted 1986, Geisenheim, Germany), various mechanical defoliation strategies and different intensities were trialed until 2016 before the vineyard was uprooted. Wood was sampled from the plant compartments root, trunk, cordon and shoot for analyses of physicochemical properties (e.g. lignin and element content, pH, diameter), nonstructural carbohydrates and the microbial communities. The aim of the study was to investigate the influence of reduced canopy leaf area on the sink-source allocation into different compartments and potential changes of the fungal and prokaryotic wood-inhabiting community using a metabarcoding approach. Severe summer pruning (SSP) of the canopy and mechanical defoliation (MDC) above the bunch zone decreased the leaf area by 50% compared to control (C). SSP reduced the photosynthetic capacity, which resulted in an altered source-sink allocation and carbohydrate storage. With lower leaf area, less carbohydrates are allocated. This for example resulted in a decreased trunk diameter. Further, it affected the composition of the grapevine wood microbiota. SSP and MDC management changed significantly the prokaryotic community composition in wood of the root samples, but had no effect in other compartments. In general, this study found strong compartment and less management effects of the microbial community composition and associated physicochemical properties. The highest microbial diversities were identified in the wood of the trunk, and several species were recorded the first time in grapevine.

Spatial variability of temperature is linked to grape composition variability in the Saint-Emilion winegrowing area

Elevated temperature during the grape maturation period is a major threat for grape quality and thus wine quality. Therefore, characterizing the grape composition response to temperature at a larger scale would represent a crucial step towards adaptation to climate change. In response to changes in temperature, various physiological mechanisms regulate grape composition. Primary and secondary metabolisms are both involved in this response, with well-known effects, for example on anthocyanins, and lesser known effects, for example on aromas or aroma precursors. At the field scale or at the regional scale, however, numerous environmental or plant-specific factors intervene to make the effects of temperature difficult to distinguish from overall variability. In this study, it was attempted to overcome this difficulty by selecting well-characterized situations with differing temperatures.
A long-term study of air temperature variability across several Merlot vineyards in the Saint-Emilion and Pomerol wine producing area found significant temperature differences and gradients at various time scales linked to environmental factors. From this study area, a few sites were selected with similar age, soil and training system conditions, and with repeated and contrasted temperature differences during the maturation period. The average temperature difference during the maturation period was about 2°C between cooler and warmer sites, a difference similar to that expected under future climate change scenarios. In close vicinity to the temperature sensors at each site, grape berries were sampled at different times until full maturity during 2019 and 2020. Also, berries from bunches on either side of the row were analyzed separately, allowing an investigation of bunch exposure effect associated with the coupling of berry temperature and solar radiation. Four replicates of pooled berries for each time – site – bunch exposure combination were obtained and analyzed for biochemical composition. Analyses of variance of the biochemical composition data collected at different sampling times reveal significant effects associated with temperature, site, and bunch azimuth. For instance, anthocyanins in grape skins are clearly influenced by temperature and solar radiation exposure, with up to 30% reduction in warmer conditions.