Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Variations of soil attributes in vineyards influence their reflectance spectra

Knowledge on the reflectance spectrum of soil is potentially useful since it carries information on soil chemical composition that can be used to the planning of agricultural practices. If compared with analytical methods such as conventional chemical analysis, reflectance measurement provides non-destructive, economic, near real-time data. This paper reports results from reflectance measurements performed by spectroradiometry on soils from two vineyards in south Brazil. The vineyards are close to each other, are on different geological formations, but were subjected to the same management. The objective was to detect spectral differences between the two areas, correlating these differences to variations in their chemical composition, to assess the technique’s potential to predict soil attributes from reflectance data.To that end, soil samples were collected from ten selected vine parcels. Chemical analysis yield data on concentration of twenty-one soil attributes, and spectroradiometry was performed on samples. Chemical differences significant to a 95% confidence level between the two studied areas were found for six soil attributes, and the average reflectance spectra were separated by this same level along most of the observed spectral domain. Correlations between soil reflectance and concentrations of soil attributes were looked for, and for ten soil traits it was possible to define wavelength domains were reflectance and concentrations are correlated to confidence levels from 95% to 99%. Partial Least Squares Regression (PLSR) analyses were performed comparing measured and predicted concentrations, and for fifteen out of 21 soil traits we found Pearson correlation coefficients r > 0.8. These preliminary results, which have to be validated, suggest that variations of concentration in the investigated soil attributes induce differences in reflectance that can be detected by spectroradiometry. Applications of these observations include the assessment of the chemical content of soils by spectroradiometry as a fast, low-cost alternative to chemical analytical methods.

Impact of changes in pruning practices on vine growth and yield

A gradual decline in vineyards has been observed over the past twenty years worldwide. This might be explained by the climate change, practices change or the increase of dieback diseases. To increase the longevity of vines, we studied the impact of different pruning strategies in four adult and four young vineyards located in France and Spain. In France, vineyards were planted with Cabernet franc on 3309C while Spanish trials were planted with Tempranillo grafted on 110R. Vegetative expression, yield, quality of berries and wood vessels conductivity were measured. The distribution of vegetative expression, yield and berry composition between primary and secondary vegetation were quantified. Finally, tomography was used to evaluate the implication of the treatments on sap flows.
First results show that i) the respectful pruning leads to an increase of 30 to 50% more secondary shoots than the aggressive pruning in France and between 15 and 20% in Spain, ii) there is no major effect on the yield over the first two years following the implementation of the new pruning practices, although the proportion of clusters from suckers is higher on the respectful pruning method. On young vines, the development of the trunk according to a respectful pruning leads to a loss of harvest 2 years after planting. This is due to the removal, on the future trunk, of the green suckers which carrying bunches. This operation carried out in spring rather than during winter pruning, would promote a better leaf / fruit balance when the plant comes into production, and could lead to better hydraulic conduction in the vessels of the trunk. Maintaining these trials for several years will provide more robust data to assess the impact of these practices on the vines over the long term.

Protected Designation of Origin (D.P.O.) Valdepeñas: classification and map of soils

The objective of the work described here is the elaboration of a map of the different types of vineyard soils that to guide the famers in the choice of the most productive vine rootstocks and varieties. 90 vineyard soils profiles were analysed in the entire territory of the Origen Denominations of Valdepeñas. The sampling was carried out in 2018 (June to October) by making a sampling grid, followed by photointerpretation and control in the field. The studied soils can be grouped into 9 different soil types (according to FAO 2006 classification): Leptosols, Regosols, Fluvisols, Gleysols, Cambisols, Calcisols, Luvisols and Anthrosols. A map showing the soil distribution with different type of soils has been made with the ArcGIS program. Regarding to the choice of rootstock, Calcisoles are soils with a high active limestone content, so the rootstocks used in these soils must be resistant to this parameter; Luvisols are deep soils with high clay content, so they will support vigorous rootstocks. Because the cartographic units are composed of two or more subgroups, with are associated in variable proportions, 9 different soil associations have been established; Unit 1: Leptosols, Cambisols and Luvisols (80%, 15% and 5% respectively); Unit 2: Cambisols with Regosols and Luvisols (40%, 30% and 30% respectively); Unit 3: Cambisols and Gleysols with Regosols (40%, 40% and 20% respectively); Unit 4: Regosols with Cambisols, Leptosols and Calcisols (40%, 30%, 15% and 15% respectively); Unit 5: Cambisols, Leptosols, Calcisols and Regosols (25% each of them); Unit 6: Luvisols with Cambisol and Calcisols (80%, 10% and 10% respectively); Unit 7: Luvisols and Calcisols with Cambisols (40%, 40% and 20% respectively); Unit 8: Calcisols with, Cambisols and Luvisols (80%, 10% and 10% respectively); Unit 9: Anthrosols. These study allow to elaborate the first map of vineyard soils of this Protected Designation of Origin in Castilla-La Mancha.

From a local to an international scale: sensory benchmarking of PDO wines. Quincy and Reuilly PDO wines (Sauvignon blanc) as a case study (France)

In a collective marketing strategy, the Protected Designation of Origin (PDO) can be used as a quality indicator. To highlight terroir specificities, it is useful to know how the wines are positioned on the local, national or international market from a sensory point of view. This is especially true for a comparison of varietal wines (e.g. Sauvignon blanc). We focus on the case of two closed Loire Valley PDO (France): Quincy and Reuilly. Three distinct tastings were organized. Firstly, at the local level comparing the 2 PDO (11 and 9 wines, 17 professional assessors); secondly at a regional level adding 3 closed PDO: Menetou-Salon, Sancerre and Pouilly-Fumé (3 wines per PDO, 16 assessors) and thirdly at an international level comparing these 5 PDO with Sauvignon Blanc wines coming from South Africa, New Zealand and Chile (1 to 3 wines per PDO, 19 assessors). All the wines were from the 2019 vintage and were considered to have a traditional elaboration process without contact with oak. A sensory descriptive analysis was performed using an aroma wheel allowing to combine a Check-All-That-Apply methodology, often used in sensory benchmarking, with a hierarchical structuration of the attributes. The aim is to facilitate data acquisition in a professional context without common training, to consider the hierarchical relationships among the attributes during the data analysis and to be able to characterize wines with a large range of sensorial variability. We use univariate, multivariate and clustering analyses. Similarities and differences between Quincy and Reuilly PDO wines and other Sauvignon blanc wines were identified. Specific attributes can distinguish the two PDO and different proximities exist with other local PDO, while clear differences were observed compared to international wines. Our study contributes to propose and discuss a method to do a wine sensory benchmarking highlighting sensory specificities linked to origin.

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldomly thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Lose Rate (YLR). Additionally, we also investigate whether an advanced flowering-veraison phase may help alleviating the water stress with improved yield. A process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is being implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest wine regions with stronger flowering-veraison CWSI tend to have higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tends to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter possesses severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, which are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR. For those of France-Germany-Luxembourg, this can have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because of shifted phase towards a cooler growing season with reduced evaporative demands. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.