Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Biodiversity in the vineyard agroecosystem: exploring systemic approaches

Biodiversity conservation and restoration are essential for guarantee the provision of ecosystem services associated to vineyard agroecosystem such as climate regulation trough carbon sequestration and control of pests and diseases. Most of published research dealing with the complexity of the vineyard agroecosystems emphasizes the necessity of innovative approaches, including the integration of information at different temporal and spatial scales and development of systemic analysis based on modelling. A biodiversity survey was conducted in the Franciacorta wine-growing area (Lombardy, Italy), one of the most important Italian wine-growing regions for sparkling wine production, considering a portion of the territory of 112 ha. The area was divided into several Environmental Units (EUs), defined as a whole vineyard or portion of vineyard homogenous in terms of four agronomic characteristics: planting year, planting density, cultivar, and training system. In each EU a set of compartments was identified and characterised by specific variables. The compartments are meteorology, morphology (altitude, slope, aspect, row orientation, and solar irradiance), ecological infrastructures and management. The landscape surrounding EU was also characterised in terms of land-use in a buffer zone of 500 m. For each component a specific methodology was identified and applied. Different statistical approaches were used to evaluate the method to integrate the information related to different compartments within the EU and related to the buffer zone. These approaches were also preliminarily evaluated for their ability to describe the contribution of biodiversity and landscape components to ecosystem services. This methodological exploration provides useful indication for the development of a fully systemic approach to structural and functional biodiversity in vineyard agroecosystems, contributing to promote a multifunctional perspective for the all wine-growing sector.

Aromatic maturity is a cornerstone of terroir expression in red wine

Harvesting grapes at adequate maturity is key to the production of high-quality red wines. Enologists and wine makers define several types of maturity, including technical maturity, phenolic maturity and aromatic maturity. Technical maturity and phenolic maturity are relatively well documented in the scientific literature, while articles on aromatic maturity are scarcer. This is surprising, because aromatic maturity is, without a doubt, the most important of the three in determining wine quality and typicity (including terroir expression). Optimal terroir expression can be obtained when the different types of maturity are reached at the same time, or within a short time frame. This is more likely to occur when the ripening takes place under mild temperatures, neither too cool, nor too hot. Aromatic expression in wine can be driven, from low to high maturity, by green, herbal, fresh fruit, ripe fruit, jammy fruit, candied fruit or cooked fruit aromas. Green and cooked fruit aromas are not desirable in red wines, while the levels of other aromatic compounds contribute to the typicity of the wine in relation to its origin. Wines produced in cool climates, or on cool soils in temperate climates, are likely to express herbal or fresh fruit aromas; while wines produced under warm climates, or on warm soils in temperate climates, may express ripe fruit, jammy fruit or candied fruit aromas. Growers can optimize terroir expression through their choice of grapevine variety. Early ripening varieties perform better in cool climates and late ripening varieties in warm climates. Additionally, maturity can be advanced or delayed by different canopy management practices or training systems.

Assessment of the impact of actions in the vineyard and its surrounding environment on biodiversity in Rioja Alavesa (Spain)

Traditional viticulture areas have experienced in the last decades an intensification of field practices, linked to an increased use of fertilisers and phytosanitary products, and to a more intensive mechanization and uniformization of the landscape. This change in management has sometimes led to higher rates of soil erosion andloss of soil structure, fertility decline, groundwater contamination, and to an increased pressure of pests and diseases. Additionally, intensification usually leads to a simplification of landscapes, of particular concern in prestigious wine grape regions where the economical revenue encourages the conversion of land use from natural habitats to high value wine grape production. To revert this trend, it is necessary that growers implement actions that promote biodiversity in their vineyards. The aim of this study is to assess the impact of the implementation of cover crops, vegetational corridors, dry stone walls and vineyard biodiversity hotspots estimated through the study of arthropods. The work has been carried out in four vineyards in Rioja Alavesa belonging to Ostatu winery, where these infrastructures were implemented in 2020. The presence and diversity of arthropods was studied by capturing them at different times in the season and at different distances from the infrastructure using pit-fall traps in the soil and yellow, white and blue chromatic traps at the canopy level. This is a preliminary study in which all adult insects were sorted to the taxonomic level of order and Coleoptera were classified to morphospecies. The results obtained show that there is a relationship between the basic characteristics of the vineyard and the arthropods captured, with a positive effect, although also dependent on the vineyard, of the presence of infrastructure.

Downscaling of remote sensing time series: thermal zone classification approach in Gironde region

In viticulture, the challenges of local climate modelling are multiple: taking into account the local environment, fine temporal and spatial scales, reliable time series of climate data, ease of implementation and reproducibility of the method. At the local scale, recent studies have demonstrated the contribution of spatialization methods for ground-based climate observation data considering topographic factors such as altitude, slope, aspect, and geographic coordinates (Le Roux et al, 2017; De Rességuier et al, 2020). However, these studies have shown questions in terms of the reproducibility and sustainability of this type of climate study. In this context, we evaluated the potential of MODIS thermal satellite images validated with ground-based climate data (Morin et al, 2020). Previous studies have been encouraging, but questions remain to be explored at the regional scale, particularly in the dynamics of the massive use of bioclimatic indices to classify the climate of wine regions. The results at the local scale were encouraging, but this approach was tested in the current study at the regional scale. Several objectives were set: 1) to evaluate the downscaling method for land surface temperature time series, 2) to identify regional thermal structure variations. We used weekly minimum and maximum surface temperature time series acquired by MODIS satellites at a spatial resolution of 1000 m and downscaled at 500 m using topographical variables. Two types of analyses were performed:

Copper contamination in vineyard soils of Bordeaux: spatial risk assessment for the replanting of vines and crops

Copper (Cu) is widely and historically used in viticulture as a fungicide against mildew. Cu has a strong affinity for soil organic matter and accumulates in topsoil horizons. Thus, Cu may negatively affect soil organisms and plants, consequently reducing soil fertility and productivity. The Bordeaux vineyards have the largest vineyard surfaces (26%) within French controlled appellation and a great proportion of French wine production (around 5 million hl per year). Considering the local context of vineyard surfaces decreasing (vine uprooting) and possible new crop plantation, the issue of Cu potential toxicity rises. Therefore, the aims of this work are firstly to evaluate the Cu contamination in vineyard soils of Bordeaux, secondly to produce a risk assessment map for new vine or crop plantation. We used soil analyses from several local studies to build a database with 4496 soil horizon samples. The database was enhanced by means of pedotransfer functions in order to estimate the bioaccessible (EDTA-extractable) Cu in soils of samples without measurements. From this database, 1797 georeferenced samples with CuEDTA concentrations in the topsoil (0-50 cm depth) were used for kriging interpolation in order to produce the spatial distribution map of CuEDTA in vineyard soils. Then, the spatial distribution of Cu was crossed with vine uprooting surfaces and municipality boundaries. CuEDTAconcentrations ranged from 0.52 to 459 mg/kg and showed clear anomalies. Our results from spatial analysis showed that almost 50% of vineyard soil surfaces have CuEDTA concentrations higher than 30 mg/kg (moderate risk for new plantation) and 20% with concentrations higher than 50 mg/kg (high risk for new plantation). A decision-support map based on municipalities was realised to provide a simple tool to stakeholders concerned by land use management.