Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Effect of the commercial inoculum of arbuscular mycorrhiza in the establishment of a commercial vineyard of the cultivar “Manto negro

The favorable effect of symbiosis with arbuscular mycorrhizal fungi (AMF) has been known and studied since the 60s. Nowadays, many companies took the chance to start promoting and selling commercial inoculants of AMF, in order to be used as biofertilizers and encourage sustainable biological agriculture. However, the positive effect of these commercial biofertilizers on plant growth is not always demonstrated, especially under field conditions. In this study, we used a commercial inoculum on newly planted grapevines of a local cultivar grafted on a common rootstock R110. We followed the physiological status of vines, growth and productivity and functional biodiversity of soil bacteria during the first and second years of 20 inoculated with commercial inoculum bases on Rhizophagus irregularis and Funeliformis mosseaeAMF at field planting time and 20 non-inoculated control plants. All the parameters measured showed a neutral to negative effect on plant growth and production. The inoculated plants always presented lower values of photosynthesis, growth and grape production, although in some cases the differences did not reach statistical significance. On the contrary, the inoculation supposed an increase of the bacterial functional diversity, although the differences were not statistically significant either. Several studies show that the effect of inoculation with AMF is context-dependent. The non-favorable effects are probably due to inoculation ineffectiveness under complex field conditions and/or that, under certain conditions, AMF presence may be a parasitic association. This puts into question the effectiveness of its application in the field. Therefore, it is recommended to only resort to this type of biofertilizer when the cultivation conditions require it (e.g., very low previous microbial diversity, foreseeable stress due to drought, salinity, or lack of nutrients) and not as a general fertilization practice.

Terroir analysis and its complexity

Terroir is not only a geographical site, but it is a more complex concept able to express the “collective knowledge of the interactions” between the environment and the vines mediated through human action and “providing distinctive characteristics” to the final product (OIV 2010). It is often treated and accepted as a “black box”, in which the relationships between wine and its origin have not been clearly explained. Nevertheless, it is well known that terroir expression is strongly dependent on the physical environment, and in particular on the interaction between soil-plant and atmosphere system, which influences the grapevine responses, grapes composition and wine quality. The Terroir studying and mapping are based on viticultural zoning procedures, obtained with different levels of know-how, at different spatial and temporal scales, empiricism and complexity in the description of involved bio-physical processes, and integrating or not the multidisciplinary nature of the terroir. The scientific understanding of the mechanisms ruling both the vineyard variability and the quality of grapes is one of the most important scientific focuses of terroir research. In fact, this know-how is crucial for supporting the analysis of climate change impacts on terroir resilience, identifying new promised lands for viticulture, and driving vineyard management toward a target oenological goal. In this contribution, an overview of the last findings in terroir studies and approaches will be shown with special attention to the terroir resilience analysis to climate change, facing the use and abuse of terroir concept and new technology able to support it and identifying the terroir zones.

Biodiversity in the vineyard agroecosystem: exploring systemic approaches

Biodiversity conservation and restoration are essential for guarantee the provision of ecosystem services associated to vineyard agroecosystem such as climate regulation trough carbon sequestration and control of pests and diseases. Most of published research dealing with the complexity of the vineyard agroecosystems emphasizes the necessity of innovative approaches, including the integration of information at different temporal and spatial scales and development of systemic analysis based on modelling. A biodiversity survey was conducted in the Franciacorta wine-growing area (Lombardy, Italy), one of the most important Italian wine-growing regions for sparkling wine production, considering a portion of the territory of 112 ha. The area was divided into several Environmental Units (EUs), defined as a whole vineyard or portion of vineyard homogenous in terms of four agronomic characteristics: planting year, planting density, cultivar, and training system. In each EU a set of compartments was identified and characterised by specific variables. The compartments are meteorology, morphology (altitude, slope, aspect, row orientation, and solar irradiance), ecological infrastructures and management. The landscape surrounding EU was also characterised in terms of land-use in a buffer zone of 500 m. For each component a specific methodology was identified and applied. Different statistical approaches were used to evaluate the method to integrate the information related to different compartments within the EU and related to the buffer zone. These approaches were also preliminarily evaluated for their ability to describe the contribution of biodiversity and landscape components to ecosystem services. This methodological exploration provides useful indication for the development of a fully systemic approach to structural and functional biodiversity in vineyard agroecosystems, contributing to promote a multifunctional perspective for the all wine-growing sector.

First step in the preparation of a soil map of the Protected Designation of Origin Valdepeñas (Central, Spain)

This work is a first step to make a map of vineyard soils. The characterization of the soils of the Protected Designation of Origin (D.P.O.) Valdepeñas will allow to group the studied profiles according to their physico-chemical characteristics and the concentrations of most relevant chemical elements. 90 soil profiles were analysed throughout the territory and the soils were sampled and described according to FAO (2006) and classified according to and Soil Taxonomy (2014). All samples were air dried, sieved and some physico-chemical parameters were determined following standard protocols. Also, major and trace elements were analysed by X-ray fluorescence. The statistically study was made using the SPSS program. Trend maps were made using the ArcGIS program. The studied soils have the following average properties: pH, 8.3; electrical conductivity, 0,20 dS/m (low); clay, 18.8% (medium) and CaCO3, 17.1% (high). In the study for the major elements. The major elements of these soils are Si, followed by Ca and Al, with an average content of 203.7 g/kg, 105.5 g/kg and 74.0 g/kg respectively. On the other hand, 27 trace elements have been studied. Of all of them, it can be highlighted the average values of Ba (361.8 mg/kg), Sr (129.3 mg/kg), Rb (83.4 mg/kg), V (74.2 mg/kg) and Ce (70.6 mg/kg). Ba, V and Ce values are higher and the values of Sr and Rb are lower to those found in the literature. The discriminant analysis shows a percentage of grouping of 91%. The content of chemical elements together with the physico-chemical characteristics allows grouping the soils in 4 group according to their order in the classification to Soil Taxonomy; due to the importance of the Calcisols in Castilla-La Mancha, it has been decided to establish them as their own group even if they do not appear in Soil Taxonomy classification.

Metabolomic discrimination of grapevine water status for Chardonnay and Pinot noir

Water status impact in viticulture has been widely explored, as it strongly affects grapevine physiology and grape chemical composition. It is considered as a key component of vitivinicultural terroir. Most of the studies concerning grapevine water status have focused on either physiological traits, or berry compounds, or traits involved in wine quality. Here, the response of grapevine to water availability during the ripening period is assessed through non-targeted metabolomics analysis of grape berries by ultra-high resolution mass spectrometry. The grapevine water status has been assessed during 2 consecutive years (2019 & 2020), through carbon isotope discrimination on juices from berries collected at maturity (21.5 brix approx.) for 2 Vitis vinifera cv. Pinot noir (PN) and Chardonnay (CH). A total of 220 grape juices were collected from 5 countries worldwide (Italy; Argentina; France; Germany; Portugal). Measured δ13C (‰) varied from -28.73 to -22.6 for PN, and from -28.79 to -21.67 for CH. These results also clearly revealed higher water stress for the 2020 vintage. The same grape juices have been analysed by Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FT-ICR-MS) and Liquid Chromatography coupled to Mass Spectrometry (LC-qTOF-MS), leading to the detection of up to 4500 CHONS containing elemental compositions, and thus likely tens of thousands of individual compounds, which include fatty acids, organic acids, peptides, phenolics, also with high levels of glycosylation. Multivariate statistical analysis revealed that up to 160 elemental compositions, covering the whole range of detected masses (100 –1000 m/z), were significantly correlated to the observed gradients of water status. Examples of chemical markers, which are representative of these complex fingerprints, include various derivatives of the known abscisic acid (ABA), such as phaesic acid or abscisic acid glucose ester, which are significantly correlated with higher water stress, regardless of the variety. Cultivar-specific behaviours could also be identified from these fingerprints. Our results provide an unprecedented representation of the metabolic diversity, which is involved in the water status regulation at the grape level, and which could contribute to a better knowledge of the grapevine mitigation strategy in a climate change context.