Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Impact of geographical location on the phenolic profile of minority varieties grown in Spain. II: red grapevines

Because terroir and cultivar are drivers of wine quality, is essential to investigate theirs effects on polyphenolic profile before promoting the implantation of a red minority variety in a specific area. This work, included in MINORVIN project, focuses in the polyphenolic profile of 7 red grapevines minority varieties of Vitis vinifera L. (Morate, Sanguina, Santafe, Terriza Tinta Jeromo Tortozona Tinta) and Tempranillo) from six typical viticulture Spanish areas: Aragón (A1), Cataluña (A2), Castilla la Mancha (A3), Castilla –León (A4), Madrid (A5) and Navarra (A6) of 2020 season. Polyphenolic substances were extracted from grapes. 35 compounds were identified and quantified (mg subtance/kg fresh berry) by HPLC and grouped in anthocyanins (ANT) flavanols (FLAVA), flavonols (FLAVO), hydroxycinnamic (AH), benzoic (BA) acids and stilbenes (ST). Antioxidant activity (AA, mmol TE /g fresh berry) was determined by DPPH method. The results were submitted to a two-way ANOVA to investigate the influence of variety, area and their interaction for each polyphenolic family and cluster analysis was used to construct hierarchical dendrograms, searching the natural groupings among the samples. Sanguina (A3) had the most of total polyphenols while Tempranillo (A5) those of ANT. Sanguina (A2) and (A3) reached the highest values of FLAVO, FLAVA and AA. These two last samples had also the maximum of AA. The effect cultivar and area were significant for all polyphenolic families analyzed. A high variability due to variety (>50%) was observed in FLAVA and the maximum value of variability due to growing area was detected in AA (86.41%), ANT and FLAVO (51%); the interaction variety*zone was significant only for ANT, FLAVO, EST and AA. Finally, dendrograms presented five cluster: i) Sanguina (A2); ii) Sanguina (A3); iii) Tempranillo (A5); iv) Tempranillo (A3); Terriza (A3,A5), Morate (A5,A6); v) Santafé (A1,A6); Tortozona tinta (A1,A3,A6); Tinta Jeromo (A3,A4).

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.

Effect of the commercial inoculum of arbuscular mycorrhiza in the establishment of a commercial vineyard of the cultivar “Manto negro

The favorable effect of symbiosis with arbuscular mycorrhizal fungi (AMF) has been known and studied since the 60s. Nowadays, many companies took the chance to start promoting and selling commercial inoculants of AMF, in order to be used as biofertilizers and encourage sustainable biological agriculture. However, the positive effect of these commercial biofertilizers on plant growth is not always demonstrated, especially under field conditions. In this study, we used a commercial inoculum on newly planted grapevines of a local cultivar grafted on a common rootstock R110. We followed the physiological status of vines, growth and productivity and functional biodiversity of soil bacteria during the first and second years of 20 inoculated with commercial inoculum bases on Rhizophagus irregularis and Funeliformis mosseaeAMF at field planting time and 20 non-inoculated control plants. All the parameters measured showed a neutral to negative effect on plant growth and production. The inoculated plants always presented lower values of photosynthesis, growth and grape production, although in some cases the differences did not reach statistical significance. On the contrary, the inoculation supposed an increase of the bacterial functional diversity, although the differences were not statistically significant either. Several studies show that the effect of inoculation with AMF is context-dependent. The non-favorable effects are probably due to inoculation ineffectiveness under complex field conditions and/or that, under certain conditions, AMF presence may be a parasitic association. This puts into question the effectiveness of its application in the field. Therefore, it is recommended to only resort to this type of biofertilizer when the cultivation conditions require it (e.g., very low previous microbial diversity, foreseeable stress due to drought, salinity, or lack of nutrients) and not as a general fertilization practice.

Current climate change in the Oplenac wine-growing district (Serbia)

Serbian autochthonous vine varieties Smederevka (for white wines) and Prokupac (for rosé and red wines) are the primary representatives of typical characteristics of wines and terroir of numerous wine-growing areas in Serbia. In the past, these varieties were the leading vine varieties, however, as the result of globalization of winemaking and the trend of consumption of wines from widely prevalent vine varieties, they were replaced by introduced international varieties. Smederevka and Prokupac vine varieties are characterized by later time of grape ripening, and relative sensitivity to low temperatures. Climate conditions can be a restrictive factor for production of high-quality grapes and wine and for the spatial spreading of these varieties in hilly continental wine-growing areas.
This paper focuses on the spatial analysis of changes of main climate parameters, in particular, analysis of viticultural bioclimatic indices that were determined for the purposes of viticulture zoning of wine-growing areas in the period 1961-2010, and those same parameters determined for the current, that is, referential climate period (1988-2017). Results of the research, that is, analysis of climate changes indicate that the majority of examined climate parameters in the Oplenac wine-growing district improved from the perspective of Smederevka and Prokupac vine varieties. These studies of climate conditions indicate that changes of analyzed climate parameters, that is, bioclimatic indices will be favorable for cultivation of varieties with later grape ripening times and those more sensitive to low temperatures, such as the autochthonous vine varieties Smederevka and Prokupac, therefore, it is recommended to producers to more actively plant vineyards with these varieties in the territory of the Oplenac wine-growing district.

Deconstructing the soil component of terroir: from controversy to consensus

Wine terroir describes the collectively recognized relation between a geographical area and the distinctive organoleptic characteristics of the wines produced in it. The overriding objective in terroir studies is therefore to provide scientific proof relating the properties of terroir components to wine quality and typicity. In scientific circles, the role of climate (macro-, meso- and micro-) on grape and wine characteristics is well documented and accepted as the most critical. Moreover, there has been increasing interest in recent years about new elements with possible importance in shaping wine terroir like berry/leaf/soil microbiology or even aromatic plants in proximity to the vineyard conferring flavors to the grapes. However, the actual effect of these factors is also dependent on complex interactions with plant material (variety/clone, rootstock, vine age) and with human factors.
The contribution of soil, although a fundamental component of terroir and extremely popular among wine enthusiasts, remains a much-debated issue among researchers. The role of geology is probably the one mostly associated by consumers with the notion of terroir with different parent rocks considered to give birth to different wine styles. However, the relationship between wine properties and the underlying parent material raises a lot of controversy especially regarding the actual existence of rock-derived flavors in the wine (e.g. minerality). As far as the actual soil properties are concerned, the effect of soil physical properties is generally regarded as the most significant (e.g sandy soils being associated with lighter wines while those on clay with colored and tannic ones) mostly through control of water availability which ultimately modifies berry ripening conditions either directly by triggering biosynthetic pathways, or indirectly by altering vigor and yield components. The role of soil chemistry seems to be weakly associated to wine sensory characteristic, although N, K, S and Ca, but also soil pH, are often considered important in the overall soil effect.
Recently, in the light of evidence provided by precision agriculture studies reporting a high variability of vineyard soils, the spatial scale should also be taken into consideration in the evaluation of the soil effects on wines. While it is accepted that soil effects become more significant than climate on a local level, it is not clear whether these micro-variations of vineyard soils are determining in the terroir effect. Moreover, as terroir is not a set of only natural factors, the magnitude of the contribution of human-related factors (irrigation, fertilization, soil management) to the soil effect still remains ambiguous. Lastly, a major shortcoming of the majority of works about soil effects on wine characteristics is the absence of connection with actual vine physiological processes since all soil effects on grape and wine chemistry and sensorial properties are ultimately mediated through vine responses.
This article attempts to breakdown the main soil attributes involved in the terroir effect to suggest an improved understanding about soil’s true contribution to wine sensory characteristics. It is proposed that soil parameters per se are not as significant determining factors in the terroir effect but rather their mutual interactions as well as with other natural and human factors included in the terroir concept. Consequently, similarly to bioclimatic indices, composite soil indices (i.e. soil depth, water holding capacity, fertility, temperature etc), incorporating multiple soil parameters, might provide a more accurate and quantifiable means to assess the relative weight of the soil component in the terroir effect.