Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are therefore inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification of the decomposed tensor [1, 2].
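The segmentation step can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the fixed window length, the optional overlap, and the assumption that every sample was acquired on the same scan grid are all hypothetical choices made for illustration. Each chromatogram is represented as a list of scans, and each scan as a list of intensities per m/z channel, so that every segment becomes a samples x scans x m/z array that a multiway decomposition could take as input.

```python
from typing import List, Sequence, Tuple

Scan = Sequence[float]      # intensities per m/z channel at one scan
Sample = Sequence[Scan]     # one chromatogram: scans along retention time


def segment_bounds(n_scans: int, segment_len: int,
                   overlap: int = 0) -> List[Tuple[int, int]]:
    """Return (start, stop) scan-index pairs covering the retention time axis.

    Fixed-length windows with optional overlap are an assumption of this
    sketch; the last window is shortened if it would run past the end.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    if not 0 <= overlap < segment_len:
        raise ValueError("overlap must satisfy 0 <= overlap < segment_len")
    step = segment_len - overlap
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + segment_len, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_dataset(samples: Sequence[Sample], segment_len: int,
                    overlap: int = 0) -> List[List[Sample]]:
    """Cut all chromatograms at the same scan indices.

    Returns one entry per segment; each entry holds the matching slice of
    every sample (samples x scans x m/z), with no peak picking and no
    alignment applied.
    """
    if not samples:
        return []
    n_scans = len(samples[0])
    if any(len(s) != n_scans for s in samples):
        # Assumption of this sketch: a shared scan grid across samples.
        raise ValueError("all samples must have the same number of scans")
    bounds = segment_bounds(n_scans, segment_len, overlap)
    return [[s[start:stop] for s in samples] for start, stop in bounds]


if __name__ == "__main__":
    # Two toy samples, five scans each, two m/z channels per scan.
    toy = [
        [[1.0, 0.0], [2.0, 1.0], [3.0, 2.0], [2.0, 1.0], [1.0, 0.0]],
        [[0.0, 1.0], [1.0, 2.0], [2.0, 3.0], [1.0, 2.0], [0.0, 1.0]],
    ]
    for i, segment in enumerate(segment_dataset(toy, segment_len=2)):
        print(i, [len(part) for part in segment])
```

Cutting every sample at the same scan indices keeps small retention time shifts inside a segment, which is the situation a shift-tolerant multiway decomposition would then have to handle; the window length and overlap would need tuning for real data.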

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne”, representing 37 organizations from the different wine regions, have been working increasingly closely over the last two decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid-1980s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. The INRAe Vassal collection plays a key role here, as it holds the largest diversity available, along with a rich bibliography and herbariums, offering the opportunity to document and double-check the identity of a cultivar and consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, the IFV selection center takes over to initiate the process of selection and propagation. A specific focus on regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these were neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered positive qualities, not only for mitigating the effects of climate change, but also as an opportunity for restoring diversity…

Grapevine xylem embolism resistance spectrum reveals which varieties have a lower mortality risk in a future dry climate

Wine-growing regions have recently faced intense and frequent droughts that have led to substantial economic losses, and the maintenance of grapevine productivity under a warmer and drier climate will rely notably on planting drought-resistant cultivars. Given that plant growth and yield depend on water transport efficiency and maintenance of photosynthesis, and thus on the preservation of vascular system integrity during drought, a better understanding of drought-related hydraulic traits that have a significant impact on physiological processes is urgently needed. We have worked towards this end by assessing vulnerability to xylem embolism in 30 commercial grapevine varieties encompassing red and white Vitis vinifera varieties, hybrid varieties characterized by polygenic resistance to powdery and downy mildew, and commonly used rootstocks. These analyses further allowed a global assessment of wine regions with respect to their varietal diversity and resulting vulnerability to stem embolism. Hybrid cultivars displayed the highest vulnerability to embolism, while rootstocks showed the greatest resistance. Significant variability also arose among Vitis vinifera varieties, with Ψ12 and Ψ50 values ranging from -0.4 to -2.7 MPa and from -1.8 to -3.4 MPa, respectively. Cabernet franc, Chardonnay and Ugni blanc featured among the most vulnerable varieties, while Pinot noir, Merlot and Cabernet Sauvignon ranked among the most resistant. Consequently, wine regions with a significant proportion of vulnerable varieties, such as Poitou-Charentes, France and Marlborough, New Zealand, turned out to be at greater risk under drought. These results highlight that grapevine varieties may not respond equally to warmer and drier conditions, underlining the importance of incorporating hydraulic traits associated with plant drought tolerance into breeding programmes and modeling simulations of grapevine yield maintenance under severe drought. Finally, they represent a step forward in advising the wine industry about which varieties and regions would have the lowest risk of drought-induced mortality under climate change.

Effect of one-year cover crop and arbuscular mycorrhiza inoculation in the microbial soil community of a vineyard

The microbial composition of the soil is an important factor to consider in viticulture, since its influence on the “terroir” and on the organoleptic properties of the wine has been demonstrated. Different agronomic techniques have the potential to modify the composition and functionality of the soil microbial community. Maintaining green covers is known to increase soil microbial diversity. The direct application of inoculum of beneficial microorganisms to the soil has also been used to increase their abundance. However, the environmental conditions of each site seem to weigh heavily on the outcome of these practices. In this study, we compared the effect on the microbial community of a cover crop with legumes in autumn and of the inoculation of grapevines with a commercial inoculum based on Rhizophagus irregularis and Funneliformis mosseae in the previous spring. The study was carried out in a vineyard in Binissalem, Mallorca, Spain. After applying the treatments, we analyzed the soil microbial communities using the data obtained from Illumina amplification of soil DNA from the 16S and ITS regions to characterize the bacterial and fungal communities, respectively. In addition, we recorded the physicochemical characteristics of the soil at each sampling point. The results showed that agronomic management, in the short term, has less influence than soil characteristics on the composition of the soil microbiome. From these results, we conclude that in a vineyard, agricultural techniques should focus on improving the characteristics of the soil in order to improve the biodiversity of the soil microbiota.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent, and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may vary according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation, such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in making use of farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In this context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on the influence of temperature and rain on grape yield development as a case study. It uses data from three commercial vineyards situated in the Bordeaux region (France), California (USA) and Israel, respectively. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and direction of correlation with yield. Other periods were found for only one or two vineyards and/or were not referred to in the literature, for example during the winter before harvest.

Mechanisms involved in the heating of the environment by the aerodynamic action of a wind machine to protect a vineyard against spring frost

One of the main consequences of global warming is the rise of the mean temperature. Thus, heat summation by the plants begins sooner in the early spring and, as growing degree-days accumulate, phenological development tends to happen earlier. However, spring frost is still a recurrent phenomenon that causes serious damage to buds and therefore threatens winegrowers' harvests. The wind machine is an increasingly used solution to protect fruit crops against spring frost. It is composed of a 10-m mast with a blowing fan at its top. By tapping into the strength of the nocturnal thermal inversion, it sweeps the crop by propelling the warm air above down to the ground. Thus, stratification is momentarily suppressed. Furthermore, the continuous action of the machine, alone or in synergy, or the addition of a heater, allows the bud to be bathed in a warmer environment. Also, the intermittent action of the tower’s warm gust reaches the bud directly at each rotation period. All these actions allow the bud to warm up continuously, but with different intensities and over different periods. Although there is evidence of the effectiveness of wind machines, the thermal transfers involved in those mechanisms raise questions about their true nature. Field measurements based on ultrasonic anemometers and fast-responding thermocouples, complemented by laboratory measurements on a reduced-scale model, make it possible to characterize both the airflow produced by the wind machine and the local temperature in its vicinity. Those experiments were carried out in the vineyard of Quincy, in the framework of the SICTAG project. In the forthcoming paper, we will detail the aeraulic characterization of the wind machine and the resulting thermal effects, and we will focus on how the wind machine warms up the local atmosphere and reduces the freezing risk.