Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

austrianvineyards.com: online viewer of all designations of Austrian wine

To digitally record and present all the origins of Austrian wines in the same perfect and clear way was the motivation for the Austrian Wine Marketing Board (Austrian Wine) to start with the project in 2018. In June 2021 the results were presented to the public in an online viewer showing all the designations of Austrian wine, available at https://austrianvineyards.com in a largely barrier-free manner. The online viewer provides tailored individual maps fitted to the respective zoom level. The smallest unit of wine-origins in Austria is called Ried and is displayed in a plot-specific manner highlighting areas under vine. Information on the Ried include administrative district, winegrowing municipality, cadastral municipality, large collective vineyard site, specific winegrowing region, generic winegrowing region, winegrowing area and, in many cases, an illustrative picture. Complementary data on the size, elevation (minimum-maximum), orientation (in 8 sectors plus flat) and gradient (minimum, maximum, average) are based on the area under vine according to the EU’s Integrated Administration and Control System. Additional information covers climate data. The diagrams are taken from the monthly breakdown of data in the annals of the Central Institute for Meteorology and Geodynamics, Austria provide a display of values for air temperature, precipitation, and sunshine hours for the reference year and the long-term average. Seasonal aggregated data on temperature, precipitation, and sunshine hours complete the display. Short descriptions with emphasis on geology and soil, field name in historical maps, etymology of the denomination, and main planted variety complements the available information for the main designations in the online viewer. These descriptions are compiled by winegrowers, geologists, historians, and journalists. All the information and data can be extracted to a pdf-file. Printed vineyard maps are also available. Missing content regarding wine origins in Styria will be completed in winter 2021/22.

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions leading to grapevine phenological shifts as well altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening revealing altered traits in time to target specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies with forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in DOY to target specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series monitored at ground weather station by means of hourly air temperature and rainfall data were analyzed. Climate statistics were obtained and for each time period (month, bimester, quarter and year) weather anomalies were identified. A linear regression analysis was performed to assess a possible correlation that may exist between ripening and weather anomalies. For each cultivar, ripeness advances or delays expressed in number of days to target the specific ripening threshold were assessed in relation to registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and spring quarter are key to understanding the effect of climate change on sugar ripeness. Minimum temperatures of May-June bimester and maximum temperatures of spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.

The use of rootstock as a lever in the face of climate change and dieback of vineyard

As viticulture faces challenges such as climate change or vineyard dieback, the choice of the variety and rootstock becomes more and more crucial. To study rootstock levers in the Bordeaux region, a parcel of Cabernet Sauvignon (CS) was planted with four rootstocks in 2014. Twenty repetitions of each of the following four rootstocks were set up: 101-14 MGt, Nemadex AB, 420A MGt and Gravesac. The number of bunches, yields and pruning weights of the vine shoots were measured individually on 240 vines from 2017 to 2021. Since 2020, nitrogen status assessed by assimilable nitrogen level, hydric status assessed by δ13C and berry maturity were measured on 80 samples taken from 20 repetitions of the four rootstocks. A lower yield was measured for CS grafted onto Nemadex AB due to the lower number of bunches and the lower weight of berries. The differences between the other three rootstocks are small, but CS grafted onto 420A MGt was the most productive. The CS grafted onto Nemadex AB had the lowest pruning weight while 101-14 MGt had the highest. In 2020, δ13C showed a more moderate water stress with 101-14 MGt and 420A MGt than with Nemadex AB. Surprisingly, the Gravesac was under more stress than the 101-14 MGt. The nitrogen status in the berries was better for Nemadex AB but this was perhaps due to the significantly lower weight of the berries.Rootstock 101-14 MGt attained the highest accumulation of sugars in the berries while 420A MGt allows to preserve higher acidity. The parcel is still young which may explain some of the results. These measures must therefore be continued over the next several years to fully assess the effects of these rootstocks on the development of the vines and the quality of the production under new climatic conditions.

Local ancient grapevine cultivars to face future viticulture

Among the different strategies to cope with the negative impacts of climate change on viticulture, the exploitation of genetic diversity is one of the most promising to adapt to new conditions and maintain wine production and quality. One of the biggest concerns in the context of climate change is to improve water use efficiency (WUE). In this way, the use of genotypes that present a better response to drought and high WUE is a key issue. In this work, physiological performance analysis was conducted to compare the water deficit stress (WDS) responses of local and widespread grapevines cultivars. Leaf gas exchange, water use efficiency (WUE) at different levels (leaf and long-term WUE (∆13C)), leaf osmotic adjustment and other water relations parameters were determined in plants under well-watered and WDS conditions alongside assessment of the levels of foliar hormones concentrations. Results denote that local cultivars displayed better physiological performance under WDS as compared to the widely-distributed ones. he results corroborate the hypothesis that better stomatal control allows increasing leaf WUE under drought as occurred in the local Callet cv.; but the minority local cultivar Escursac cv. showed high WUE under both treatments. In this case, high WUE can be related to maintaining higher photosynthetic activity under drought. The different mechanisms underlying the better performance under WDS and high WUE of minority local cultivars are discussed.

Assessing the relationship between cordon strangulation, dieback, and fungal trunk disease symptom expression

Grapevine trunk diseases including Eutypa dieback are a major factor in the decline of vineyards and may lead to loss of productivity, reduced income, and premature reworking or replanting. Several studies have yielded results indicating that vines may be more likely to express symptoms of vascular disease if their health is already compromised by stress. In Australia and many other wine-growing regions it is a common practice for canes to be wrapped tightly around the cordon wire during the establishment of permanent cordon arms. It is likely that this practice may have a negative effect on health and longevity, as older cordons that have been trained in this manner often display signs of decay and dieback, with the wire often visibly embedded within the wood of the cordon. It is possible that adopting a training method which avoids constriction of the vasculature of the cordon may help to limit the onset of vascular disease symptom expression. A survey was conducted during the spring of two consecutive growing seasons on vineyards in South Australia displaying symptoms of Eutypa lata infection when symptomless shoots were 50–100 cm long. Vines were assessed as follows: (i) the proportion of cordon exhibiting dieback was rated using a 0–100% scale; (ii) the proportion of canopy exhibiting foliar symptoms of Eutypa dieback was rated using a 0–100% scale; (iii) the severity of strangulation was rated using a 0–4 point scale. Images were also taken of each vine for the purpose of measuring plant area index (PAI) using the VitiCanopy App. The goal of the survey was to determine if and to what extent any correlation exists between severity of strangulation and cordon dieback, in addition to Eutypa dieback foliar symptom expression.