Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
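As an illustration of the first step only, the segmentation along the retention time axis might be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names `segment_bounds` and `segment_samples`, the `width` and `overlap` parameters (both counted in scans), and the toy data are hypothetical, and each sample is assumed to be stored as a list of scans that are already on a common scan grid.

```python
from typing import List, Tuple


def segment_bounds(n_scans: int, width: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Return (start, stop) scan-index pairs that cover scans 0..n_scans.

    width and overlap are hypothetical tuning parameters, given in scans.
    The last segment is shortened if the scans do not divide evenly.
    """
    if width <= 0 or not 0 <= overlap < width:
        raise ValueError("need width > 0 and 0 <= overlap < width")
    step = width - overlap
    bounds = []
    start = 0
    while start < n_scans:
        stop = min(start + width, n_scans)
        bounds.append((start, stop))
        if stop == n_scans:
            break
        start += step
    return bounds


def segment_samples(samples, width, overlap=0):
    """Cut every sample at the same scan indices.

    Each sample is a list of scans, and each scan is a list of m/z
    intensities. Segment k then holds the same retention time window
    for all samples, so it can be treated as one three-way array
    (samples x scans x m/z) in a later decomposition step.
    """
    n_scans = len(samples[0])
    if any(len(s) != n_scans for s in samples):
        raise ValueError("all samples must have the same number of scans")
    return [
        [sample[start:stop] for sample in samples]
        for start, stop in segment_bounds(n_scans, width, overlap)
    ]


# Toy data (made up): 2 samples, 10 scans, 3 m/z channels each.
toy = [[[float(s + i + m) for m in range(3)] for i in range(10)] for s in range(2)]
segments = segment_samples(toy, width=4, overlap=1)
print(segment_bounds(10, 4, 1))
```

In a full pipeline, each segment would then be passed to a multiway decomposition, and the resulting per-sample values would serve as input to a gradient boosted tree classifier. Those steps are omitted here because the text does not specify their details.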

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
