Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Influence of a spontaneous cover crop on the vineyard and soil erosion under Mediterranean climate

Sixty five % of the agricultural area of the Basque Country located in the DO Ca Rioja corresponds to vineyards. More than 40% of it has an average slope greater than 10%, which makes it sensitive to erosive processes. Furthermore, it is foreseeable that extreme weather events (storms, hail, extreme heat and cold, etc.) will be favored due to climate change. Cover cropping can mitigate this risk, and therefore the objective of this work is to evaluate the impact that a vegetable cover has on the agronomic behavior of the vineyard, the quality of the grape and soil erosion. For this, a trial has been carried out with a Graciano variety vineyard with a slope between 10% -20% during the years 2020 and 2021. Conventional tillage management in the area has been compared (4-6 passes per year of tillage machinery) versus spontaneous vegetation cover management in the vineyard. This implies not tilling and allowing the grass of the land to colonize the range between the lines of vines, controlling their height through 1-3 mowing passes per year, always trying to affect the surface of the land as little as possible. The vegetative growth, yield and quality of the grape and wine was measured. Furthermore, erosion has been measured using Gerlasch boxes. The yield was lower in the second year of the trial in the cover crop treatment, but erosion was significantly reduced.

Adaptability of grapevines to climate change: characterization of phenology and sugar accumulation of 50 varieties, under hot climate conditions

Climate is the major factor influencing the dynamics of the vegetative cycle and can determine the timing of phenological periods. Knowledge of the phenology of varieties, their chronological duration, and thermal requirements, allows not only for the better management of interventions in the vineyard, but also to predict the varieties’ behaviour in a scenario of climate change, giving the wine producer the possibility of selecting the grape varieties that are best adapted to the climatic conditions of a certain terroir. In 2014, Symington Family Estates, Vinhos, established two grape variety libraries in two different places with distinctive climate conditions (Douro Superior, and Cima Corgo), with the commitment of contributing to a deeper agronomic and oenological understanding of some grape varieties, in hot climate conditions. In these research vineyards are represented local varieties that are important in the regional and national viticulture, but also others that have over time been forgotten — as well as five international reference cultivars. From 2017 to 2021, phenological observations have been made three times a week, following a defined protocol, to determine the average dates of budbreak, flowering and veraison. With the climate data of each location, the thermal requirements of each variety and the chronological duration of each phase have been calculated. During maturation, berry samples have been gathered weekly to study the dynamics of sugar accumulation, between other parameters. The data was analysed applying phenological and sugar accumulation models available in literature. The results obtained show significant differences between the varieties over several parameters, from the chronological duration and thermal requirements to complete the various stages of development, to the differences between the two locations, confirming the influence of the climate on phenology and the stages of maturation, in these specific conditions.

Towards a regional mapping of vine water status based on crowdsourcing observations

Monitoring vine water status is a major challenge for vineyard management because it influences both yield and harvest quality. It is also a challenge at the territorial scale for identifying periods of high water restriction or zones regularly impacted by water stress. This information is of major importance for defining collective strategies, anticipating harvest logistic or applying for irrigation authorisation. At this spatial scale, existing tools and methods for monitoring vine water status are few and often require strong assumptions (e.g. water balance model). This paper proposes to consider a collaborative collection of observations by winegrowers and wine industry stakeholders (crowdsourcing) as an interesting alternative. Indeed, it allows the collection of a large number of field observations while pooling the collection effort. However, the feasibility of such a project and its interest in monitoring vine water status at regional scale has never been tested.

The objective of this article is to explore the possibility of making a regional map of vine water status based on crowdsourcing observations. It is based on the study of the free mobile application ApeX-Vigne, which allows the collection of observations about vine shoot growth. This information is easy to collect and can be considered, under certain conditions, as a proxy for vine water status. This article presents the first results obtained from the nearly 18,000 observations collected by winegrowers and wine industry stakeholders during 2019, 2020 and 2021 seasons. It presents the vine shoot growth maps obtained at regional scale and their evolution over the three vintages studied. It also proposes an analysis of the factors that favoured the number of observations collected and those that favoured their quality. These results open up new perspectives for monitoring vine water status at a regional scale but above they provide references for other crowdsourcing projects in viticulture.

20-Year-Old data set: scion x rootstock x climate, relationships. Effects on phenology and sugar dynamics

Global warming is one of the biggest environmental, social, and economic threats. In the Douro Valley, change to the climate are expected in the coming years, namely an increase in average temperature and a decrease in annual precipitation. Since vine cultivation is extremely vulnerable and influenced by the climate, these changes are likely to have negative effects on the production and quality of wine.
Adaptation is a major challenge facing the viticulture sector where the choice of plant material plays an important role, particularly the rootstock as it is a driver for adaptation with a wide range of effects, the most important being phylloxera, nematode and salt, tolerance to drought and a complex set of interactions in the grafted plant.
In an experimental vineyard, established in the Douro Region in 1997, with four randomized blocs, with five varieties, Touriga Nacional, Tinta Barroca, Touriga Franca and Tinta Roriz, grafted in four rootstocks, Rupestris du Lot, R110, 196-17C, R99 and 1103P, data was collected consecutively over 20 years (2001-2020). Phenological observations were made two to three times a week, following established criteria, to determine the average dates of budbreak, flowering and veraison. During maturation, weekly berry samples were taken to study the dynamics of sugar accumulation, amongst other parameters. Climate data was collected from a weather station located near the vineyard parcel, with data classified through several climatic indices.
The results achieved show a very low coefficient of variations in the average date of the phenophases and an important contribution from the rootstock in the dynamic of the phenology, allowing a delay in the cycle of up to10-12 days for the different combinations. The Principal Component Analysis performed, evaluating trends in the physical-chemical parameters, highlighted the effect of the climate and rootstock on fruit quality by grape varieties.

Diagnosis of soil quality and evaluation of the impact of viticultural practices on soil biodiversity in a vineyard in southwestern France

Viticulture is facing two major changes – climate change and agroecological transition. In both cases, soil quality is seen as a lever to move towards a more sustainable viticulture. However, soil biological quality is little considered in the implementation of viticultural practices. Gascogn’Innov (2017-2022) is an Operational Group funded by the European Innovation Partnership for Agriculture. As such, it brings together winegrowers from the south-west of France, scientists, advisors and technicians, around a project focused on viticultural soil biological functioning and the design of technical routes more respectful toward soil heritage. To achieve this, the project aims to acquire references on the impact of viticultural practices on soil biology from a dynamic way, and to test a methodology to integrate information provided by the soil bioindicators to manage farming systems. A set of indicators of soil biological quality are evaluated in the project: microorganisms (bacteria and fungi abundance and diversity), fauna (abundance and diversity of nematodes and earthworms), physico-chemical characteristics, soil structure assessment and degradation rate of organic matter. Based on a network of 13 plots that have been subject to an initial diagnosis in 2017, several agronomical practices to restore soil fertility are experimented to redesign the cropping system (for instance plant cover, organic matter inputs, reduction of herbicides, mineral fertilizers). System redesign was made in collaboration by winegrowers and an interdisciplinary group of experts (agronomists, biologists). Several indicators are measured on vine and soil at each vintage to assess vine health and productivity. At the end of the project (2021), a final diagnosis was carried out. Gascogn’Innov allowed to create a regional database on the quality of wine-growing soils, which permitted to evaluate the effect of practices according to soil types. Especially, decreasing the intensity of tillage and increasing the duration and diversity of grass coverage tends to increase the abundance of all the organisms studied. This project confirmed the value of soil biological quality indicators to drive the sustainability of practices, but also highlighted the key-role of expertise, in both agronomy and soil biology, to help winegrowers understand and appropriate their soil quality diagnoses.