Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Rapid damage assessment and grapevine recovery after fire

There is increasing scientific consensus that climate changeis the underlying cause of the prolonged dry and hot conditions that have increased the risk of extreme fire weather in many countries around the world. In December 2019, a bushfire event occurred in the Adelaide Hills, South Australia where 25,000 hectares were burnt and in vineyards and surrounding areas various degrees of scorching and infrastructure damage occurred. The ability to coordinate and plan recovery after a fire event relies on robust and timely data. The current practice for measuring the scale and distribution of fire damage is to walk or drive the vineyard and score individual vines based on visual observation. The process is time consuming, subjective, or semi-quantitative at best. After the December 2019 fires, it took many months to access properties and estimate the area of vineyard damaged. This study compares the rapid assessment and mapping of fire damage using high-resolution satellite imagery with more traditional ground based measures. Satellite imagery tracking vineyard recovery in the season following the bushfire is being correlated to field assessments of vineyard productivity such as canopy health and development, fertility and carbohydrate storage. Canopy health in the seasons following the fires correlated to the severity of the initial fire damage. Severely damaged vines had reduced canopy growth, were infertile or had very low fertility as well as lower carbohydrate levels in buds and canes during dormancy, which reduced productivity in the seasons following the bushfire event. In contrast, vines that received minor damage were able to recover within 1-2 years. Tools that rapidly and affordably capture the extent and severity of damage over large vineyard area will allow producers, government and industry bodies to manage decisions in relation to fire recovery planning, coordination and delivery, improving the efficiency and effectiveness of their response.

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne” representing 37 organizations from the different wine regions, have been working increasingly closely over the last 2 decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid 1980’s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. INRAe Vassal collection plays a key role here as it holds the largest diversity available, along with a rich bibliography and herbariums, offering us the opportunity to document and double check the identity of a cultivar, consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region, through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, IFV selection center takes over to initiate the process of selection and propagation. A specific focus within regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif, are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these have been neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered as positive qualities, not only in mitigation of the effects of climate change, but also as an opportunity for restoring diversity…

Grapevine sugar concentration model in the Douro Superior, Portugal

Increasingly warm and dry climate conditions are challenging the viticulture and winemaking sector. Digital technologies and crop modelling bear the promise to provide practical answers to those challenges. As viticultural activities strongly depend on harvest date, its early prediction is particularly important, since the success of winemaking practices largely depends upon this key event, which should be based on an accurate and advanced plan of the annual cycle. Herein, we demonstrate the creation of modelling tools to assess grape ripeness, through sugar concentration monitoring. The study area, the Portuguese Côa valley wine region, represents an important terroir in the “Douro Superior” subregion. Two varieties (cv. Touriga Nacional and Touriga Franca) grown in five locations across the Côa Region were considered. Sugar accumulation in grapes, with concentrations between 170 and 230 g l-1, was used from 2014 to 2020 as an indicator of technological maturity conditioned by meteorological factors. The climatic time series were retrieved from the EU Copernicus Service, while sugar data were collected by a non-profit organization, ADVID, and by Sogrape, a leading wine company. The software for calibrating and validating this model framework was the Phenology Modeling Platform (PMP), version 5.5, using Sigmoid and growing degree-day (GDD) models for predictions. The performance was assessed through two metrics: Roots Mean Square Error (RMSE) and efficiency coefficient (EFF), while validation was undertaken using leave-one-out cross-validation. Our findings demonstrate that sugar content is mainly dependent on temperature and air humidity. The models achieved a performance of 0.65

Climate modeling at local scale in the Waipara winegrowing region in the climate change context

In viticulture, a warming climate can have a very significant impact on grapevine development and therefore on the quality and characteristics of wines across different spatial scales, ranging from global to local. In order to adapt wine-growing to climate change, global climate models can be used to define future scenarios, but only at the scale of major wine regions. Despite the huge progress made over the last ten years in terms of the spatial resolution of climate models (now downscaled to a few square kilometres), they are not yet sufficiently precise to account for the local climate variability associated with such parameters as local topography, in spite of these parameters being decisive for vine and wine characteristics. This study describes a method to downscale future climate scenarios to vineyard scale. Networks of data loggers have been used to collect air temperature at canopy level in the Waipara winegrowing region (New Zealand) over five growing seasons. These measurements allow the creation of fine-scale geostatistical models and maps of temperature (at 100 m resolution) for the growing season. In order to model climate change at pilot site scale, these geostatistical models have been combined with regional climate change predictions for the periods 2031-2050 and 2081-2100 based on the RCP8.5 climate change scenario. The integration of local climate variability with regionalized climate change simulations allows assessment of the impacts of climate change at the vineyard scale. The improved knowledge gained using this methodology results from the increased horizontal resolution that better addresses the concerns of winegrowers. The results provide the local winegrowers with information necessary to understand current processes, as well as historical and future viticulture trends at the scale of their site, thereby facilitating decisions about future response strategies.

Low-cost sensors as a support tool to monitor soil-plant heat exchanges in a Mediterranean vineyard

Mediterranean viticulture is increasingly exposed to more frequent extreme conditions such as heat waves. These extreme events co-occur with low soil water content, high air vapor pressure deficit and high solar radiant energy fluxes and result in leaf and berry sunburn, lower yield, and berry quality, which is a major constraint for the sustainability of the sector. Grape growers must find ways to proper and effectively manage heat waves and extreme canopy and berry temperatures. Irrigation to keep soil moisture levels and enable adequate plant turgor, and convective and evaporative cooling emerged as a key tool to overcome this major challenge. The effects of irrigation on soil and plant water status are easily quantifiable but the impact of irrigation on soil and canopy temperature and on heat convection from soil to cluster zone remain less characterized. Therefore, a more detailed quantification of vineyard heat fluxes is highly relevant to better understand and implement strategies to limit the effects of extreme weather events on grapevine leaf and berry physiology and vineyards performance. Low-cost sensor technologies emerge as an opportunity to improve monitoring and support decision making in viticulture. However, validation of low-cost sensors is mandatory for practical applicability. A two-year study was carried in a vineyard in Alentejo, south of Portugal, using low-cost thermal cameras (FLIR One, 80×60 pixels and FLIR C5, 160×120 pixels, 8-14 µm, FLIR systems, USA) and pocket thermohygrometers (Extech RHT30, EXTECH instruments, USA) to monitor grapevine and soil temperatures. Preliminary results show that low-cost cameras can detect severe water stress and support the evaluation of vertical canopy temperature variability, providing information on soil surface temperature. All these thermal parameters can be relevant for soil and crop management and be used in decision support systems.