Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
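The segmentation and decomposition steps can be illustrated with a minimal sketch. It is not the authors' implementation: the data layout (samples × scans × m/z), the fixed window width, the function names `segment`, `cp_als` and `segment_features`, and the use of a plain CP/PARAFAC model fitted by alternating least squares are all assumptions made for illustration. The multiway model and the segment transformation used in [1, 2] may differ, and the transformation step is omitted here.

```python
import numpy as np


def segment(tensor, width):
    """Split a (samples, scans, m/z) array into consecutive windows of
    `width` scans along the retention time (scan) axis.
    The last window may be shorter than `width`."""
    n_scans = tensor.shape[1]
    return [tensor[:, start:start + width, :] for start in range(0, n_scans, width)]


def khatri_rao(U, V):
    """Column-wise Kronecker product of U (I x R) and V (J x R), shape (I*J, R)."""
    return np.einsum("ir,jr->ijr", U, V).reshape(-1, U.shape[1])


def cp_als(X, rank, n_iter=200, seed=0):
    """Fit a rank-`rank` CP model X[i,j,k] ~ sum_r A[i,r] B[j,r] C[k,r]
    by alternating least squares (illustrative stand-in, no convergence check)."""
    rng = np.random.default_rng(seed)
    I, J, K = X.shape
    A = rng.random((I, rank))
    B = rng.random((J, rank))
    C = rng.random((K, rank))
    # Unfold the array once per mode; column order matches khatri_rao above.
    X0 = X.reshape(I, J * K)
    X1 = np.moveaxis(X, 1, 0).reshape(J, I * K)
    X2 = np.moveaxis(X, 2, 0).reshape(K, I * J)
    for _ in range(n_iter):
        # Each update is the least-squares solution for one factor matrix
        # with the other two held fixed.
        A = X0 @ khatri_rao(B, C) @ np.linalg.pinv((B.T @ B) * (C.T @ C))
        B = X1 @ khatri_rao(A, C) @ np.linalg.pinv((A.T @ A) * (C.T @ C))
        C = X2 @ khatri_rao(A, B) @ np.linalg.pinv((A.T @ A) * (B.T @ B))
    return A, B, C


def segment_features(tensor, width, rank):
    """Decompose every segment and stack the sample-mode scores into one
    (samples x n_segments*rank) feature matrix."""
    scores = [cp_als(seg, rank)[0] for seg in segment(tensor, width)]
    return np.hstack(scores)
```

In a sketch like this, the matrix returned by `segment_features` would serve as input to a gradient boosted tree classifier, for example scikit-learn's `GradientBoostingClassifier`, with the sample classes as labels. Because each segment is decomposed on its own, no peak picking or retention time alignment across samples is needed before classification.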

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Related articles…

δ13C : A still underused indicator in precision viticulture  

The first demonstration of the value of the carbon isotope composition of grapevine sugars as an integrated indicator of vineyard water status dates back to 2000 (Gaudillère et al., 1999; Van Leeuwen et al., 2001). Thanks to the isotopic discrimination of carbon that takes place during plant photosynthesis under hydric stress conditions, it is possible to accurately estimate photosynthetic activity. Since then, δ13C has been applied successfully to zonation, terroir studies and vine physiology research, but it is still not widely used by viticulturists. This is surprising given the impact of global warming on viticulture and the need to improve water management, both of which would justify widespread use of δ13C.
The lack of private laboratories offering the analysis, the cost of the technology and the long analytical delays have hindered its development. Some laboratories have tried to overcome the analytical difficulties of isotopic analysis by using Fourier-transform infrared (FTIR) spectroscopy as a fast and cheap alternative to the official OIV method (IRMS). These claimed FTIR models have never been published or peer reviewed and cannot be considered robust. In this work, thanks to the recent acquisition of IRMS technology, new and robust applications of δ13C for viticulture are proposed. These include using the analysis to separate parcels at harvest, increasing the precision of hydric stress mapping, and reducing costs compared with Scholander pressure bomb analysis.

Low-cost sensors as a support tool to monitor soil-plant heat exchanges in a Mediterranean vineyard

Mediterranean viticulture is increasingly exposed to extreme conditions such as heat waves. These extreme events co-occur with low soil water content, high air vapor pressure deficit and high solar radiant energy fluxes, and they result in leaf and berry sunburn and in lower yield and berry quality, which is a major constraint on the sustainability of the sector. Grape growers must find ways to properly and effectively manage heat waves and extreme canopy and berry temperatures. Irrigation, which maintains soil moisture levels and enables adequate plant turgor as well as convective and evaporative cooling, has emerged as a key tool to overcome this major challenge. The effects of irrigation on soil and plant water status are easily quantifiable, but the impact of irrigation on soil and canopy temperature and on heat convection from the soil to the cluster zone remains less well characterized. Therefore, a more detailed quantification of vineyard heat fluxes is highly relevant to better understand and implement strategies that limit the effects of extreme weather events on grapevine leaf and berry physiology and on vineyard performance. Low-cost sensor technologies are emerging as an opportunity to improve monitoring and support decision making in viticulture. However, validation of low-cost sensors is mandatory for practical applicability. A two-year study was carried out in a vineyard in Alentejo, southern Portugal, using low-cost thermal cameras (FLIR One, 80×60 pixels, and FLIR C5, 160×120 pixels, 8-14 µm, FLIR Systems, USA) and pocket thermohygrometers (Extech RHT30, Extech Instruments, USA) to monitor grapevine and soil temperatures. Preliminary results show that low-cost cameras can detect severe water stress and support the evaluation of vertical canopy temperature variability, while also providing information on soil surface temperature. All these thermal parameters can be relevant for soil and crop management and can be used in decision support systems.

Exploring resilience and competitiveness of wine estates in Languedoc-Roussillon in the recent past: a multi-level perspective

The Languedoc-Roussillon wineries are facing a decline in wine yields, particularly PGI yields, due to many factors. Climate change is just one of them, but its influence is expected to increase in the future. There is also a large structural heterogeneity of yield profiles among terroirs, varieties and strategies. This work investigates the link between yield, competitiveness and resilience to explore how resilient winegrowers have been in the recent past. To this end, two approaches have been combined: (i) an accountancy database analysis at estate scale and (ii) a competitiveness analysis at municipality level. A new resilience indicator that characterizes the capacity of an estate to absorb yield variation is also defined. The FADN database for the former Languedoc-Roussillon region (France) between 2000 and 2018 and other data are used to analyse the current situation and the past evolution of competitiveness and resilience by type of estate (type of farm: PGI and/or PDO; type of commercialization: bulk and/or bottles). The net margin, which defines competitiveness, is not correlated with yield for all types but depends on the type of commercialization and the level of specialisation. The resilience indicator shows that the net margin of estates specialized in PGI is particularly sensitive to yield declines. We also show that price evolutions seem to compensate for the effect of yield losses for the majority of types. The municipality-scale analysis shows the links between local pedoclimate, yield, commercialization strategies and price. Overlapping a PDO with a PGI does not always increase a municipality’s PGI competitiveness. It is difficult to link causes and effects because of the complexity of the wine production system. Production diversification may be a solution. Combining the two levels of analysis helps to fill the data gap that must be closed in order to explore the long-term links between yield and the economic performance of wine estates.

A predictive model of spatial Eca variability in the vineyard to support the monitoring of plant status

VINIoT – Precision viticulture service

The VINIoT project aims to create a new technological vineyard monitoring service that will allow companies in the wine sector in the SUDOE space to monitor plantations remotely and in real time at various levels of precision. A system based on spectral images and an IoT architecture, which allows the assessment of parameters of interest to viticulture and the collection of data at a precise scale (grape, plant, plot or vineyard level), will be designed. In France, three subjects were specifically developed: evaluation of maturity, evaluation of water stress, and detection of flavescence dorée. For the evaluation of maturity, it was decided to work first at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the hyperspectral images, as well as the reference analyses used to measure maturity, were carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict the sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022) to improve prediction model performance, was applied to the spectra after the acquisition of the hyperspectral images. For the evaluation of water stress, in order to work with significant variability in water status, work was first carried out on potted plants under two different water regimes. The facilities allowed irrigation and micro-climatic conditions to be supervised. Regression models on agronomic variables (stomatal conductance, water potential, …) are studied. To detect flavescence dorée, the experimental plan consisted of work at leaf scale, first in the laboratory and then in the field. To detect the disease from hyperspectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy showed potential for discriminating healthy leaves from leaves infected by flavescence dorée on the basis of hyperspectral images (Mas Garcia et al., 2021).