Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics and environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
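The first two steps of such a workflow can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the cited papers: the data are synthetic, the function names (`segment_chromatograms`, `segment_scores`, `build_feature_table`) are hypothetical, and a truncated SVD of the sample-mode unfolding is used as a simple stand-in for the multiway decomposition. The resulting per-sample score table is the kind of input that a gradient boosted tree classifier could then be trained on; that step is not shown here.

```python
import numpy as np


def segment_chromatograms(data, n_segments):
    """Split a (samples, scans, m/z) array into windows along the retention time axis."""
    return np.array_split(data, n_segments, axis=1)


def segment_scores(segment, n_components):
    """Per-sample scores from a truncated SVD of the sample-mode unfolding.

    Assumption: this is a stand-in for a true multiway decomposition.
    """
    n_samples = segment.shape[0]
    unfolded = segment.reshape(n_samples, -1)    # samples x (scans * m/z)
    unfolded = unfolded - unfolded.mean(axis=0)  # centre each variable across samples
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def build_feature_table(data, n_segments=4, n_components=2):
    """Concatenate the per-segment scores into one samples x features table."""
    segments = segment_chromatograms(data, n_segments)
    return np.hstack([segment_scores(seg, n_components) for seg in segments])


# Synthetic stand-in for real GC-MS data: 6 samples, 40 scans, 10 m/z channels.
rng = np.random.default_rng(0)
data = rng.random((6, 40, 10))
features = build_feature_table(data)
print(features.shape)  # (6, 8)
```

Because every sample is cut at the same fixed retention time windows and decomposed per window, no peak picking or alignment step appears anywhere in the sketch; small retention time shifts within a window would have to be absorbed by the decomposition itself.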

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
