Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Spatial variability of temperature is linked to grape composition variability in the Saint-Emilion winegrowing area

Elevated temperature during the grape maturation period is a major threat for grape quality and thus wine quality. Therefore, characterizing the grape composition response to temperature at a larger scale would represent a crucial step towards adaptation to climate change. In response to changes in temperature, various physiological mechanisms regulate grape composition. Primary and secondary metabolisms are both involved in this response, with well-known effects, for example on anthocyanins, and lesser known effects, for example on aromas or aroma precursors. At the field scale or at the regional scale, however, numerous environmental or plant-specific factors intervene to make the effects of temperature difficult to distinguish from overall variability. In this study, it was attempted to overcome this difficulty by selecting well-characterized situations with differing temperatures.
A long-term study of air temperature variability across several Merlot vineyards in the Saint-Emilion and Pomerol wine producing area found significant temperature differences and gradients at various time scales linked to environmental factors. From this study area, a few sites were selected with similar age, soil and training system conditions, and with repeated and contrasted temperature differences during the maturation period. The average temperature difference during the maturation period was about 2°C between cooler and warmer sites, a difference similar to that expected under future climate change scenarios. In close vicinity to the temperature sensors at each site, grape berries were sampled at different times until full maturity during 2019 and 2020. Also, berries from bunches on either side of the row were analyzed separately, allowing an investigation of bunch exposure effect associated with the coupling of berry temperature and solar radiation. Four replicates of pooled berries for each time – site – bunch exposure combination were obtained and analyzed for biochemical composition. Analyses of variance of the biochemical composition data collected at different sampling times reveal significant effects associated with temperature, site, and bunch azimuth. For instance, anthocyanins in grape skins are clearly influenced by temperature and solar radiation exposure, with up to 30% reduction in warmer conditions.

Long-term drought resilience of traditional red grapevine varieties from a semi-arid region

In recent decades, the scarcity of water resources in agriculture in certain areas has been aggravated by climate change, which has caused an increase in temperatures, changes in rainfall patterns, as well as an increase in the frequency of extreme phenomena such as droughts and heat waves. Although the vine is considered a drought-tolerant specie, it has to satisfy important water requirements to complete its cycle, which coincides with the hottest and driest months. Achieving sustainable viticulture in this scenario requires high levels of efficiency in the use of water, a scarce resource whose use is expected to be severely restricted in the near future. In this regard, the use of drought-tolerant varieties that are able to maintain grape yield and quality could be an effective strategy to face this change. During three consecutive seasons (2018-2020) the behavior in rainfed regime of 13 traditional red grapevine varieties of the Spain central region was studied. These varieties were cultivated in a collection at Centro de Investigación de la Vid y el Vino de Castilla-La Mancha (IVICAM-IRIAF) located in Tomelloso (Castilla-La Mancha, Spain). Yield components (yield, mean bunch and berry weight, pruning weight), physicochemical parameters of the musts (brix degree, total acidity, pH) and some physiological parameters related with water stress during ripening period (δ13C, δ18O) were analysed. The application of different statistical techniques to the results showed the existence of significant differences between varieties in their response to stressful conditions. A few varieties highlighted for their high ability to adapt to drought, being able to maintain high yields due to their efficiency in the use of water. In addition, it was possible quantify to what extent climate can be a determinant in the δ18O of musts under severe water stress conditions.

Characterization of variety-specific changes in bulk stomatal conductance in response to changes in atmospheric demand and drought stress

In wine growing regions around the world, climate change has the potential to affect vine transpiration and overall vineyard water use due to related changes in atmospheric demand and soil water deficits. Grapevines control their transpiration in response to a changing environment by regulating conductance of water through the soil-plant-atmosphere continuum. Most vineyard water use models currently estimate vine transpiration by applying generic crop coefficients to estimates of reference evapotranspiration, but this does not account for changes in vine conductance associated with water stress, nor differences thought to exist between varieties. The response of bulk stomatal conductance to daily weather variability and seasonal drought stress was studied on Cabernet-Sauvignon, Merlot, Tempranillo, Ugni blanc, and Semillon vines in a non-irrigated vineyard in Bordeaux France. Whole vine sap flow, temperature and humidity in the vine canopy, and net radiation absorbed by the vine canopy were measured on 15-minute intervals from early July through mid-September 2020, together with periodic measurement of leaf area, canopy porosity, and predawn leaf water potential. From this data, bulk stomatal conductance was calculated on 15-minute intervals, and multiple regression analysis was performed to identify key variables and their relative effect on conductance. Attention was focused on addressing multicollinearity and time-dependency in the explanatory variables and developing regression models that were readily interpretable. Variability of vapor pressure deficit over the day, and predawn water potential over the season explained much of the variability in conductance, with relative differences in response coefficients observed across the five varieties. By characterizing this conductance response, the dynamics of vine transpiration can be better parameterized in vineyard water use modeling of current and future climate scenarios.

How can historical cultivars mitigate the effects of climate change?

IFV, INRAe and the national network “Partenaires de la Sélection Vigne” representing 37 organizations from the different wine regions, have been working increasingly closely over the last 2 decades towards the preservation of the French varietal patrimony. There are approximately 600 patrimonial varieties according to INRAe and SupAgro Montpellier experts, including ancient cultivars (400) and intravarietal crossbreeds obtained since the 19th century. In the context of a drastic reduction in such varieties from the mid 1980’s in favor of mainstream varieties, it was essential to carry out an inventory of old vines and vineyards. INRAe Vassal collection plays a key role here as it holds the largest diversity available, along with a rich bibliography and herbariums, offering us the opportunity to document and double check the identity of a cultivar, consolidating the expertise of ampelographers. The work is carried out in several stages, from verifying the existence of a variety in a small region, through to rehabilitation. During this session, the authors present the process that leads to the official registration of a variety. After this, IFV selection center takes over to initiate the process of selection and propagation. A specific focus within regions such as the Alps, Champagne and the South-West will provide details of the full procedure. Bia, Bouysselet, Chardonnay rose, Mecle and the aptly named Tardif, are some of the cultivars that have followed this procedure. Furthermore, a recent regulation established by INAO on “varieties of interest for adaptation purposes” might boost uptake by growers. Since 2006, 36 historical cultivars have been registered. Most of these have been neglected in the past due to late maturity, lack of sugar and high titratable acidity at harvest time. Such characteristics are today considered as positive qualities, not only in mitigation of the effects of climate change, but also as an opportunity for restoring diversity…

Sustainable fertilisation of the vineyard in Galicia (Spain)

Excessive fertilization of the vineyard leads to low quality grapes, increased costs and a negative impact on the environment. In order to establish an integrated management system aimed at a sustainable fertilization of the vineyards, nutritional reference levels were established. For this purpose, 30 representative vineyards of the Albariño variety were studied, in which soil and petiole analyses were carried out for two years and grape yield and quality at harvest were measured. In both years of study, soil pH, calcium, sodium and cation exchange capacity were positively correlated with calcium content and negatively correlated with manganese in grapes. Irrigated vineyards had higher levels of aluminium in soil and lower levels of calcium in petiole. Climatic conditions were very different in the years of the study. The year 2019 was colder than usual, in 2020 there was a marked water stress with high summer temperatures. This resulted in medium-high acidity in grapes in 2019 and low acidity in 2020, with sugar levels being similar both years. A very marked decrease in must amino nitrogen was observed in 2020, with ammonia nitrogen remaining stable. The correlation of acidity and sugar values in grapes with soil and petiole analysis data made it possible to establish reference levels for the nutritional diagnosis of the Albariño variety in this region. Based on these results, an easy-to-use TIC application is currently being created for grapegrowers, aimed at improving the sustainability of the vineyard through reasoned fertilization. This study has now been extended to other Galician vine varieties.