Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Mechanisms involved in the heating of the environment by the aerodynamic action of a wind machine to protect a vineyard against spring frost

One of the main consequences of global warming is the rise of the mean temperature. Thus, the heat summation by the plants begins sooner in the early spring, and by cumulating growing degree-days, phenological development tends to happen earlier. However, spring frost is still a recurrent phenomenon causing serious damages to buds and therefore, threatening the harvests of the winegrowers. The wind machine is a solution to protect fruit crops against spring frost that is increasingly used. It is composed of a 10-m mast with a blowing fan at its peak. By tapping into the strength of the nocturnal thermal inversion, it sweeps the crop by propelling warm air above to the ground. Thus, stratification is momentarily suppressed. Furthermore, the continuous action of the machine, alone or in synergy, or the addition of a heater allow the bud to be bathed in a warmer environment. Also, the punctual action of the tower’s warm gust reaches the bud directly at each rotation period. All these actions allow the bud to continuously warm up, but with different intensities and over a different period. Although there is evidence of the effectiveness of the wind machines, the thermal transfers involved in those mechanisms raise questions about their true nature. Field measurements based on ultrasonic anemometers and fast responding thermocouples complemented by laboratory measurements on a reduced scale model allow to characterize both the airflow produced by the wind machine and the local temperature in its vicinity. Those experiments were realized in the vineyard of Quincy, in the framework of the SICTAG project. In the future paper, we will detail the aeraulic characterization of the wind machine and the thermal effects resulting from it and we will focus on how the wind machine warms up the local atmosphere and enables to reduce the freezing risk.

Climate projections over France wine-growing region and its potential impact on phenology

Climate change represents a major challenge for the French wine industry. Climatic conditions in French vineyards have already changed and will continue to evolve. One of the notable effects on grapevine is the advancing growing season. The aim of this study is to characterise the evolution of agroclimatic indicators (Huglin index, number of hot days, mean temperature, cumulative rainfall and number of rainy days during the growing season) at French wine-growing regions scale between 1980 and 2019 using gridded data (8 km resolution, SAFRAN) and for the middle of the 21th century (2046-2065) with 21 GCMs statistically debiased and downscaled at 8 km. A set of three phenological models were used to simulate the budburst (BRIN, Smoothed-Utah), flowering, veraison and theoretical maturity (GFV and GSR) stages for two grape varieties (Chardonnay and Cabernet-Sauvignon) over the whole period studied. All the French wine-growing regions show an increase in both temperatures during the growing season and Huglin index. This increase is accompanied by an advance in the simulated flowering (+3 to +9 days), veraison (+6 to +13 days) and theoretical maturity (+6 to +16 days) stages, which are more noticeable in the north-eastern part of France. The climate projections unanimously show, for all the GCMs considered, a clear increase in the Huglin index (+662 to 771 °C.days compared to the 1980-1999 period) and in the number of hot days (+5.6 to 22.6 days) in all the wine regions studied. Regarding rainfall, the expected evolution remains very uncertain due to the heterogeneity of the climates simulated by the 21 models. Only 4 regions out of 21 have a significant decrease in the number of rainy days during the growing season. The two budburst models show a strong divergence in the evolution of this stage with an average difference of 18 days between the two models on all grapevine regions. The theoretical maturity is the most impacted stage with a potential advance between 40 and 23 days according to wine-growing regions.

Photoselective shade films affect grapevine berry secondary metabolism and wine composition

Grapevine physiology and production are challenged by forecasted increases in temperature and water deficits. Within this scenario, photoselective overhead shade films are promising tools in warm viticulture areas to overcome climate change related factors. The aim of this study was to evaluate the vulnerability of ‘Cabernet Sauvignon’ grape berry to solar radiation overexposure and optimize shade film use for berry integrity. A randomized complete block design field study was conducted across two years (2020-2021) in Oakville, Napa Valley, CA, with four shade films (D1, D3, D4, D5) differing in the percent of radiation spectra transmitted and compared to an uncovered control (C0). Integrals for gas exchange parameters and mid-day stem water potential were unaffected by the shade films in 2020 and 2021. By harvest, berries from uncovered and shaded vines did not differ in their size or primary metabolism in either year. Despite precipitation exclusion during the dormant season in the shaded treatments, yield did not differ between them and the control in either season. In 2020, total skin anthocyanins (mg/g fresh mass) in the shaded treatments was greater than C0 during berry ripening and at harvest. Conversely, flavonol concentrations in 2020 were reduced in shaded vines compared to C0. The 2020 growing season highlighted the impact of heat degradation on flavonoids. Flavonoid concentrations in 2021 increased until harvest while flavonoid degradation was apparent from veraison to harvest in 2020 across shaded and control vines. Wine analyses highlighted the importance of light spectra to modify wine composition. Wine color intensity, tonality and anthocyanin values were enhanced in D4 whereas antioxidant properties were enhanced in C0 and D5 wines. Altogether, our results highlighted the need of new approaches in warm viticulture areas given the impact that composition of light has on berry and wine quality.

Terroir analysis and its complexity

Terroir is not only a geographical site, but it is a more complex concept able to express the “collective knowledge of the interactions” between the environment and the vines mediated through human action and “providing distinctive characteristics” to the final product (OIV 2010). It is often treated and accepted as a “black box”, in which the relationships between wine and its origin have not been clearly explained. Nevertheless, it is well known that terroir expression is strongly dependent on the physical environment, and in particular on the interaction between soil-plant and atmosphere system, which influences the grapevine responses, grapes composition and wine quality. The Terroir studying and mapping are based on viticultural zoning procedures, obtained with different levels of know-how, at different spatial and temporal scales, empiricism and complexity in the description of involved bio-physical processes, and integrating or not the multidisciplinary nature of the terroir. The scientific understanding of the mechanisms ruling both the vineyard variability and the quality of grapes is one of the most important scientific focuses of terroir research. In fact, this know-how is crucial for supporting the analysis of climate change impacts on terroir resilience, identifying new promised lands for viticulture, and driving vineyard management toward a target oenological goal. In this contribution, an overview of the last findings in terroir studies and approaches will be shown with special attention to the terroir resilience analysis to climate change, facing the use and abuse of terroir concept and new technology able to support it and identifying the terroir zones.

Spatiotemporal patterns of chemical attributes in Vitis vinifera L. cv. Cabernet Sauvignon vineyards in Central California

Spatial variability of vine productivity in winegrapes is important to characterise as both yield and quality are relevant for the production of different wine styles and products. The objectives were to understand how patterns of variability of Cabernet Sauvignon fruit composition changed over time and space, how these patterns could be characterised with indirect measurements, and how spatial patterns of the variation in fruit compositional attributes can aid in improving management. Prior to the 2017 vintage, 125 data vines were distributed across each of four vineyards in the Lodi American Viticultural Area (AVA) of California. Each data vine was sampled at commercial harvest in 2017, 2018, and 2019. Yield components and fruit composition were measured at harvest for each data vine, and maps of yield and fruit composition were produced for eight ‘objective measures of fruit quality’: total anthocyanins, polymeric tannins, quercetin glycosides, malic acid, yeast assimilable nitrogen, β-damascenone, C6 alcohols and aldehydes, and 3-isobutyl-2-methoxypyrazine. Patterns of variation in anthocyanins and phenolic compounds were found to be most stable over time. Given this relative stability, management decisions focused on fruit quality could be based on zonal descriptions of anthocyanins or phenolics to increase profitability in some vineyards. In each vineyard, dormant season pruning weights and soil cores were collected at each location, elevation and soil apparent electrical conductivity surveys were completed, and remotely sensed imagery was captured by fixed wing aircraft and two satellite platforms at major phenological stages. The data collected were used to develop relationships among biophysical data, soil, imagery, and fruit composition. The standardised and aggregated samples from four vineyards over three seasons were included in the estimation of ‘common variograms’ to assess how this technique could aid growers in producing geostatistically rigorous maps of fruit composition variability without cumbersome, single season sampling efforts.