Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many different domains of analytical chemistry such as metabolomics, environmental and food analysis. In contrast to targeted analysis, non-targeted approaches take information of known and unknown compounds into account, are inherently more comprehensive and give a more holistic representation of the sample composition. 

Besides chromatographic techniques coupled to high resolution mass spectrometry such as LC-HRMS, gas chromatography with unit resolution mass spectrometry is still regularly utilized for non-targeted profiling or fingerprinting. This is mainly due to high separation power of GC and a wide availability and low costs of quadrupole mass spectrometers. 

Although several non-targeted approaches have been developed, data processing still remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment can be prone to errors and time-consuming manual corrections are often necessary. We therefore developed an automated strategy to non-targeted GC-MS data avoiding feature detection and retention time alignment. The novel automated approach includes segmentation of chromatograms along the retention time axis, multiway decomposition of transformed segments followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2]. 

In order to make this novel data analysis strategy available to scientists without programming background, we developed a convenient browser based application. For the here presented interactive browser application the open source Python packages Bokeh and HoloViews were used. The application will be online freely available soon. 

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Acta Chimica Acta 911 (2016) 42-58 
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’ Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Contact the author

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019

Citation

Related articles…

Climate change impacts on Douro Region viticulture and adaptation measures

Climate has a significant impact in the success of any agricultural system, with a direct influence on the crops suitability to a given region, interfering on yield and quality and also with the economic sustainability of the productive activity. In the Douro Demarcated Region (RDD), as in most regions of the Mediterranean climate, the scarce precipitation (33% has less than 600 mm per year), and your high variability, associated with high rates of evapotranspiration during the summer, is usually one of the fundamental factors that limit the grapevine development, as well as the production and quality of the harvest. Thus, facing the scenario in temperature changes for the next decades (1.5-2.5°C) and confirming the predictions of precipitation decreases and/or great variability in the occurrence of heat waves and intense rainfall, the consequences for slope stability in mountain viticulture and sustainability of all operations involved, are risks to be taken into account. In this way, a deepest and sustained knowledge regarding the adaptation measures to adverse environmental conditions is of a crucial importance, enabling a more efficient adaptation of plant growth conditions and the optimization of production and quality of the grapevines. The development of this work, carried out in two commercial vineyards, one located in Soutelo do Douro, São João da Pesqueira, Cima Corgo sub-region, and another located in Numão, Vila Nova de Foz Côa, Douro Superior sub-region, it seeks to establish a relationship between climatic elements and physiological, productive and qualitative parameters, as well as to evaluate the effectiveness of adaptation measures, including different types of deficit irrigation (2002-2019) and the application of shading nets (2019-2020) in the physiological, viticultural and oenological behavior in the Touriga Nacional and Moscatel Galego Branco varieties, respectively. The results showed that the application of deficit irrigation allowed to significantly reduce the impact of the adverse weather conditions at key moments in the development of the grapevine, particularly in the period immediately before veráison and maturation, reducing the negative effects on the physiological processes and productivity, without compromise the must quality parameters. On the other hand, the application of shading nets significantly reduced de leaves temperature, allowing to increase the water potential, stomatal conductance and photosynthetic rate of grapes, which was reflected in the yield increase in the 2nd year of the study. For the maturation indicators, higher levels of total acidity, malic acid and assimilable nitrogen were obtained. The last measure presents a huge potential, being essential to carry out more years of trials to obtain stronger conclusions in terms of production parameters, but also in characteristics as important as the grape ripening components and the organoleptic characteristics of wines.

Low-cost sensors as a support tool to monitor soil-plant heat exchanges in a Mediterranean vineyard

Mediterranean viticulture is increasingly exposed to more frequent extreme conditions such as heat waves. These extreme events co-occur with low soil water content, high air vapor pressure deficit and high solar radiant energy fluxes and result in leaf and berry sunburn, lower yield, and berry quality, which is a major constraint for the sustainability of the sector. Grape growers must find ways to proper and effectively manage heat waves and extreme canopy and berry temperatures. Irrigation to keep soil moisture levels and enable adequate plant turgor, and convective and evaporative cooling emerged as a key tool to overcome this major challenge. The effects of irrigation on soil and plant water status are easily quantifiable but the impact of irrigation on soil and canopy temperature and on heat convection from soil to cluster zone remain less characterized. Therefore, a more detailed quantification of vineyard heat fluxes is highly relevant to better understand and implement strategies to limit the effects of extreme weather events on grapevine leaf and berry physiology and vineyards performance. Low-cost sensor technologies emerge as an opportunity to improve monitoring and support decision making in viticulture. However, validation of low-cost sensors is mandatory for practical applicability. A two-year study was carried in a vineyard in Alentejo, south of Portugal, using low-cost thermal cameras (FLIR One, 80×60 pixels and FLIR C5, 160×120 pixels, 8-14 µm, FLIR systems, USA) and pocket thermohygrometers (Extech RHT30, EXTECH instruments, USA) to monitor grapevine and soil temperatures. Preliminary results show that low-cost cameras can detect severe water stress and support the evaluation of vertical canopy temperature variability, providing information on soil surface temperature. All these thermal parameters can be relevant for soil and crop management and be used in decision support systems.

Towards a regional mapping of vine water status based on crowdsourcing observations

Monitoring vine water status is a major challenge for vineyard management because it influences both yield and harvest quality. It is also a challenge at the territorial scale for identifying periods of high water restriction or zones regularly impacted by water stress. This information is of major importance for defining collective strategies, anticipating harvest logistic or applying for irrigation authorisation. At this spatial scale, existing tools and methods for monitoring vine water status are few and often require strong assumptions (e.g. water balance model). This paper proposes to consider a collaborative collection of observations by winegrowers and wine industry stakeholders (crowdsourcing) as an interesting alternative. Indeed, it allows the collection of a large number of field observations while pooling the collection effort. However, the feasibility of such a project and its interest in monitoring vine water status at regional scale has never been tested.

The objective of this article is to explore the possibility of making a regional map of vine water status based on crowdsourcing observations. It is based on the study of the free mobile application ApeX-Vigne, which allows the collection of observations about vine shoot growth. This information is easy to collect and can be considered, under certain conditions, as a proxy for vine water status. This article presents the first results obtained from the nearly 18,000 observations collected by winegrowers and wine industry stakeholders during 2019, 2020 and 2021 seasons. It presents the vine shoot growth maps obtained at regional scale and their evolution over the three vintages studied. It also proposes an analysis of the factors that favoured the number of observations collected and those that favoured their quality. These results open up new perspectives for monitoring vine water status at a regional scale but above they provide references for other crowdsourcing projects in viticulture.

Elucidating vineyard site contributions to key sensory molecules: Identification of correlations between elemental composition and volatile aroma profile of site-specific Pinot noir wines

The reproducibility of elemental profile in wines produced across multiple vintages has been previously reported using grapes from a single scion clone of Vitis vinifera L. cv. Pinot noir. The grapevines were grown on fourteen different vineyard sites, from Oregon to southern California in the U.S.A., which span distances from approximately hundreds of meters to 1450 km, while elevations range from near sea level to nearly 500 m. In addition, sensorial (i.e. aroma, taste, and mouthfeel) and chemical (i.e. polyphenolic and volatile) differences across the different vineyard sites have also been observed among these wines at two aging time points. While strong evidence exists to support that grapes grown in different regions can produce wines with unique chemical and sensorial profiles, even when a single clone is used, the understanding of growing site characteristics that result in this reproducible differentiation continues to emerge. One hypothesis is that the elemental profile that a vineyard site imparts to the grape berries and the resulting wine is an important contributor to this differentiation in chemistry and sensory of wines. For example, various classes of enzymes that catalyze the formation of key aroma compounds or their precursors require specific metals. In this work, we begin to report correlations between elemental and volatile aroma profiles of site-specific Pinot noir wines, made under standardized winemaking conditions, that have been previously shown to be distinguished separately by these chemical analyses.

Grapevine yield-gap: identification of environmental limitations by soil and climate zoning in Languedoc-Roussillon region (south of France)

Grapevine yield has been historically overlooked, assuming a strong trade-off between grape yield and wine quality. At present, menaced by climate change, many vineyards in Southern France are far from the quality label threshold, becoming grapevine yield-gaps a major subject of concern. Although yield-gaps are well studied in arable crops, we know very little about grapevine yield-gaps. In the present study, we analysed the environmental component of grapevine yield-gaps linked to climate and soil resources in the Languedoc Roussillon. We used SAFRAN data and IGP Pays d’Oc wine yields from 2010 to 2018. We selected climate and soil indicators proving to have a significant effect on average wine yield-gaps at the municipality scale. The most significant factors of grapevine yield were the Soil Available Water Capacity; followed by the Huglin Index and the Climatic Dryness Index. The Days of Frost; the Soil pH; and the Very Hot Days were also significant. Then, we clustered geographical zones presenting similar indicators, facilitating the identification of resources yield-gaps. We discussed the number of zones with the experts of IGP Pays d’Oc label, obtaining 7 zones with similar limitations for grapevine yield. Finally, we analysed the main resources causing yield-gaps and the grapevine varieties planted on each zone. Mapping grapevine resource yield-gaps are the first stage for understanding grapevine yield-gaps at the regional scale.