Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account; they are inherently more comprehensive and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
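The first two steps described above can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several points are assumptions made for illustration only: the data layout (a samples × scans × m/z array), the fixed window width, the choice of transformation (an m/z × m/z cross-product matrix per sample and window, which does not depend on where inside the window a compound elutes), and the use of a truncated SVD of the sample-unfolded tensor as a simple stand-in for a true multiway decomposition. All function names are hypothetical.

```python
import numpy as np


def segment_chromatograms(data, segment_width):
    """Cut a (samples, scans, m/z) array into windows along the scan axis.

    The last window may be shorter than segment_width.
    """
    n_scans = data.shape[1]
    return [data[:, start:start + segment_width, :]
            for start in range(0, n_scans, segment_width)]


def transform_segment(segment):
    """Assumed transformation: one m/z x m/z cross-product matrix per sample.

    Summing over the scans of a window removes the dependence on where
    inside the window a compound elutes.
    """
    # result[i] equals segment[i].T @ segment[i]
    return np.einsum("itj,itk->ijk", segment, segment)


def decompose(tensor, n_components):
    """Stand-in for a multiway decomposition (assumption, not the paper's model).

    Truncated SVD of the sample-unfolded tensor; returns one row of
    scores per sample.
    """
    unfolded = tensor.reshape(tensor.shape[0], -1)
    unfolded = unfolded - unfolded.mean(axis=0)  # centre across samples
    u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
    k = min(n_components, s.size)
    return u[:, :k] * s[:k]


def build_feature_matrix(data, segment_width, n_components):
    """Concatenate the per-segment sample scores into one feature matrix."""
    blocks = [decompose(transform_segment(seg), n_components)
              for seg in segment_chromatograms(data, segment_width)]
    return np.hstack(blocks)


# Hypothetical toy data: 6 samples, 40 scans, 5 m/z channels.
rng = np.random.default_rng(0)
toy = rng.random((6, 40, 5))
features = build_feature_matrix(toy, segment_width=10, n_components=2)
print(features.shape)  # (6, 8): 4 segments x 2 components
```

In a full pipeline, a feature matrix of this kind together with the sample class labels would then be passed to a gradient boosted tree classifier (for instance scikit-learn's `GradientBoostingClassifier`), which is not shown here. Under the assumed transformation, two samples whose peaks are shifted within the same window yield identical features, which is the sense in which such a sketch sidesteps retention time alignment.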

To make this data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
