Fully automated non-targeted GC-MS data analysis

Abstract

Non-targeted analysis is applied in many domains of analytical chemistry, such as metabolomics, environmental analysis, and food analysis. In contrast to targeted analysis, non-targeted approaches take information on both known and unknown compounds into account, are inherently more comprehensive, and give a more holistic representation of the sample composition.

Besides chromatographic techniques coupled to high-resolution mass spectrometry, such as LC-HRMS, gas chromatography with unit-resolution mass spectrometry is still regularly used for non-targeted profiling or fingerprinting. This is mainly due to the high separation power of GC and the wide availability and low cost of quadrupole mass spectrometers.

Although several non-targeted approaches have been developed, data processing remains a serious bottleneck. Baseline correction, feature detection, and retention time alignment are prone to errors, and time-consuming manual corrections are often necessary. We therefore developed an automated strategy for non-targeted GC-MS data analysis that avoids feature detection and retention time alignment. The novel approach comprises segmentation of chromatograms along the retention time axis and multiway decomposition of the transformed segments, followed by a supervised machine learning pipeline based on gradient boosted tree classification on the decomposed tensor [1, 2].
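The segmentation and decomposition steps can be illustrated with a minimal Python sketch. It is not the published implementation: the function names, the fixed window width, the synthetic data, and the use of a truncated SVD on the unfolded segment as a simple stand-in for the multiway decomposition are all assumptions made for illustration. Unlike a dedicated multiway model, this placeholder does not by itself tolerate retention time shifts within a segment.

```python
import numpy as np


def segment_bounds(n_scans, width):
    """Return (start, stop) scan indices of consecutive retention-time windows."""
    return [(s, min(s + width, n_scans)) for s in range(0, n_scans, width)]


def segment_scores(data, width, n_components):
    """Decompose each retention-time segment and collect per-sample scores.

    data: array of shape (n_samples, n_scans, n_mz), one GC-MS run per sample.
    Stand-in decomposition (assumption): truncated SVD of the unfolded segment;
    the source only states that a multiway decomposition is used.
    """
    n_samples, n_scans, _ = data.shape
    blocks = []
    for start, stop in segment_bounds(n_scans, width):
        # Unfold the three-way segment to samples x (scans * m/z channels).
        unfolded = data[:, start:stop, :].reshape(n_samples, -1)
        u, s, _ = np.linalg.svd(unfolded, full_matrices=False)
        k = min(n_components, s.size)
        blocks.append(u[:, :k] * s[:k])  # per-sample scores for this segment
    return np.hstack(blocks)


# Synthetic example with made-up sizes: 6 runs, 100 scans, 20 m/z channels.
rng = np.random.default_rng(0)
runs = rng.random((6, 100, 20))
features = segment_scores(runs, width=25, n_components=2)
print(features.shape)
```

The resulting matrix has one row per sample and one block of score columns per segment. In a supervised setting, such a matrix could be passed together with class labels to a gradient boosted tree classifier.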

To make this novel data analysis strategy available to scientists without a programming background, we developed a convenient browser-based application. The interactive application presented here was built with the open-source Python packages Bokeh and HoloViews. It will soon be freely available online.

[1] J. Vestner, G. de Revel, S. Krieger-Weber, D. Rauhut, M. du Toit, A. de Villiers, Toward automated chromatographic fingerprinting: A non-alignment approach to gas chromatography mass spectrometry data. Analytica Chimica Acta 911 (2016) 42-58
[2] K. Sirén, U. Fischer, J. Vestner, Automated supervised learning pipeline for non-targeted GC-MS data analysis. Analytica Chimica Acta: X 1 (2019) 100005

DOI:

Publication date: June 19, 2020

Issue: OENO IVAS 2019

Type: Article

Authors

Jochen Vestner, Kimmo Sirén, Pierre Le Brun, Ulrich Fischer

Institute for Viticulture and Oenology, DLR Rheinpfalz, Breitenweg 71, D-67435 Neustadt, Germany
Institut National Supérieur des Sciences Agronomiques de l’Alimentation et de l’Environnement, Agrosup Dijon, 6 boulevard Docteur Petitjean, 21000 Dijon, France
Department of Chemistry, University of Kaiserslautern, Erwin-Schroedinger-Strasse 52, D-67663 Kaiserslautern

Keywords

metabolomics, non-targeted, GC-MS, exploratory data analysis 

Tags

IVES Conference Series | OENO IVAS 2019
