Aroma and quality assessment for vertical vintages using machine learning modelling based on weather and management information

Abstract

Context and purpose of the study ‐ Wine quality traits are usually given by parameters such as aroma profile, total acidity, alcohol content, colour and phenolic content, among others. These are highly dependent on the weather conditions during the growing season and management strategies. Therefore, it is important to develop predictive models using machine learning (ML) algorithms to assess and predict wine quality traits before the winemaking process.

Material and methods ‐ Duplicate samples of Pinot Noir wines from vertical vintages (2008 to 2013) of a single winery in the Macedon Ranges, Victoria, Australia, were analysed for i) aromas using gas chromatography – mass spectrometry, ii) colour density, iii) colour hue, iv) degree of red pigmentation, v) total red pigments, vi) total phenolics, vii) pH, viii) total acidity (TA), and ix) alcohol content. Weather data for the specific vintages were obtained from the Bureau of Meteorology (BoM) and the Australian Water Availability Project (AWAP) climate databases. These data consisted of: i) solar exposure from veraison to harvest (V‐H), ii) solar exposure from September to harvest (S‐H), iii) maximum January solar exposure, iv) degree days from S‐H, v) maximum January evaporation, vi) mean maximum temperature from V‐H, vii) mean minimum temperature from V‐H, viii) water balance from S‐H, ix) solar exposure from V‐H, x) degree hour accumulation with base 25 – 30 °C, xi) degree hour accumulation with base 25 °C, xii) degree hour accumulation with base 30 °C, xiii) degree hour accumulation with base 35 °C, and xiv) total cumulative degree days accumulation with base 10 °C. All data were used to develop two machine learning (ML) regression models in Matlab® R2018b. The best models were artificial neural networks (ANN) trained with the Levenberg‐Marquardt algorithm, with 5 neurons for Model 1 and 9 neurons for Model 2. Model 1 used the 14 weather parameters as inputs to predict 21 aromas found in the wines from the six vintages. Model 2 used the same 14 weather parameters as inputs and the eight chemical parameters as targets.
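The degree day and degree hour inputs listed above are thermal accumulations over a base temperature. The abstract does not state how they were computed, so the following is only a minimal Python sketch under assumptions: degree days use the simple daily averaging method, degree hours sum the hourly excess above the base, and all function names and temperature values are hypothetical.

```python
def growing_degree_days(daily_min_max, base_c=10.0):
    """Sum daily degree days above base_c (simple averaging method, assumed)."""
    total = 0.0
    for t_min, t_max in daily_min_max:
        mean_t = (t_min + t_max) / 2.0
        # Days whose mean temperature is below the base contribute nothing.
        total += max(0.0, mean_t - base_c)
    return total


def degree_hours(hourly_temps, base_c):
    """Sum the hourly excess of temperature above base_c."""
    return sum(max(0.0, t - base_c) for t in hourly_temps)


# Hypothetical three-day record of (min, max) temperatures in °C.
days = [(8.0, 20.0), (12.0, 28.0), (5.0, 13.0)]
gdd = growing_degree_days(days)  # base 10 °C

# Hypothetical hourly temperatures (°C) for one hot afternoon.
hours = [24.0, 27.5, 31.0, 36.0, 29.0]
dh_by_base = {base: degree_hours(hours, base) for base in (25.0, 30.0, 35.0)}

print(gdd, dh_by_base)
```

Computing one accumulation per base temperature (25, 30 and 35 °C) mirrors the separate heat-stress inputs in the list; the 25 – 30 °C band would need its own definition, which the abstract does not give.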

Results ‐ Both models predicted the wine quality trait parameters with very high accuracy. Model 1 had an overall correlation coefficient R = 0.99 and a mean squared error MSE = 0.01, while Model 2 had R = 0.98 and MSE = 0.03. These models would aid in predicting wine quality traits before production, giving winemakers advance information about the product they are likely to obtain so that they can make early decisions on wine style.
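For readers unfamiliar with the two reported metrics, the sketch below shows how a correlation coefficient R and a mean squared error are typically computed from observed targets and model predictions. It is illustrative only: the values are hypothetical, and the abstract does not say whether the reported MSE refers to normalised or raw data.

```python
import math


def mean_squared_error(targets, predictions):
    """Average squared difference between targets and predictions."""
    return sum((t - p) ** 2 for t, p in zip(targets, predictions)) / len(targets)


def pearson_r(targets, predictions):
    """Pearson correlation coefficient between targets and predictions."""
    n = len(targets)
    mean_t = sum(targets) / n
    mean_p = sum(predictions) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(targets, predictions))
    var_t = sum((t - mean_t) ** 2 for t in targets)
    var_p = sum((p - mean_p) ** 2 for p in predictions)
    return cov / math.sqrt(var_t * var_p)


# Hypothetical normalised targets and model outputs.
targets = [0.10, 0.40, 0.35, 0.80]
predictions = [0.12, 0.38, 0.30, 0.85]

mse = mean_squared_error(targets, predictions)
r = pearson_r(targets, predictions)
print(f"R = {r:.3f}, MSE = {mse:.5f}")
```

R measures how closely predictions track the targets in a linear sense, while MSE measures the size of the errors, so the two are usually reported together.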

DOI:

Publication date: June 22, 2020

Issue: GiESCO 2019

Type: Article

Authors

Sigfredo FUENTES, Claudia GONZALEZ VIEJO, Xiaoyi WANG, Damir D. TORRICO

School of Agriculture and Food, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, VIC 3010, Australia

Keywords

wine quality, machine learning, weather, aromas

Tags

GiESCO 2019 | IVES Conference Series

Related articles…

The modification of cultural practices in grapevine cv. Syrah, does it modify the characteristics of the musts?

This work presents the results of one year of experimentation (2020) in a Syrah vineyard in La Roda (Castilla-La Mancha, Spain). The trial used a randomized block design with two factors: Irrigation (I) and Pruning (P).
Irrigation schedules were adjusted to apply amounts close to 1,500 m³/ha. Within this allowance, two irrigation treatments were tested: I1) irrigation from pea-sized berries to post-harvest (with at least 20 % of the total irrigation water applied post-harvest); I2) irrigation from pea-sized berries to harvest (the usual irrigation practice in the study area). Two pruning treatments were tested: P1) pruning at the end of January, which is the conventional date; and P2) pruning at the beginning of budbreak. The design comprised 4 replicates of 4 elementary plots, each plot representing one of the treatment combinations (I1P1; I1P2; I2P1; I2P2). In total, 16 plots were studied, each consisting of 30 vines distributed in 3 rows.
The productive response was evaluated from the yield of grapes harvested at 23 °Brix. The qualitative response was measured in the musts through indices of technological maturity (acidity, pH and potassium) and phenolic maturity, and through aromatic compounds in the free and glycosylated fractions. In general, the treatments tested had an effect on the variables analysed.

Sustaining wine identity through intra-varietal diversification

With contemporary climate change, cultivated Vitis vinifera L. is at risk, as climate is a critical component in defining ecologically fitted plant material. While winegrowers can draw on the rich diversity among grapevine varieties to limit expected impacts (Morales-Castilla et al., 2020), replacing a signature variety that has created a sense of local distinctiveness may lead to several challenges. To sustain wine identity under uncertain climate outcomes, the study of intra-varietal diversity is important because it reflects the adaptive and evolutionary potential of currently cultivated varieties. The aim of this ongoing study is to understand to what extent intra-varietal diversity can be a climate change adaptation solution. With a focus on early (Sauvignon blanc, Riesling, Grolleau, Pinot noir) to moderately late (Chenin, Petit Verdot, Cabernet franc) ripening varieties, data were collected on flowering and veraison for the studied accessions (from conservatory plots) and clones. For these phenological stages, heat requirements were established using nearby weather stations (adapted from the GFV model, Parker et al., 2013) and model performances were verified. Climate change projections were then integrated to predict the future behaviour of the intra-varietal diversity. Study findings highlight the strong phenotypic diversity of the studied varieties and the importance of diversification to enhance climate change resilience. While model performances may require improvement, this study is a first step towards quantifying the heat requirements of different clones and how they can provide adaptation solutions for winegrowers to sustain local wine identity in a globally changing climate. As genetic diversity is an ongoing process through point mutations and epigenetic adaptations, future work will explore clonal data from a wide variety of geographic locations.
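A heat requirement of this kind can be read as a thermal sum that a clone must accumulate before reaching a stage. The sketch below is a generic illustration, not the study's adapted model: the base temperature of 0 °C and the start on day 60 of the year are assumptions modelled on how the GFV approach is commonly described, and the heat requirement, temperatures and function name are hypothetical.

```python
def predicted_stage_day(daily_mean_temps, heat_requirement, base_c=0.0, start_day=60):
    """Return the day of year on which the thermal sum first reaches
    heat_requirement, or None if it is never reached.

    daily_mean_temps[0] is taken to be day 1 of the year.
    """
    total = 0.0
    for day_of_year, temp in enumerate(daily_mean_temps, start=1):
        if day_of_year < start_day:
            continue  # accumulation has not started yet
        total += max(0.0, temp - base_c)
        if total >= heat_requirement:
            return day_of_year
    return None


# Hypothetical year with a constant 15 °C daily mean, and the same year
# under a uniform +2 °C warming scenario.
current = [15.0] * 365
warmer = [t + 2.0 for t in current]

# Hypothetical heat requirement for one clone (degree days).
requirement = 1500.0
day_current = predicted_stage_day(current, requirement)
day_warmer = predicted_stage_day(warmer, requirement)
print(day_current, day_warmer)
```

Running the same requirement against a warmer temperature series shows the stage arriving earlier, which is the mechanism by which clones with different heat requirements could spread phenology under projected climates.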

Grape berry size is a key factor in determining New Zealand Pinot noir wine composition

Making high-quality but affordable Pinot noir (PN) wine is challenging in most terroirs, and New Zealand’s (NZ) situation is no exception. To increase the probability of making highly typical PN wines, producers choose to grow grapes in cool climates on lower-fertility soils while adopting labour-intensive practices. Stringent yield targets and higher input costs necessarily mean that PN wine cost is high, and profitability lower, in line-priced varietal wine ranges. To understand why higher-yielding vines are perceived to produce wines of lower quality, we have undertaken an extensive study of PN in NZ. Since 2018, we have established a network of twelve trial sites in three NZ regions to find individual vines that produced acceptable commercial yields (above 2.5 kg per vine) and wines of composition comparable to “Icon” labels. Approximately 20% of 660 grape lots (N = 135) were selected from within a narrow juice Total Soluble Solids (TSS) range and made into single-vine wines under controlled conditions. Principal Component Analysis of the vine, berry, juice and wine parameters from three vintages found grape berry mass to be the most effective clustering variable. As berry mass category decreased, there was a systematic increase in the probability of higher berry red colour and total phenolics, with a parallel increase in wine phenolics, a changed aroma fraction and decreased juice amino acids. The influence of berry size on wine composition appears stronger than the individual effects of vintage, region, vineyard or vine yield. Our observations support the hypothesis that it is possible to produce PN wines that fall within an “Icon” benchmark composition range at yields above 2.5 kg per vine, provided that the Leaf Area:Fruit Weight ratio is above 12 cm² per g, mean berry mass is below 1.2 g and juice TSS is above 22 °Brix.

Upscaling the integrated terroir zoning through digital soil mapping: a case study in the Designation of Origin Campo de Borja

homogeneous zones by intersecting several partial zonings of major factors that influence vineyard growth. Each of them follows a specific process from its corresponding discipline. Soil zoning specifically refers to a Soil Resource Inventory map that has traditionally been generated by conventional soil mapping methods. These methods fall short in reaching fine cartographic and categorical detail and involve significant expense, which undermines their applicability. A newer framework, Digital Soil Mapping, uses quantitative statistical models to establish soil-landscape relationships and is able to provide cartography at intensive scales.

In the present study, a microzoning at 1:10,000 scale is generated from an initial zoning, where the conventional soil map with polytaxic map units is replaced by a new one, produced with digital techniques, that disaggregates them. The comparison between the zonings considers a quantitative evaluation of capability for each Homogeneous Terroir Unit by means of the Viticultural Quality Index and its categorization based on its distribution by map. The spatial intersection of both maps gives rise to a confusion matrix in which the flows of class variations after the substitution are assessed.

The results show a five-fold increase in the number of Homogeneous Terroir Units identified and a larger differentiation among them, evidenced by a wider range in the capability index distribution. Both elements are accompanied by an increase in the detection of areas of higher potential within previously undervalued uniform zones. These features are a direct effect of the improvements brought by Digital Soil Mapping techniques and support the advantages of their implementation in the Integrated Terroir zoning. Ultimately, such new, highly detailed terroir units would benefit precision viticulture and sustainable management practices.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
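The first evaluation scheme (repeated random hold-out of known values) can be sketched as follows. This is an illustration under assumptions, not the authors' pipeline: a simple mean imputer stands in for the four methods compared, the error is normalised by the range of the observed values (the abstract does not specify the normalisation), and the series and function names are hypothetical.

```python
import math
import random


def mean_impute(values):
    """Stand-in imputer: fill None entries with the mean of observed entries."""
    observed = [v for v in values if v is not None]
    fill = sum(observed) / len(observed)
    return [fill if v is None else v for v in values]


def monte_carlo_nrmse(series, imputer, holdout_fraction=0.10, repeats=1000, seed=0):
    """Average normalised RMSE over repeated random hold-outs of known values."""
    rng = random.Random(seed)
    observed_idx = [i for i, v in enumerate(series) if v is not None]
    observed = [series[i] for i in observed_idx]
    value_range = max(observed) - min(observed)  # assumed normalisation
    n_holdout = max(1, round(holdout_fraction * len(observed_idx)))
    scores = []
    for _ in range(repeats):
        held = rng.sample(observed_idx, n_holdout)
        masked = list(series)
        for i in held:
            masked[i] = None  # hide the true value from the imputer
        filled = imputer(masked)
        rmse = math.sqrt(sum((series[i] - filled[i]) ** 2 for i in held) / n_holdout)
        scores.append(rmse / value_range)
    return sum(scores) / len(scores)


# Hypothetical veraison dates (day of year) for one variety, with gaps.
series = [210, 214, None, 208, 219, 212, None, 216, 211, 213, 209, 215]
score = monte_carlo_nrmse(series, mean_impute, repeats=200)
print(f"mean NRMSE = {score:.3f}")
```

Any imputer with the same signature (a list with `None` gaps in, a filled list out) could be passed in place of `mean_impute`, which is what allows several methods to be compared on identical hold-out draws when the seed is fixed.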