GiESCO 2019 banner
IVES 9 IVES Conference Series 9 GiESCO 9 GiESCO 2019 9 Aroma and quality assessment for vertical vintages using machine learning modelling based on weather and management information

Aroma and quality assessment for vertical vintages using machine learning modelling based on weather and management information

Abstract

Context and purpose of the study ‐ Wine quality traits are usually given by parameters such as aroma profile, total acidity, alcohol content, colour and phenolic content, among others. These are highly dependent on the weather conditions during the growing season and management strategies. Therefore, it is important to develop predictive models using machine learning (ML) algorithms to assess and predict wine quality traits before the winemaking process.

Material and methods ‐ Samples in duplicates of Pinot Noir wines from vertical vintages (2008 to 2013) of the same winery located in Macedon Ranges, Victoria, Australia were used to assess different chemical analytics such as i) aromas using gas chromatography – mass spectrometry, ii) color density, iii) color hue, iv) degree of red pigmentation, v) total red pigments, vi) total phenolics, vii) pH, viii) total acidity (TA), and ix) alcohol content. Data from weather conditions from the specific vintages were obtained both from the bureau of meteorology (BoM) and the Australian Wine Availability Project (AWAP) climate databases. Such data consisted of: i) solar exposure from veraison to harvest (V‐H), ii) solar exposure from September to harvest (S‐H), iii) maximum January solar exposure, iv) degree days from S‐H, v) maximum January evaporation, vi) mean maximum temperature from veraison to harvest, vii) mean minimum temperature from V‐H, viii) water balance from S‐H, ix) solar exposure from V‐H, x) degree hour accumulation with base 25 – 30 °C, xi) degree hour accumulation with base 25 °C, xii) degree hour accumulation with base 30 °C, xiii) degree hour accumulation with base 35 °C, and xiv) total cumulative degree days accumulation with base 10 °C. All data were used to develop two machine learning (ML) regression models using Matlab® R2018b. The best models obtained were using artificial neural networks (ANN) with the Levenberg‐Marquardt algorithm with 5 neurons for Model 1 and 9 neurons for Model 2. Model 1 was developed using the 14 parameters from the weather data as inputs to predict 21 aromas found in the wines from the six different vinatges. Model 2 was developed using the same 14 parameters from weather data and the eight chemical parameters as targets and outputs.

Results ‐ Both models obtained presented very high accuracy to predict wine quality trait parameters. Model 1 had an overall correlation coefficient R = 0.99 with a high performance based on the mean squared error (MSE = 0.01), while Model 2 had an overall correlation coefficient R = 0.98 with a high performance (MSE = 0.03). These models would aid in the prediction of wine quality traits before its production, which would give anticipated information to winemakers about the product they would obtain to make early decisions on wine style variations.

DOI:

Publication date: June 22, 2020

Issue: GiESCO 2019

Type: Article

Authors

Sigfredo FUENTES, Claudia GONZALEZ VIEJO, Xiaoyi WANG, Damir D. TORRICO

School of Agriculture and Food, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, VIC 3010, Australia

Contact the author

Keywords

wine quality, machine learning, weather, aromas

Tags

GiESCO 2019 | IVES Conference Series

Citation

Related articles…

Elucidating vineyard site contributions to key sensory molecules: Identification of correlations between elemental composition and volatile aroma profile of site-specific Pinot noir wines

The reproducibility of elemental profile in wines produced across multiple vintages has been previously reported using grapes from a single scion clone of Vitis vinifera L. cv. Pinot noir. The grapevines were grown on fourteen different vineyard sites, from Oregon to southern California in the U.S.A., which span distances from approximately hundreds of meters to 1450 km, while elevations range from near sea level to nearly 500 m. In addition, sensorial (i.e. aroma, taste, and mouthfeel) and chemical (i.e. polyphenolic and volatile) differences across the different vineyard sites have also been observed among these wines at two aging time points. While strong evidence exists to support that grapes grown in different regions can produce wines with unique chemical and sensorial profiles, even when a single clone is used, the understanding of growing site characteristics that result in this reproducible differentiation continues to emerge. One hypothesis is that the elemental profile that a vineyard site imparts to the grape berries and the resulting wine is an important contributor to this differentiation in chemistry and sensory of wines. For example, various classes of enzymes that catalyze the formation of key aroma compounds or their precursors require specific metals. In this work, we begin to report correlations between elemental and volatile aroma profiles of site-specific Pinot noir wines, made under standardized winemaking conditions, that have been previously shown to be distinguished separately by these chemical analyses.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards respectively situated in the Bordeaux region (France), California (USA) and Israel. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in literature, for example during the winter before harvest.

Effect of regulated deficit irrigation regime on amino acids content of Monastrell (Vitis vinifera L.) grapes

Irrigation is an important practice to influence vine quality, especially in Mediterranean regions, characterized by hot summers and severe droughts during the growing season. This study focused on deficit irrigation regime influence on amino acids composition of Monastrell grapevines under semiarid conditions (Albacete, Southeastern of Spain). In 2019, two treatments were applied: non-irrigation (NI) and regulated deficit irrigation (RDI), watered at 30% of the estimated crop evapotranspiration from fruit set to onset of veraison. Grape amino acids content was analyzed by HPLC. Berries from non-irrigated vines showed higher concentration of several amino acids, such as tryptophan (73%), arginine (70%), lysine (36%), isoleucine (27%), and leucine (21%), compared to RDI grapes. Arginine is, together with ammonium ion, the principal nitrogen source for yeasts during the alcoholic fermentation; while isoleucine, tryptophan, and leucine are precursors of fermentative volatile compounds, key compounds for wine quality. Moreover, NI treatment increased in a 14% the total amino acids content in grapes compared to RDI treatment. The reported effects might be because yield was 70% higher in RDI vines than in the NI ones and, therefore, the sink demand was increased in the irrigated vines. In addition, NI vines suffered more severe water stress and it is known that the amino acids synthesis and accumulation can be influenced by the plant response to stress. According to the results, the irrigation regime showed effect on amino acids concentration in Monastrell grapes under semiarid conditions. Grapes from non-irrigated vines showed a higher content of several amino acids relevant to the fermentative process and to the wine aroma compounds formation. It is demonstrated that the final content of nitrogen-related components in grapes is influenced by the irrigation regime. The convenience of the irrigation strategy to suggest will depend on the desired wine style and the target yield levels.

Variety and climatic effects on quality scores in the Western US winegrowing regions

Wine quality is strongly linked to climate. Quality scores are often driven by climate variation across different winegrowing regions and years, but also influenced by other aspects of terroir, including variety. While recent work has looked at the relationship between quality scores and climate across many European regions, less work has examined New World winegrowing regions. Here we used scores from three major rating systems (Wine Advocate, Wine Enthusiast and Wine Spectator) combined with daily climate and phenology data to understand what drives variation across wine quality scores in major regions of the Western US, including regions in California, Oregon and Washington. We examined effects of variety, region, and in what phenological period climate was most predictive of quality. As in other studies, we found climate, based mainly on growing degree day (GDD) models, was generally associated with quality—with higher GDD associated with higher scores—but variety and region also had strong effects. Effects of region were generally stronger than variety. Certain varieties received the highest scores in only some areas, while other varieties (e.g., Merlot) generally scored lower across regions. Across phenological stages, GDD during budbreak was often most strongly associated with quality. Our results support other studies that warmer periods generally drive high quality wines, but highlight how much region and variety drive variation in scores outside of climate.

Grapevine sugar concentration model in the Douro Superior, Portugal

Increasingly warm and dry climate conditions are challenging the viticulture and winemaking sector. Digital technologies and crop modelling bear the promise to provide practical answers to those challenges. As viticultural activities strongly depend on harvest date, its early prediction is particularly important, since the success of winemaking practices largely depends upon this key event, which should be based on an accurate and advanced plan of the annual cycle. Herein, we demonstrate the creation of modelling tools to assess grape ripeness, through sugar concentration monitoring. The study area, the Portuguese Côa valley wine region, represents an important terroir in the “Douro Superior” subregion. Two varieties (cv. Touriga Nacional and Touriga Franca) grown in five locations across the Côa Region were considered. Sugar accumulation in grapes, with concentrations between 170 and 230 g l-1, was used from 2014 to 2020 as an indicator of technological maturity conditioned by meteorological factors. The climatic time series were retrieved from the EU Copernicus Service, while sugar data were collected by a non-profit organization, ADVID, and by Sogrape, a leading wine company. The software for calibrating and validating this model framework was the Phenology Modeling Platform (PMP), version 5.5, using Sigmoid and growing degree-day (GDD) models for predictions. The performance was assessed through two metrics: Roots Mean Square Error (RMSE) and efficiency coefficient (EFF), while validation was undertaken using leave-one-out cross-validation. Our findings demonstrate that sugar content is mainly dependent on temperature and air humidity. The models achieved a performance of 0.65