GiESCO 2019 banner
IVES 9 IVES Conference Series 9 GiESCO 9 GiESCO 2019 9 Aroma and quality assessment for vertical vintages using machine learning modelling based on weather and management information

Aroma and quality assessment for vertical vintages using machine learning modelling based on weather and management information

Abstract

Context and purpose of the study ‐ Wine quality traits are usually given by parameters such as aroma profile, total acidity, alcohol content, colour and phenolic content, among others. These are highly dependent on the weather conditions during the growing season and management strategies. Therefore, it is important to develop predictive models using machine learning (ML) algorithms to assess and predict wine quality traits before the winemaking process.

Material and methods ‐ Samples in duplicates of Pinot Noir wines from vertical vintages (2008 to 2013) of the same winery located in Macedon Ranges, Victoria, Australia were used to assess different chemical analytics such as i) aromas using gas chromatography – mass spectrometry, ii) color density, iii) color hue, iv) degree of red pigmentation, v) total red pigments, vi) total phenolics, vii) pH, viii) total acidity (TA), and ix) alcohol content. Data from weather conditions from the specific vintages were obtained both from the bureau of meteorology (BoM) and the Australian Wine Availability Project (AWAP) climate databases. Such data consisted of: i) solar exposure from veraison to harvest (V‐H), ii) solar exposure from September to harvest (S‐H), iii) maximum January solar exposure, iv) degree days from S‐H, v) maximum January evaporation, vi) mean maximum temperature from veraison to harvest, vii) mean minimum temperature from V‐H, viii) water balance from S‐H, ix) solar exposure from V‐H, x) degree hour accumulation with base 25 – 30 °C, xi) degree hour accumulation with base 25 °C, xii) degree hour accumulation with base 30 °C, xiii) degree hour accumulation with base 35 °C, and xiv) total cumulative degree days accumulation with base 10 °C. All data were used to develop two machine learning (ML) regression models using Matlab® R2018b. The best models obtained were using artificial neural networks (ANN) with the Levenberg‐Marquardt algorithm with 5 neurons for Model 1 and 9 neurons for Model 2. Model 1 was developed using the 14 parameters from the weather data as inputs to predict 21 aromas found in the wines from the six different vinatges. Model 2 was developed using the same 14 parameters from weather data and the eight chemical parameters as targets and outputs.

Results ‐ Both models obtained presented very high accuracy to predict wine quality trait parameters. Model 1 had an overall correlation coefficient R = 0.99 with a high performance based on the mean squared error (MSE = 0.01), while Model 2 had an overall correlation coefficient R = 0.98 with a high performance (MSE = 0.03). These models would aid in the prediction of wine quality traits before its production, which would give anticipated information to winemakers about the product they would obtain to make early decisions on wine style variations.

DOI:

Publication date: June 22, 2020

Issue: GiESCO 2019

Type: Article

Authors

Sigfredo FUENTES, Claudia GONZALEZ VIEJO, Xiaoyi WANG, Damir D. TORRICO

School of Agriculture and Food, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, VIC 3010, Australia

Contact the author

Keywords

wine quality, machine learning, weather, aromas

Tags

GiESCO 2019 | IVES Conference Series

Citation

Related articles…

The interplay between grape ripening and weather anomalies – A modeling exercise

Current climate change is increasing inter- and intra-annual variability in atmospheric conditions leading to grapevine phenological shifts as well altered grape ripening and composition at ripeness. This study aims to (i) detect weather anomalies within a long-term time series, (ii) model grape ripening revealing altered traits in time to target specific ripeness thresholds for four Vitis vinifera cultivars, and (iii) establish empirical relationships between ripening and weather anomalies with forecasting purposes. The Day of the Year (DOY) to reach specific grape ripeness targets was determined from time series of sugar concentrations, total acidity and pH collected from a private company in the period 2009-2021 in North-Eastern Italy. Non-linear models for the DOY to reach the specified ripeness thresholds were assessed for model efficiency (EF) and error of prediction (RMSE) in four grapevine cultivars (Merlot, Cabernet Sauvignon, Glera and Garganega). For each vintage and cultivar, advances or delays in DOY to target specified ripeness thresholds were assessed with respect to the average ripening dynamics. Long-term meteorological series monitored at ground weather station by means of hourly air temperature and rainfall data were analyzed. Climate statistics were obtained and for each time period (month, bimester, quarter and year) weather anomalies were identified. A linear regression analysis was performed to assess a possible correlation that may exist between ripening and weather anomalies. For each cultivar, ripeness advances or delays expressed in number of days to target the specific ripening threshold were assessed in relation to registered weather anomalies and the specific reference time period in the vintage. Precipitation of the warmest month and spring quarter are key to understanding the effect of climate change on sugar ripeness. Minimum temperatures of May-June bimester and maximum temperatures of spring quarter best correlate with altered total acidity evolution and pH increment during the ripening process, respectively.

Updating the Winkler index: An analysis of Cabernet sauvignon in Napa Valley’s varied and changing climate

This study aims to create an updated, agile viticultural climate index (similar to the Winkler Index) by performing in-depth analyses of current and historical data from industry partners in several major winegrowing regions. The Winkler Index was developed in the early twentieth century based on analysis of various grape-growing regions in California. The index uses heat accumulation (i.e. Growing Degree Days) throughout the growing season to determine which grape varieties are best suited to each region. As viticultural regions are increasingly subject to the complexity and uncertainty of a changing climate, a more rigorous, agile model is needed to aid grape growers in determining which cultivars to plant where. For the first phase of this study, 21 industry partners throughout Napa Valley shared historical phenology, harvest, viticultural practice, and weather data related to their Cabernet sauvignon vineyard blocks. To complement this data, berry samples were collected throughout the 2021 growing season from 50 vineyard blocks located throughout 16 American Viticultural Areas that were then analyzed for basic berry chemistry and phenolics. These blocks have been mapped using a Geographic Information System (GIS), enabling analysis of altitude, vineyard row orientation, slope, and remotely sensed climate data. Sampling sites were also chosen based on their proximity to a weather station. By analyzing historical data from industry partners and data specifically collected for this study, it is possible to identify key parameters for further analysis. Initial results indicate extreme variability at a high spatial resolution not currently accounted for in modern viticultural climate indices and suggest that viticultural practices play a major role. Using the structure of data collection and analyses developed for the first phase, this project will soon be expanded to other wine regions globally, while continuing data collection in Napa Valley.

Low-cost sensors as a support tool to monitor soil-plant heat exchanges in a Mediterranean vineyard

Mediterranean viticulture is increasingly exposed to more frequent extreme conditions such as heat waves. These extreme events co-occur with low soil water content, high air vapor pressure deficit and high solar radiant energy fluxes and result in leaf and berry sunburn, lower yield, and berry quality, which is a major constraint for the sustainability of the sector. Grape growers must find ways to proper and effectively manage heat waves and extreme canopy and berry temperatures. Irrigation to keep soil moisture levels and enable adequate plant turgor, and convective and evaporative cooling emerged as a key tool to overcome this major challenge. The effects of irrigation on soil and plant water status are easily quantifiable but the impact of irrigation on soil and canopy temperature and on heat convection from soil to cluster zone remain less characterized. Therefore, a more detailed quantification of vineyard heat fluxes is highly relevant to better understand and implement strategies to limit the effects of extreme weather events on grapevine leaf and berry physiology and vineyards performance. Low-cost sensor technologies emerge as an opportunity to improve monitoring and support decision making in viticulture. However, validation of low-cost sensors is mandatory for practical applicability. A two-year study was carried in a vineyard in Alentejo, south of Portugal, using low-cost thermal cameras (FLIR One, 80×60 pixels and FLIR C5, 160×120 pixels, 8-14 µm, FLIR systems, USA) and pocket thermohygrometers (Extech RHT30, EXTECH instruments, USA) to monitor grapevine and soil temperatures. Preliminary results show that low-cost cameras can detect severe water stress and support the evaluation of vertical canopy temperature variability, providing information on soil surface temperature. All these thermal parameters can be relevant for soil and crop management and be used in decision support systems.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

De novo Vitis champinii whole genome assembly allows rootstock-specific identification of potential candidate genes for drought and salt tolerance

Vitis champinii cultivars Ramsey and Dog-ridge are main choices for rootstocks to adapt viticulture in semi-arid and arid regions thanks to their distinctive tolerance to drought and salinity. However, genetic studies on non-vinifera rootstocks have heavily relied on the grapevine (Vitis vinifera) reference genome, which difficulted the assessment of the genetic variation between rootstock species and grapevines. In the present study, this limitation is addressed by introducing a novo phased genome assembly and annotation of Vitis champinii. This new Vitis champinii genome was employed as reference for mapping RNA-seq reads from the same species under drought and salt stresses, and for comparison the same reads were also mapped to the Vitis vinifera PN40024.V4 reference genome. A significant increase in alignment rate was gained when mapping Vitis champinii RNA-seq reads to its own genome, compared to the Vitis vinifera PN40024.V4 reference genome, thus revealing the expression levels of genes specific to Vitis champinii. Moreover, differences in coding sequences were observed in ortholog genes between Vitis champinii and Vitis vinifera, which therefore challenges previous differential expression analyses performed between contrasting Vitis genotypes on the same gene from the Vitis vinifera genome. Genes with possible implications in drought and salt tolerance have been identified across the genome of Vitis champinii, and the same genomic data can potentially guide the discovery of candidate genes specific from Vitis champinii for other traits of interest, therefore becoming a valuable resource for rootstock breeding designs, specially towards increased drought and salinity due to climate change.