terclim by ICS banner
IVES 9 IVES Conference Series 9 A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

A better understanding of the climate effect on anthocyanin accumulation in grapes using a machine learning approach

Abstract

The current climate changes are directly threatening the balance of the vineyard at harvest time. The maturation period of the grapes is shifted to the middle of the summer, at a time when radiation and air temperature are at their maximum. In this context, the implementation of corrective practices becomes problematic. Unfortunately, our knowledge of the climate effect on the quality of different grape varieties remains very incomplete to guide these choices. During the Innovine project, original experiments were carried out on Syrah to study the combined effects of normal or high air temperature and varying degrees of exposure of the berries to the sun. Berries subjected to these different conditions were sampled and analyzed throughout the maturation period. Several quality characteristics were determined, including anthocyanin content. The objective of the experiments was to investigate which climatic determinants were most important for anthocyanin accumulation in the berries. Temperature and irradiance data, observed over time with a very thin discretization step, are called functional data in statistics. We developed the procedure SpiceFP (Sparse and Structured Procedure to Identify Combined Effects of Functional Predictors) to explain the variations of a scalar response variable (a grape berry quality variable for example) by two or three functional predictors (as temperature and irradiance) in a context of joint influence of these predictors. Particular attention was paid to the interpretability of the results. Analysis of the data using SpiceFP identified a negative impact of morning combinations of low irradiance (lower than about 100 μmol m−2 s−1 or 45 μmol m−2 s−1 depending on the advanced-delayed state of the berries) and high temperature (higher than 25oC). A slight difference associated with overnight temperature occurred between these effects identified in the morning.

DOI:

Publication date: May 31, 2022

Issue: Terclim 2022

Type: Article

Authors

Girault Gnanguenon Guesse1, Patrice Loisel1, Bénedicte Fontez1, Nadine Hilgert1 and Thierry Simonneau2

1MISTEA, Université Montpellier, INRAE, Institut Agro, Montpellier, France
2LEPSE, Université Montpellier, INRAE, Institut Agro, Montpellier, France

Contact the author

Keywords

machine learning, anthocyanin, temperature, irradiance, SpiceFP

Tags

IVES Conference Series | Terclim 2022

Citation

Related articles…

Grapevine yield estimation in a context of climate change: the GraY model

Grapevine yield is a key indicator to assess the impacts of climate change and the relevance of adaptation strategies in a vineyard landscape. At this scale, a yield model should use a number of parameters and input data in relation to the information available and be able to reproduce vineyard management decisions (e.g. soil and canopy management, irrigation). In this study, we used data from six experimental sites in Southern France (cv. Syrah) to calibrate a model of grapevine yield limited by water constraint (GraY). Each yield component (bud fertility, number of berries per bunch, berry weight) was calculated as a function of the soil water availability simulated by the WaLIS water balance model at critical phenological phases. The model was then evaluated in 10 grapegrowers’ plots, covering a diversity of biophysical and technical contexts (soil type, canopy size, irrigation, cover crop). We identified three critical periods for yield formation: after flowering on the previous year for the number of bunches and berries, around pre-veraison and post-veraison of the same year for mean berry weight. Yields were simulated with a model efficiency (EF) of 0.62 (NRMSE = 0.28). Bud fertility and number of berries per bunch were more accurately simulated (EF = 0.90 and 0.77, NRMSE = 0.06 and 0.10, respectively) than berry weight (EF = -0.31, NRMSE = 0.17). Model efficiency on the on-farm plots reached 0.71 (NRMSE = 0.37) simulating yields from 1 to 8 kg/plant. The GraY model is an original model estimating grapevine yield evolution on the basis of water availability under future climatic conditions.  It allows to evaluate the effects of various adaptation levers such as planting density, cover crop management, fruit/leaf ratio, shading and irrigation, in various production contexts.

Exploring resilience and competitiveness of wine estates in Languedoc-Roussillon in the recent past: a multi-level perspective

The Languedoc-Roussillon wineries are facing a decline in wine yields particularly PGI yields due to many factors. Climate change is just ones, but is expected to increase in the future. There is also structurally a large heterogeneity of yield profiles among terroirs, varieties and strategies. This work investigates the link between yield, competitiveness and resilience to explore how resilient winegrowers have been in the recent past. To this end two approaches have been combined; (i) an accountancy database analysis at estate scale and (ii) municipality level competitiveness analysis. A new resilience indicator that characterizes the capacity of an estate to absorb yield variation is also defined. The FADN database between 2000 and 2018 of ex-Languedoc-Roussillon (France) and other data are used to analyse the current situation and the past evolution of competitiveness and resilience by type of estate (type of farm: PGI and/or PDO & type of commercialization: bulk and/or bottles). The net margin, which defines competitiveness, is not correlated to yield for all types but depends on the type of commercialization and the level of specialisation. The resilience indicator shows that the net margin of estates specialized in PGI is particularly sensitive to yield declines. We also show that price evolutions seem to compensate the effect of yield losses for the majority of types. Municipality scale analysis shows the links between local pedoclimate, yield, commercialization strategies and price. Overlapping a PDO with a PGI does not always increase a municipality’s PGI competitiveness. It is difficult to make links between causes and effects due to the complexity of the wine production system. Production diversification may be a solution. Resorting to the two level of analysis helps resolving the data gap that is necessary to explore the links between yield and economic performance of the wine estates in the long term.

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

Biodiversity in the vineyard agroecosystem: exploring systemic approaches

Biodiversity conservation and restoration are essential for guarantee the provision of ecosystem services associated to vineyard agroecosystem such as climate regulation trough carbon sequestration and control of pests and diseases. Most of published research dealing with the complexity of the vineyard agroecosystems emphasizes the necessity of innovative approaches, including the integration of information at different temporal and spatial scales and development of systemic analysis based on modelling. A biodiversity survey was conducted in the Franciacorta wine-growing area (Lombardy, Italy), one of the most important Italian wine-growing regions for sparkling wine production, considering a portion of the territory of 112 ha. The area was divided into several Environmental Units (EUs), defined as a whole vineyard or portion of vineyard homogenous in terms of four agronomic characteristics: planting year, planting density, cultivar, and training system. In each EU a set of compartments was identified and characterised by specific variables. The compartments are meteorology, morphology (altitude, slope, aspect, row orientation, and solar irradiance), ecological infrastructures and management. The landscape surrounding EU was also characterised in terms of land-use in a buffer zone of 500 m. For each component a specific methodology was identified and applied. Different statistical approaches were used to evaluate the method to integrate the information related to different compartments within the EU and related to the buffer zone. These approaches were also preliminarily evaluated for their ability to describe the contribution of biodiversity and landscape components to ecosystem services. This methodological exploration provides useful indication for the development of a fully systemic approach to structural and functional biodiversity in vineyard agroecosystems, contributing to promote a multifunctional perspective for the all wine-growing sector.

Downscaling of remote sensing time series: thermal zone classification approach in Gironde region

In viticulture, the challenges of local climate modelling are multiple: taking into account the local environment, fine temporal and spatial scales, reliable time series of climate data, ease of implementation and reproducibility of the method. At the local scale, recent studies have demonstrated the contribution of spatialization methods for ground-based climate observation data considering topographic factors such as altitude, slope, aspect, and geographic coordinates (Le Roux et al, 2017; De Rességuier et al, 2020). However, these studies have shown questions in terms of the reproducibility and sustainability of this type of climate study. In this context, we evaluated the potential of MODIS thermal satellite images validated with ground-based climate data (Morin et al, 2020). Previous studies have been encouraging, but questions remain to be explored at the regional scale, particularly in the dynamics of the massive use of bioclimatic indices to classify the climate of wine regions. The results at the local scale were encouraging, but this approach was tested in the current study at the regional scale. Several objectives were set: 1) to evaluate the downscaling method for land surface temperature time series, 2) to identify regional thermal structure variations. We used weekly minimum and maximum surface temperature time series acquired by MODIS satellites at a spatial resolution of 1000 m and downscaled at 500 m using topographical variables. Two types of analyses were performed: