Modelling grape and wine quality through PLS Spline statistical method

Abstract

Started in 1994, this project aims to explain grape and wine quality using the soil, climate and vineyard data that are routinely collected in field trials. Initially set up at a national scale, it was transferred to the Aquitaine region in 2000. The work was conducted by the ITV institute together with many partners. Two cultivars were considered: cvs. Merlot and Cabernet Sauvignon.
A data set was collected over different years and plots covering varied environmental and cultural situations. The data were mined with the PLS Spline method. Four models were produced: sugar and total acids in musts, and colour intensity and total polyphenolic compounds in wines. These models identify the variables that most influence quality and rank them. A validation on plots that had not been used to build the models was carried out in 2006. The predictions are of an acceptable level and are best read as an estimate of a plot's potential. At the same time, the models were integrated into a more convenient tool, the SPQV 1.1 software, which is aimed at growers' advisors.
The models cannot give a prediction during the year in which the grapes are produced, because they use post-harvest variables. Nevertheless, they can be a helpful tool for potential zoning, plot selection or planting advice.
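
As an illustration only, the general idea behind a PLS Spline model can be sketched as follows: each predictor is expanded into a spline basis, and a PLS regression is then fitted on the expanded columns. The sketch is not the SPQV models. It assumes a simplified degree-1 truncated power basis in place of B-splines, a single response (PLS1 fitted by NIPALS), and synthetic data; the predictors `x1` and `x2`, the knot positions and all function names are hypothetical.

```python
import math

def hinge_basis(x, knots):
    """Degree-1 truncated power basis: [x, max(0, x-k1), max(0, x-k2), ...]."""
    return [x] + [max(0.0, x - k) for k in knots]

def expand(rows, knots):
    """Replace every predictor of every row by its spline basis columns."""
    return [[b for x in row for b in hinge_basis(x, knots)] for row in rows]

def fit_pls1(X, y, n_components):
    """PLS1 by NIPALS on centred data; returns means and (w, p, q) per component."""
    n, m = len(X), len(X[0])
    x_mean = [sum(r[j] for r in X) / n for j in range(m)]
    y_mean = sum(y) / n
    E = [[r[j] - x_mean[j] for j in range(m)] for r in X]  # X residuals
    f = [v - y_mean for v in y]                            # y residuals
    comps = []
    for _ in range(n_components):
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(m)]
        norm = math.sqrt(sum(v * v for v in w))
        if norm < 1e-10:  # nothing left in y that X can explain
            break
        w = [v / norm for v in w]
        t = [sum(E[i][j] * w[j] for j in range(m)) for i in range(n)]  # scores
        tt = sum(v * v for v in t)
        p = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(m)]
        q = sum(f[i] * t[i] for i in range(n)) / tt
        # Deflate X and y before extracting the next component.
        E = [[E[i][j] - t[i] * p[j] for j in range(m)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        comps.append((w, p, q))
    return x_mean, y_mean, comps

def predict_pls1(model, row):
    """Predict one expanded row by replaying the deflation steps."""
    x_mean, y_mean, comps = model
    e = [v - m for v, m in zip(row, x_mean)]
    y_hat = y_mean
    for w, p, q in comps:
        t = sum(a * b for a, b in zip(e, w))
        y_hat += q * t
        e = [a - t * b for a, b in zip(e, p)]
    return y_hat

def r_squared(y, y_hat):
    mean = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean) ** 2 for a in y)
    return 1.0 - ss_res / ss_tot

# Synthetic data (assumption): a V-shaped effect of x1 plus a linear effect of x2.
KNOTS = [0.25, 0.5, 0.75]
rows = [[i / 39, ((7 * i) % 40) / 39] for i in range(40)]
y = [abs(x1 - 0.5) + 0.5 * x2 for x1, x2 in rows]

def training_r2(knots, n_components):
    X = expand(rows, knots)
    model = fit_pls1(X, y, n_components)
    return r_squared(y, [predict_pls1(model, r) for r in X])

r2_linear = training_r2([], 2)     # plain linear PLS on x1, x2
r2_spline = training_r2(KNOTS, 8)  # PLS on the spline-expanded predictors
print("linear R2:", round(r2_linear, 3), "spline R2:", round(r2_spline, 3))
```

The comparison shows why a spline expansion can help when a predictor acts non-linearly: the linear fit cannot represent the V-shaped effect, whereas the expanded basis can. In practice, the number of components and the knots would be chosen on data not used for fitting, in the spirit of the 2006 validation described above.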

Publication date: December 8, 2021

Issue: Terroir 2008

Type: Article

Authors

CLAVERIE M., PRUD’HOMME PY., MONGENDRE J., ZABOLLONE E., RAYNAL M., COULON T. (1), DURAND J.F. (2), MAZEIRAUD JF., RIVES C. (3), LAVAL C. (4), LAPORTE R. (5), FORGET D. (6)

(1) Institut Français de la Vigne et du Vin (ENTAV-ITV France), Station régionale Aquitaine, 39 rue Michel Montaigne, Blanquefort, France
(2) Laboratoire de Probabilités et Statistiques, Université de Montpellier II, Montpellier, France
(3) Chambre d’Agriculture de Lot-et-Garonne, 271 rue de Péchabout, Agen, France
(4) Chambre d’Agriculture de Dordogne, CRDA du Bergeracois, Monbazillac, France
(5) Chambre d’Agriculture des Landes, Mont de Marsan, France
(6) INRA Domaine expérimental de Couhins, Villenave d’Ornon, France

Keywords

vine, quality, model

Tags

IVES Conference Series | Terroir 2008

Related articles…

Underpinning terroir with data: rethinking the zoning paradigm

Agriculture, natural resource management and the production and sale of products such as wine are increasingly data-driven activities. Thus, the use of remote and proximal crop and soil sensors to aid management decisions is becoming commonplace and ‘Agtech’ is proliferating commercially; mapping, underpinned by geographical information systems and complex methods of spatial analysis, is widely used. Likewise, the chemical and sensory analysis of wines draws on multivariate statistics; the efficient winery intake of grapes, subsequent production of wines and their delivery to markets relies on logistics; whilst the sales and marketing of wines is increasingly driven by artificial intelligence linked to the recorded purchasing behaviour of consumers. In brief, there is data everywhere!

Opinions will vary on whether these developments are a good thing. Those concerned with the ‘mystique’ of wine, or the historical aspects of terroir and its preservation, may find them confronting. In contrast, they offer an opportunity to those interested in the biophysical elements of terroir, and efforts aimed at better understanding how these impact on vineyard performance and the sensory attributes of resultant wines. At the previous Terroir Congress, we demonstrated the potential of analytical methods used at the within-vineyard scale in the development of Precision Viticulture, in contributing to a quantitative understanding of regional terroir. For this conference, we take this approach forward with examples from contrasting locations in both the northern and southern hemispheres. We show how, by focussing on the vineyards within winegrowing regions, as opposed to all of the land within those regions, we might move towards a more robust terroir zoning than one derived from a mixture of history, thematic mapping, heuristics and the whims of marketers. Aside from providing improved understanding by underpinning terroir with data, such methods should also promote improved management of the entire wine value chain.

How distinctive are single vineyard Gewürztraminer musts and wines from Alto Adige (Italy) based on untargeted analysis, sensory profiling, and chemometric elaboration?

Vitis vinifera L. ‘Gewürztraminer’ is a historical grape variety of Alto Adige (Südtirol), Italy, which is widely grown in the area of Tramin an der Weinstraße, but is also grown globally. It produces highly aromatic wines that are strongly influenced by the terroir of the vineyard sites where they are grown. This study looked at musts and young wines from ‘Gewürztraminer’ grapes harvested in seven distinct vineyards near Tramin and then processed at Cantina di Termeno, minimizing winemaking protocol variability. Samples were profiled using bidimensional gas chromatography–time-of-flight mass spectrometry, liquid chromatography coupled to electrochemical detection, and near-IR spectrometry. The data were subjected to Principal Component Analysis and Hierarchical Clustering Analysis. Sensory discriminant testing was undertaken using the sorting method with a semi-trained panel, and the data were processed using Multidimensional Scaling. Seven must/wine pairs could be distinguished based on their untargeted volatilome profiles and on sensory evaluation. As expected, there were greater differences in the volatile compounds between the wines than between the musts. The wines from vineyards 4 and 5 were nonetheless quite homogeneous in terms of chemical and sensory analyses, as were the wines from vineyards 1 and 3. For the phenolic profile, differences were noted between the musts and wines of vineyards 2, 3, and 4, but the musts from vineyards 5 and 7 were similar. Sensory analysis showed the wines from vineyards 6 and 7 to be distinct from the rest. These results reinforce that the composition of ‘Gewürztraminer’ musts and wines is strongly determined by vineyard site, even in a small geographic area with high variability of the terroir (soil and microclimate), and that these differences are apparent in the flavours and aromas of the finished wines. Further confirmation would require a larger sample of wines, preferably from several vintages.

Influence of weather and climatic conditions on the viticultural production in Croatia

The research includes an analysis of the impact of weather conditions on phenological development of the vine and grape quality, through monitoring of four experimental cultivars (Chardonnay, Graševina, Merlot and Plavac mali) over two production years. In each experimental vineyard, which were evenly distributed throughout the regions of Slavonia and The Croatian Danube, Croatian Uplands,

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties has been maintained in Conegliano, Italy, since the 1950s. Phenological data on a subset of 400 grape varieties, including wine grapes, table grapes, and raisins, have been acquired at budbreak, flowering, veraison, and ripening since 1964. Despite the efforts made to maintain the collection and acquire data over such an extensive set of varieties, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets of significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly held out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a stepping stone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.
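
The hold-out procedure described in 1) can be sketched as below. This is a minimal illustration under stated assumptions, not the study's code: the imputer is a simple per-variety mean used as a stand-in (it is none of the four methods evaluated), the table is synthetic, and the error is normalized by the standard deviation of the held-out true values, which is only one of several possible normalizations and is not specified in the abstract. All function names are hypothetical.

```python
import math
import random

def variety_mean_impute(table):
    """Stand-in imputer: fill each missing cell (None) with the mean of its row."""
    observed = [v for row in table for v in row if v is not None]
    global_mean = sum(observed) / len(observed)
    filled = []
    for row in table:
        known = [v for v in row if v is not None]
        row_mean = sum(known) / len(known) if known else global_mean
        filled.append([row_mean if v is None else v for v in row])
    return filled

def nrmse(truth, estimate):
    """RMSE divided by the standard deviation of the true values (assumption)."""
    n = len(truth)
    rmse = math.sqrt(sum((t - e) ** 2 for t, e in zip(truth, estimate)) / n)
    mean = sum(truth) / n
    sd = math.sqrt(sum((t - mean) ** 2 for t in truth) / n)
    return rmse / sd

def monte_carlo_cv(table, impute, holdout=0.10, n_repeats=1000, seed=0):
    """Repeatedly hide a random share of observed cells, impute, and score them."""
    rng = random.Random(seed)
    cells = [(i, j) for i, row in enumerate(table)
             for j, v in enumerate(row) if v is not None]
    k = max(1, round(holdout * len(cells)))
    scores = []
    for _ in range(n_repeats):
        hidden = rng.sample(cells, k)
        masked = [row[:] for row in table]
        for i, j in hidden:
            masked[i][j] = None
        filled = impute(masked)
        scores.append(nrmse([table[i][j] for i, j in hidden],
                            [filled[i][j] for i, j in hidden]))
    return sum(scores) / len(scores)

# Synthetic table (assumption): 20 varieties x 30 years of day-of-year values,
# built from a variety offset plus a small year-to-year deviation.
table = [[100 + 2 * v + ((3 * y) % 7) - 3 for y in range(30)] for v in range(20)]

mean_nrmse = monte_carlo_cv(table, variety_mean_impute)
print("mean NRMSE over repeats:", round(mean_nrmse, 3))
```

Any imputer with the same interface (a table with `None` for missing cells in, a filled table out) could be passed as `impute`, which is what makes the comparison between methods straightforward.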

Grapevine yield-gap: identification of environmental limitations by soil and climate zoning in Languedoc-Roussillon region (south of France)

Grapevine yield has historically been overlooked, on the assumption of a strong trade-off between grape yield and wine quality. At present, under the threat of climate change, many vineyards in Southern France are far from the quality label threshold, making grapevine yield-gaps a major subject of concern. Although yield-gaps are well studied in arable crops, we know very little about grapevine yield-gaps. In the present study, we analysed the environmental component of grapevine yield-gaps linked to climate and soil resources in the Languedoc-Roussillon region. We used SAFRAN data and IGP Pays d’Oc wine yields from 2010 to 2018. We selected the climate and soil indicators that had a significant effect on average wine yield-gaps at the municipality scale. The most significant factor of grapevine yield was the Soil Available Water Capacity, followed by the Huglin Index and the Climatic Dryness Index. The Days of Frost, the Soil pH and the Very Hot Days were also significant. Then, we clustered geographical zones presenting similar indicators, facilitating the identification of resource yield-gaps. We discussed the number of zones with the experts of the IGP Pays d’Oc label, obtaining 7 zones with similar limitations for grapevine yield. Finally, we analysed the main resources causing yield-gaps and the grapevine varieties planted in each zone. Mapping grapevine resource yield-gaps is the first stage in understanding grapevine yield-gaps at the regional scale.