Macrowine 2021

Beyond classical statistics – data fusion coupled with pattern recognition

Abstract

AIM: Patterns in data obtained from wine chemical and sensory evaluations are difficult to infer using classical statistics. Pattern recognition can be achieved by coupling data fusion with machine learning techniques, possibly leading to new hypotheses being formed. This study demonstrates the applicability of two pattern recognition approaches using a case study involving Chenin Blanc wines (recently bottled and after two years' storage) from young (35 years) vines.

METHODS: Sensory (sorting; Mafata et al. 2020) and chemical (NMR: nuclear magnetic resonance, HRMS: high-resolution mass spectrometry, and UV-Vis: ultraviolet-visible spectrophotometry) data were collected for the young and aged (two years in the bottle) wines. Data sets were combined using multiple factor analysis (MFA). Exploratory unsupervised cluster analysis was performed by agglomerative hierarchical clustering (AHC) and Fuzzy-k means (Bezdek 1981). Optimal cluster conditions were found for both methods, and the cophenetic coefficient was used to assess which clustering method was the most confident.
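The fuzzy clustering step can be illustrated with a minimal sketch. It uses the standard fuzzy c-means update rules (the algorithm usually associated with Bezdek 1981), and assumes Euclidean distance and a fuzzifier of m = 2. It is not the authors' implementation: the function name `fuzzy_c_means`, its parameters, and the toy data are all hypothetical, and the MFA fusion step is not reproduced.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Return (centers, memberships) for a list of equal-length numeric tuples.

    memberships[i][j] is the degree (0..1) to which point i belongs to
    cluster j; each row sums to 1. The fuzzifier m > 1 is an assumption.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Cluster centers: means of the points weighted by membership ** m.
        centers = []
        for j in range(n_clusters):
            w = [u[i][j] ** m for i in range(n)]
            wsum = sum(w)
            centers.append(tuple(
                sum(w[i] * points[i][d] for i in range(n)) / wsum
                for d in range(dim)
            ))

        # Memberships: closer centers receive a larger share.
        new_u = []
        for p in points:
            dists = [max(math.dist(p, c), 1e-12) for c in centers]
            new_u.append([
                1.0 / sum((dj / dk) ** (2.0 / (m - 1.0)) for dk in dists)
                for dj in dists
            ])

        # Stop once no membership changes by more than tol.
        shift = max(abs(new_u[i][j] - u[i][j])
                    for i in range(n) for j in range(n_clusters))
        u = new_u
        if shift < tol:
            break
    return centers, u


# Hypothetical toy data: two well-separated groups of three samples.
data = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
centers, memberships = fuzzy_c_means(data, n_clusters=2)
# Hard labels, if needed, are the cluster with the largest membership.
labels = [max(range(2), key=lambda j: row[j]) for row in memberships]
```

Unlike hard clustering, each sample keeps a graded membership in every cluster, so memberships close to 0.5 flag samples that cannot be confidently assigned to one group.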

RESULTS: Since large data sets were fused, the models were very complex. There were no consistent clustering patterns when varying clustering conditions, signalling high similarity between samples. The samples could not confidently be distinguished from one another even at the highest optimized conditions. Although Fuzzy-k means gave more confident clustering, it was still not sufficient for solving classification issues in this sample set.

CONCLUSIONS: Fuzzy-k means was better at resolving the natural grouping of samples. Coupled to data fusion, it could potentially lead to better pattern recognition, especially for oenological chemical and sensory data. The fuzzy approach should be explored, keeping in mind it is more sensitive to small differences in the data compared to classical statistics.

DOI:

Publication date: September 7, 2021

Issue: Macrowine 2021

Type: Article

Authors

Mpho Mafata¹ ², Jeanne Brand¹, Astrid Buica¹

¹ South African Grape and Wine Research Institute, Department of Viticulture and Oenology, Stellenbosch University, South Africa
² School for Data Science and Computational Thinking, Stellenbosch University, South Africa

Keywords

data fusion, pattern recognition, machine learning, artificial intelligence, multiple factor analysis, fuzzy-k means, cluster analysis

Related articles…

Geospatial trends of bioclimatic indexes in the topographically complex region of Barolo DOCG

Barolo DOCG is an economically important wine-producing region in Northwest Italy. It is a small region of approximately 70 km² gross area. The topography is very complex, with steeply sloped hills ranging in elevation from below 200 m to 550 m. Barolo DOCG wine is made exclusively from the Nebbiolo grape. Bioclimatic indexes are often used in viticulture to gain a better understanding of broader climate trends, which can be compared temporally and geographically. These indexes are also used for identifying potential phenological timing, growing region suitability, and potential risks associated with expected climatic changes. Understanding how topography influences bioclimatic indexes can help with understanding mesoscale climate behaviour, leading to improved decision making and risk management strategies. The average monthly maximum and minimum temperatures, the Cool Night Index, the Huglin Index, and the monthly diurnal range (from July to October) were calculated using data from 45 weather stations within a 40 km radius of the Barolo DOCG growing area between the years 1996 and 2019. Linear and multiple regression models were developed using independent variables (elevation, aspect, slope) extracted from a digital elevation model to identify significant relationships. Bioclimatic indexes were then kriged with external drift on a 100 m resolution grid, using the independent variables that showed significant relationships with the bioclimatic index. The maximum monthly temperatures and the Huglin Index showed consistent significant negative relationships with elevation in all years. The minimum monthly temperatures showed no relationship with elevation, but in some months a small but significant relationship was observed with aspect. Because minimum monthly temperatures showed no relationship with elevation while maximum monthly temperatures decreased significantly with elevation, monthly diurnal range had a negative relationship with elevation.
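For readers unfamiliar with the two named indexes, the sketch below shows how they are commonly computed from daily station data. The abstract does not state its formulas, so this follows the usual textbook definitions: the Huglin Index sums the average of (mean temperature minus 10 °C) and (maximum temperature minus 10 °C) over 1 April to 30 September in the Northern Hemisphere, scaled by a latitude-dependent day-length coefficient k, and the Cool Night Index is the mean daily minimum temperature in September. The function names, the clamping of negative daily terms to zero, and the input values are assumptions for illustration only.

```python
def huglin_index(daily_tmean, daily_tmax, k=1.0, base=10.0):
    """Huglin Index from daily mean and maximum temperatures (deg C).

    Inputs should cover 1 April to 30 September (Northern Hemisphere).
    k is the day-length coefficient; it depends on latitude and is left
    to the caller. Negative daily contributions are clamped to zero,
    which is an assumption of this sketch.
    """
    if len(daily_tmean) != len(daily_tmax):
        raise ValueError("temperature series must have the same length")
    total = 0.0
    for tmean, tmax in zip(daily_tmean, daily_tmax):
        daily = ((tmean - base) + (tmax - base)) / 2.0
        total += max(daily, 0.0)
    return k * total


def cool_night_index(september_tmin):
    """Cool Night Index: mean daily minimum temperature (deg C) in September."""
    return sum(september_tmin) / len(september_tmin)


# Hypothetical season: 183 days (1 April to 30 September) of constant weather.
hi = huglin_index([18.0] * 183, [26.0] * 183)  # 12.0 degree-units per day
cni = cool_night_index([12.0, 14.0])           # two hypothetical nights
```

Because the Huglin Index depends on maximum temperature while the Cool Night Index depends on minimum temperature, the two can respond differently to elevation, which is consistent with the contrast the abstract reports.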

Better understand the soil wet bulb formation with subsurface or aerial drip irrigation in viticulture

The gradual change in rainfall patterns experienced in the vineyards of the south of France, especially around the Mediterranean Sea, means that the vines are increasingly subject to summer drought. Winegrowers have developed the use of irrigation techniques to maintain competitive yields in the production of wines under the Protected Geographical Indication label. In practice, drip irrigation pipes can be installed above the ground or buried in the soil, and at different distances from the vine row. The objective of this study was to examine the profiles of the soil wet bulbs obtained from two drip irrigation systems: aerial drip located under the vine row, and subsurface drip placed in the middle of the inter-row. The experiment took place over two consecutive seasons (2020-2021) on a 3.4 ha Viognier plot in the Mediterranean region (PGI Oc, France) on sandy clay soil. Annual rainfall was less than 400 mm. Soil water content probes were installed at different depths (20 – 40 – 60 – 80 cm) and at different lateral distances from the vine row (30 – 60 – 90 – 120 cm) to monitor the formation of the soil wet bulb during irrigation. Mapping and analysing the data allowed a better understanding and differentiation of water percolation when irrigating with subsurface or aerial drip. For the same amount of water, and without differences in vine water status, the wet bulb formed under subsurface drip irrigation was larger than under aerial drip irrigation.

Copper contamination in vineyard soils of Bordeaux: spatial risk assessment for the replanting of vines and crops

Copper (Cu) is widely and historically used in viticulture as a fungicide against mildew. Cu has a strong affinity for soil organic matter and accumulates in topsoil horizons. Thus, Cu may negatively affect soil organisms and plants, consequently reducing soil fertility and productivity. The Bordeaux vineyards have the largest vineyard surfaces (26%) within French controlled appellations and a large proportion of French wine production (around 5 million hl per year). Considering the local context of decreasing vineyard surfaces (vine uprooting) and possible new crop plantation, the issue of potential Cu toxicity arises. Therefore, the aims of this work are firstly to evaluate the Cu contamination in vineyard soils of Bordeaux, and secondly to produce a risk assessment map for new vine or crop plantation. We used soil analyses from several local studies to build a database with 4496 soil horizon samples. The database was enhanced by means of pedotransfer functions in order to estimate the bioaccessible (EDTA-extractable) Cu in soils of samples without measurements. From this database, 1797 georeferenced samples with CuEDTA concentrations in the topsoil (0-50 cm depth) were used for kriging interpolation in order to produce the spatial distribution map of CuEDTA in vineyard soils. Then, the spatial distribution of Cu was crossed with vine uprooting surfaces and municipality boundaries. CuEDTA concentrations ranged from 0.52 to 459 mg/kg and showed clear anomalies. Our results from spatial analysis showed that almost 50% of vineyard soil surfaces have CuEDTA concentrations higher than 30 mg/kg (moderate risk for new plantation) and 20% have concentrations higher than 50 mg/kg (high risk for new plantation). A decision-support map based on municipalities was produced to provide a simple tool for stakeholders concerned with land use management.

The potential of multispectral/hyperspectral technologies for early detection of “flavescence dorée” in a Portuguese vineyard

“Flavescence dorée” (FD) is a grapevine quarantine disease associated with phytoplasmas and transmitted to healthy plants by insect vectors, mainly Scaphoideus titanus. Infected plants usually develop symptoms of stunted growth, unripe cane wood, leaf rolling, leaf yellowing or reddening, and shrivelled berries. Since plants can remain symptomless for up to four years, they may act as reservoirs of FD, contributing to the spread of the disease. So far, conventional management strategies rely mainly on insecticide treatments, uprooting of infected plants and use of phytoplasma-free propagation material. However, these strategies are costly and could have undesirable environmental impacts. Thus, the development of sustainable and noninvasive approaches for early detection of FD and its management is of great importance to reduce disease spread and select the best cultural practices and treatments. The present study aimed to evaluate whether multispectral/hyperspectral technologies can be used to detect FD before the appearance of the first symptoms and whether infected grapevines display a spectral imaging fingerprint. To that end, physiological parameters (leaf area, chlorophyll content and photosynthetic rate) were collected concomitantly with measurements of plant reflectance (using both a portable apparatus and a remote sensing drone). Measurements were performed on two leaves of 8 healthy and 8 FD-infected grapevines at four timepoints: before the development of disease symptoms (21st June), and after symptom appearance at veraison (2nd August), at post-veraison (11th September), and at harvest (25th September). At all timepoints, FD-infected plants showed a significant decrease in the studied physiological parameters, with a positive correlation with drone imaging data and portable apparatus analyses. Moreover, spectra from both drone imaging and the portable apparatus showed clear differences between healthy and FD-infected grapevines, validating multispectral/hyperspectral technology as a potential tool for the early detection of FD or other grapevine-associated diseases.

Effect of partial net shading on the temperature and radiation in the grapevine canopy, consequences on the grape quality of cv. Gros Manseng in PDO Pacherenc-du-Vic-Bilh

As elsewhere, vineyards in southwestern France have faced more recurrent summer heat waves in recent years. Among the possible adaptations to this changing climate parameter, net shading is a technique that limits canopy exposure to radiation. In this trial, we tested net shading installed on one face of the canopy, on a north-south row-oriented plot of cv. Gros Manseng trained on a VSP system in the PDO Pacherenc-du-Vic-Bilh. The purpose was to characterize the effects on ambient canopy temperature and radiation during the season and to observe the consequences on the composition of grapes and wines. Two sorts of net were used, with two levels of obstruction (50% and 75%) of the photosynthetically active radiation (PAR). They were installed on the west side of the canopy and compared to a netless control. Temperature and PAR sensors recorded hourly data during the season. On specific summer days (hot and sunny), manual measurements were also taken on bunches (temperature) and in different spots of the canopy (PAR). The results showed that, on clear days, radiation was lowered by the shade nets in line with the supplier's specifications. The effects on ambient canopy temperature were inconsistent on this plot over the whole period of shading between fruit set and harvest. However, during hot days (>30°C), the temperature in the canopy was reduced during the afternoon, and the temperature of the bunch surface was reduced as well compared to the control. A decrease in the maturity parameters of the berries, sugar and acidity, was also observed. Concerning the wine aromatic potential, no clear differences appeared.

Effect of vigour and number of clusters on oenological parameters and metabolic profile of Cabernet Sauvignon red wines

Vegetative growth and yield are reported to affect grape and wine quality. They can be controlled through different techniques linked to vine management. The objective of this research was to determine the effect of vine vigour and number of clusters per vine on the physicochemical composition and phenolic profile of red wines. The experiment was carried out during two vegetative cycles, with cv. Cabernet Sauvignon grafted onto Paulsen 1103. Three vine vigour levels (low, medium and high) were defined according to shoot weight at previous harvests. Five cluster-number treatments were used for each vigour level, with 15, 22, 29, 36, and 45 clusters per vine. Grapes from all treatments were harvested on the same day, based on Brix and total acidity criteria. Thirty days after bottling, classical analyses and analyses of phenolic compounds were performed. Responses differed between vintages. In 2020, a dry season from veraison to harvest, grapes and wines from the low vigour treatment with 45 clusters per vine were the highest in sugar and alcohol content respectively, while grapes and wines from high vigour with 15 clusters had the lowest sugar and alcohol content. Total anthocyanins were higher in the treatment with low vigour and 15 clusters, while the lowest amounts were found in low vigour with 45 clusters, as well as medium and high vigour with 36 clusters per vine. Total tannins were higher in high vigour with 22 clusters and medium vigour with 29 clusters, and lower in low vigour with 36 clusters. In 2021, a wet season at harvest, responses were different, and great variations were observed between treatments. In conclusion, yield and vine vigour had a strong influence on grape and wine quality, giving different oenological potentials that can inform aging strategies for red and even rosé wines.

Rapid damage assessment and grapevine recovery after fire

There is increasing scientific consensus that climate change is the underlying cause of the prolonged dry and hot conditions that have increased the risk of extreme fire weather in many countries around the world. In December 2019, a bushfire event occurred in the Adelaide Hills, South Australia, where 25,000 hectares were burnt and various degrees of scorching and infrastructure damage occurred in vineyards and surrounding areas. The ability to coordinate and plan recovery after a fire event relies on robust and timely data. The current practice for measuring the scale and distribution of fire damage is to walk or drive the vineyard and score individual vines based on visual observation. The process is time consuming and subjective, or semi-quantitative at best. After the December 2019 fires, it took many months to access properties and estimate the area of vineyard damaged. This study compares the rapid assessment and mapping of fire damage using high-resolution satellite imagery with more traditional ground-based measures. Satellite imagery tracking vineyard recovery in the season following the bushfire is being correlated to field assessments of vineyard productivity such as canopy health and development, fertility and carbohydrate storage. Canopy health in the seasons following the fires correlated with the severity of the initial fire damage. Severely damaged vines had reduced canopy growth, were infertile or had very low fertility, and had lower carbohydrate levels in buds and canes during dormancy, which reduced productivity in the seasons following the bushfire event. In contrast, vines that received minor damage were able to recover within 1-2 years. Tools that rapidly and affordably capture the extent and severity of damage over large vineyard areas will allow producers, government and industry bodies to manage decisions in relation to fire recovery planning, coordination and delivery, improving the efficiency and effectiveness of their response.

An analytical framework to site-specifically study climate influence on grapevine involving the functional and Bayesian exploration of farm data time series synchronized using an eGDD thermal index

Climate influence on grapevine physiology is prevalent and this influence is only expected to increase with climate change. Although governed by a general determinism, climate influence on grapevine physiology may present variations according to the terroir. In addition, these site-specific differences are likely to be enhanced when climate influence is studied using farm data. Indeed, farm data integrate additional sources of variation such as a varying representativity of the conditions actually experienced in the field. Nevertheless, there is a real challenge in valuing farm data to enable grape growers to understand their own terroir and consequently adapt their practices to the local conditions. In such a context, this article proposes a framework to site-specifically study climate influence on grapevine physiology using farm data. It focuses on improving the analysis of time series of weather data. The analytical framework includes the synchronization of time series using site-specific thermal indices computed with an original method called Extended Growing Degree Days (eGDD). Synchronized time series are then analyzed using a Bayesian functional Linear regression with Sparse Steps functions (BLiSS) in order to detect site-specific periods of strong climate influence on yield development. The article focuses on temperature and rain influence on grape yield development as a case study. It uses data from three commercial vineyards situated in the Bordeaux region (France), California (USA) and Israel, respectively. For all vineyards, common periods of climate influence on yield development were found. They corresponded to already known periods, for example around veraison of the year before harvest. However, the periods differed in their precise timing (e.g. before, around or after veraison), duration and correlation direction with yield. Other periods were found for only one or two vineyards and/or were not referred to in the literature, for example during the winter before harvest.

Modelling vine water stress during a critical period and potential yield reduction rate in European wine regions: a retrospective analysis

Most European vineyards are managed under rainfed conditions, where seasonal water deficit has become increasingly important. The flowering-veraison phenophase represents an important period for vine response to water stress, which is seldom thoroughly evaluated. Therefore, we aim to quantify the flowering-veraison water stress levels using the Crop Water Stress Indicator (CWSI) over 1986–2015 for important European wine regions, and to assess the respective potential Yield Loss Rate (YLR). Additionally, we investigate whether an advanced flowering-veraison phase may help alleviate water stress and improve yield. The process-based grapevine model STICS is employed, which has been extensively calibrated for flowering and veraison stages using observed data at 38 locations with 10 different grapevine varieties. Subsequently, the model is implemented at the regional level, considering site-specific calibration results and gridded climate and soil datasets. The findings suggest that wine regions with stronger flowering-veraison CWSI tend to have higher potential YLR. However, contrasting patterns are found between wine regions in France-Germany-Luxembourg and Italy-Portugal-Spain. The former tend to have slight-to-moderate drought conditions (CWSI<0.5) and a negligible-to-moderate YLR (<30%), whereas the latter have severe-to-extreme CWSI (>0.5) and substantial YLR (>40%). Wine regions prone to a high drought risk (CWSI>0.75) are also identified, and are concentrated in southern Mediterranean Europe. An advanced flowering-veraison phase may have benefited from cooler temperatures and a higher fraction of spring precipitation in wine regions of Italy-Portugal-Spain, resulting in alleviated CWSI and moderate reductions of YLR. For those of France-Germany-Luxembourg, an advanced phase may have reduced flowering-veraison precipitation, but prevalent alleviations of YLR are also found, possibly because the phase shifted towards a cooler part of the growing season with reduced evaporative demand. Overall, such a retrospective analysis might provide new insights towards better management of seasonal water deficit for conventionally vulnerable Mediterranean wine regions, but also for relatively cooler and wetter Central European regions.

VINIoT – Precision viticulture service

The VINIoT project pursues the creation of a new technological vineyard monitoring service, which will allow companies in the wine sector in the SUDOE space to monitor plantations in real time and remotely at various levels of precision. The system to be designed is based on spectral images and an IoT architecture that allows the assessment of parameters of interest in viticulture and the collection of data at a precise scale (level of grape, plant, plot or vineyard). In France, three subjects were specifically developed: evaluation of maturity, evaluation of water stress, and detection of flavescence dorée. For the evaluation of maturity, it was decided to work first at the berry scale in the laboratory, then at the bunch scale and finally in the vineyard. The acquisition of the hyperspectral images, as well as the reference analyses to measure maturity, was carried out in the laboratory after harvesting the berries in a maturity monitoring context. This work focuses on a case study to predict the sugar content of three different grape varieties: Syrah, Fer Servadou and Mauzac. A robust method called Roboost-PLSR, developed in the framework of this work (Courand et al., 2022) to improve prediction model performance, was applied to spectra after the acquisition of hyperspectral images. Regarding the evaluation of water stress, in order to obtain significant variability in water status, work was first carried out on potted plants under 2 different water regimes. The facilities allowed the supervision of irrigation and micro-climatic conditions. Regression models on agronomic variables (stomatal conductance, water potential, …) are being studied. To detect flavescence dorée, the experimental plan consisted of work at leaf scale in the laboratory first, and then in the field. To detect the disease from hyperspectral imaging, a combination of multivariate curve resolution-alternating least squares (MCR-ALS) and factorial discriminant analysis (FDA) was proposed. This strategy showed potential for discriminating between healthy leaves and leaves infected by flavescence dorée based on hyperspectral images (Mas Garcia et al., 2021).