terclim by ICS banner
IVES 9 IVES Conference Series 9 Data deluge: Opportunities, challenges, and lessons of big data in a multidisciplinary project

Data deluge: Opportunities, challenges, and lessons of big data in a multidisciplinary project

Abstract

Grapevine powdery mildew resistance is a key target for grape breeders and grape growers worldwide. The driver of the USDA-NIFA-SCRI VitisGen3 project is completing the pipeline from germplasm identification to QTL to candidate gene characterization to new cultivars to vineyards to consumers. This is a common thread across such projects internationally. We will discuss how our objectives and approaches leverage big data to advance this initiative, starting with genomics and computer vision phenotyping for gene discovery and genetic improvement. To manage and maintain resistances for long-term sustainability, growers will be trained through our nation-wide extension and outreach plan. Ultimately, consumers drive adoption of new varieties, and our socioeconomic research using eye-tracking will be briefly described. Across this multi-disciplinary research effort, big data presents opportunities, challenges, and lessons.

DOI:

Publication date: June 13, 2024

Issue: Open GPB 2024

Type: Article

Authors

Lance Cadle-Davidson1,2*, Matt Clark3, Dario Cantu4,5, Chengyan Yue3,6, Kaitlin Gold2, Yu Jiang2, Qi Sun7, Kate Fessler3

1 USDA-ARS Grape Genetics Research Unit, Geneva, NY, USA
2 School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, USA
3 Department of Horticultural Science, Univ. of Minnesota, Saint Paul, MN, USA
4 Department of Viticulture and Enology, University of California Davis, Davis, CA, USA
5 Genome Center, University of California Davis, Davis, CA, USA
6 Department of Applied Economics, Univ. of Minnesota, Saint Paul, MN, USA
7 BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, USA

Contact the author*

Keywords

Disease resistance, Grape breeding, Genomics, Computer vision, Consumer behavior

Tags

IVES Conference Series | Open GPB | Open GPB 2024

Citation

Related articles…

ReGenWine: A transdisciplinary project to assess concepts in regenerative viticulture

Regenerative agriculture is a set of agricultural practices that focus on improving the health of the soil, increasing biodiversity, and enhancing ecosystem services.

Evolution of the crown procyanidins during wine making and aging in bottle

Condensed tannins are widely distributed in plant‐derived foods and beverages like grape, red wine, nuts, tea, apples and chocolate in which they contribute to multiple sensorial properties such as flavor, color, and taste (astringency and bitterness). During the wine making process,

Comparison of imputation methods in long and varied phenological series. Application to the Conegliano dataset, including observations from 1964 over 400 grape varieties

A large varietal collection including over 1700 varieties was maintained in Conegliano, ITA, since the 1950s. Phenological data on a subset of 400 grape varieties including wine grapes, table grapes, and raisins were acquired at bud break, flowering, veraison, and ripening since 1964. Despite the efforts in maintaining and acquiring data over such an extensive collection, the data set has varying degrees of missing cases depending on the variety and the year. This is ubiquitous in phenology datasets with significant size and length. In this work, we evaluated four state-of-the-art methods to estimate missing values in this phenological series: k-Nearest Neighbour (kNN), Multivariate Imputation by Chained Equations (mice), MissForest, and Bidirectional Recurrent Imputation for Time Series (BRITS). For each phenological stage, we evaluated the performance of the methods in two ways. 1) On the full dataset, we randomly hold-out 10% of the true values for use as a test set and repeated the process 1000 times (Monte Carlo cross-validation). 2) On a reduced and almost complete subset of varieties, we varied the percentage of missing values from 10% to 70% by random deletion. In all cases, we evaluated the performance on the original values using normalized root mean squared error. For the full dataset we also obtained performance statistics by variety and by year. MissForest provided average errors of 17% (3 days) at budbreak, 14% (4 days) at flowering, 14.5% (7 days) at veraison, and 17% (3 days) at maturity. We completed the imputations of the Conegliano dataset, one of the world’s most extensive and varied phenological time series and a steppingstone for future climate change studies in grapes. The dataset is now ready for further analysis, and a rigorous evaluation of imputation errors is included.

FUNCTIONALIZED MESOPOROUS SILICA IS A VIABLE ALTERNATIVE TO BENTONITE FOR WINE PROTEIN STABILIZATION

The presence of grape-derived heat unstable proteins can lead to haze formation in white wines [1], an instability prevented by removing these proteins by adding bentonite, a hydrated aluminum silicate that interacts electrostatically with wine proteins leading to their flocculation. Despite effective, using bentonite has several drawbacks as the costs associated with its use, the potential negative effects on wine quality, and its environmental impact, so that alternative solutions are needed.

1H-NMR-based Untargeted Metabolomics to assess the impact of soil type on the chemical composition of Mediterranean red wines

Untargeted metabolomics has proven to be an effective method to study the impact of the terroir on metabolic profile of wines. In this context, the aim of this study was to evaluate the effects of different soil types on the chemical composition of Mediterranean red wines, through 1H-NMR metabolomics combined with chemometrics.Grapes from Nero d’Avola L. red cultivar cultivated on four different soil types were separately vinified to obtain four different red wines.One milliliter of raw wine was analyzed by means of a Bruker Avance II 400 spectrometer operating at 400.15 MHz