Deep learning reveals regulatory logic of water stress adaptation in grapevine

Abstract

Predicting how sequence variants affect regulatory activity is critical for crop improvement, but confidence in these predictions, especially for diverse varieties, remains insufficient for breeding applications. Reference genomes collapse natural genetic diversity, creating reference bias that limits prediction accuracy across heterozygous crop genomes. We demonstrate the application of interpretable deep learning to a non-model perennial crop, Vitis vinifera (grapevine), and explore how layering computational evidence can build confidence in regulatory predictions.

We retrained the DeepCRE convolutional neural network on 163 public RNA-seq samples from water-stressed grapevine to predict genome-wide gene expression (high versus low) from flanking sequence. The model learns sequence-to-expression relationships, which we then interpret using gradient-based methods to extract the regulatory motifs driving predictions. Using chromosome-wise cross-validation to prevent homology inflation, the model achieved 91% accuracy and 92% F1 score. TF-MoDISco interpretation revealed 29 expression-predictive motifs organised into two functional meta-clusters.

Multiple lines of computational evidence support these predictions: genome-wide motif mapping shows enrichment consistent with expected expression patterns; gene ontology analysis reveals significant enrichment of co-occurring motifs in canonical drought response pathways including ABAsignalling, osmotic stress response, and flavonoid biosynthesis; integration with co-expression networks identifies candidate hub transcription factors. Comparative analysis between water stress and control models reveals patterns of regulatory rewiring under drought conditions.

This work explores how integrating multiple computational approaches can support confident prioritisation of regulatory elements and variants for targeted validation – a critical step toward explainable AI predictions for breeding applications in heterozygous crop genomes.

Publication date: June 22, 2026

Issue: GBG 2026

Type: Poster

Authors

Larysha Rothmann1, José Tomás Matus3, Jedrzej Szymanski2, Justin Lashbrooke1,*

Stellenbosch University, South Africa

Leibniz Institute of Plant Genetics and Crop Plant Research, Germany

University of Valencia, Institute of Integrative Systems Biology, Spain

Contact the author*

Keywords

deep learning, transcriptional regulation, water stress

Tags

GBG | GBG 2026 | IVES Conference Series

Citation

Related articles…

Exploring grapevine genetic resources in a changing climate

Plant genetic resources have sustained human societies throughout history. Through selection and propagation, humans have shaped plant gene pools to enhance productivity, local adaptation, and diversity of products across continents.

Archaeogenomics reveals few generations separating ancient Eastern Mediterranean and modern Iberian grapevines over three millennia of viticulture

Viticulture became central to most western Mediterranean civilisations only a few millennia after grapevine (Vitis vinifera L.) was domesticated in the South Caucasus and the Near East.

Documenting and mining disease resistance alleles in the USDA Vitis repositories

The USDAAgricultural Research Service maintains Vitis germplasm repositories in Geneva, NY and Davis, CAcollectively preserving approximately 5,000 unique accessions representing 30 Vitis species.

Study of ancient north-east Italian grape varieties taking advantage of an optimized aDNA extraction protocol

Grapevine (Vitis vinifera L.) is one of the most extensively cultivated fruit trees in the world. It is cultivated primarily for wine production but also for fruit fresh consumption.

What 2,900 wild grapevines reveal about the genetic diversity of Vitis vinifera L. subsp. sylvestris

Vitis vinifera L. subsp. sylvestris is the wild ancestor of the European cultivated grapevine (V. vinifera L. subsp. sativa).