Machado Lab at the University of Maryland

Ron’s Dissertation Defense

Ron will defend his dissertation on Wednesday, April 2nd, at 9:00 AM in BRB 1103.

Title: Investigating The Evolution of sORFs and Proto/De Novo Genes in the Drosophila pseudoobscura Subgroup using Comparative Transcriptome and Genome Data

Abstract: The discovery of small open reading frames (sORFs) and de novo genes has reshaped our understanding of genome functionality, revealing that a much larger portion of the genome is transcribed than previously recognized. However, identifying and validating these unannotated transcripts remains challenging due to their short sequence lengths, low expression levels, and lineage-specific divergence. Here, we employ a multifaceted approach to refine the unannotated transcriptome of Drosophila pseudoobscura and Drosophila persimilis using a large RNA-seq dataset. By assessing expression consistency across multiple strains, developmental stages, and tissues, we identify a subset of small ORF-encoded peptides (SEPs) and proto/de novo genes (p/DNGs) with potential biological relevance. Our analysis reduced the unannotated transcriptome to a refined set of 2,864 consistently expressed transcripts in D. pseudoobscura, of which 1,260 exhibit conserved consistent expression in D. persimilis. Clustering these transcripts into distinct expression profiles revealed 764 transcripts with coordinated developmental regulation, and among them, 85 contained ORFs supported by mass spectrometry in D. pseudoobscura. Further validation through Kozak sequence analysis strengthened the case for their potential translation. To contextualize these transcripts, we examined their genomic distribution, chromosomal localization, and enrichment within Topologically Associated Domains (TADs). Our findings reveal significant differences in transcript expression relative to TAD boundaries, suggesting stable gene expression patterns and potential shared regulatory mechanisms. Additionally, we investigated their co-localization with transposable elements (TEs), uncovering striking differences in TE composition between transcript subsets based on biological support and expression profiles. These results suggest that TE activity may influence the regulatory landscape of unannotated transcripts. By integrating expression conservation, genomic architecture, and evolutionary dynamics, our study refines the strategies for identifying functional sORFs and p/DNGs. This work provides novel insights into the transcriptional landscape of D. pseudoobscura and D. persimilis, offering a framework for future studies on the evolutionary constraints and functional relevance of unannotated genes.

Previous post
New Machado Lab Website!
Next post
Congratulations, Dr. Ron Clark!