Skip to main content

Characterizing transposable element contributions to the transcriptome with evidence-based and inferential methods

NSF

open

About This Grant

Transposable elements (TEs), also known as ‘jumping genes,’ are DNA sequences that can change or duplicate their position within a genome. Particularly, when a new copy is inserted into a gene by a process called exonization, it creates new genetic material that contributes to new variations and functions within a species and, over time, to the evolution of new species. One class of TEs, called Alu, accounts for thousands of primate- and human-specific proteins, potentially creating ‘programs’ to control essential cell functions such as tissue type and immune response. However, the extent and role of exonized TEs in humans and other species are largely unknown. The project will produce an innovative entirely computational methodology to predict TE exonization for a variety of species and TE families. It will develop a deep learning model that can predict exonization directly from the genome sequence without the need for extensive gene and tissue surveys. It will utilize the model to identify Alu exonization events in humans and several primates and compare them to identify common and specific traits. Lastly, focusing on genome sequences from a multi-ethnic population cohort will help answer questions about the ongoing contributions of Alu insertions to genetic variation in humans. The knowledge, methods and resources generated, including software tools and educational and science popularization materials, will help support biologists in more comprehensively analyzing genomes, contribute to public awareness of this class of gene plasticity events, and provide new tools to understand the molecular bases of human traits. The project will develop a flexible deep learning model that biologists can use to annotate and study transposable element (TE) exonization in a wide range of species and for different TE families. Using the model and existing RNA sequencing data, it will create and comparatively analyze an unprecedented collection of annotations of Alu exonization events and regulatory features in the genomes of humans and several primate species, at different evolutionary time points, including very recent polymorphic Alu insertions observed in a large multi-ethnic population cohort. All methods, annotations and software tools will be available free of charge from public repositories. Additionally, the project will create electronic teaching and training modules on how to use the software, as well as educational art videos on the process and functional implications of TE exonization in biology and as a source of variation in the human population. Lastly, it will create opportunities for training and education from high school through graduate levels through research programs and summer internships. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Focus Areas

biologyeducation

Eligibility

universitynonprofitsmall business

How to Apply

Funding Range

Up to $875K

Deadline

2028-07-31

Complexity
Medium
Start Application

One-time $749 fee · Includes AI drafting + templates + PDF export

AI Requirement Analysis

Detailed requirements not yet analyzed

Have the NOFO? Paste it below for AI-powered requirement analysis.

0 characters (min 50)