NIGMS - National Institute of General Medical Sciences
TITLE: Advancing Statistical Methods for Next-Generation Microbiome Data Analytics Abstract: The focus of our research lab is to support human microbiome research with rigorous, robust, and practical statistical and quantitative methodology. In the next five years, our goals are to address critical data analysis gaps raised by recent advancements in microbiome epidemiology and bioinformatics, and to develop novel statistical methods that will facilitate the next generation of microbiome analytics. First, expanding microbiome consortia and public databases increase precision in testing the microbiome’s association with host conditions, but introduce more sources for unmeasured confounding such as population heterogeneity or uncollected covariates (e.g., medication). Recent statistical research has focused on confounding from the data’s compositional nature but largely ignored these unmeasured and potentially stronger factors. We propose to benchmark unmeasured confounding effects in microbiome studies with diverse real-world data, and to develop specialized latent factor modeling techniques for adjustment. This will improve false discovery control and facilitate robust findings in large-scale microbiome association studies. Second, modern bioinformatics can generate rich whole-microbiome genetic profiles with millions of microbial genes. But these profiles tend to have extreme levels of sparsity and lack functional annotations, limiting interpretability and statistical power in downstream findings. We propose to leverage recent breakthroughs in artificial intelligence and genomic large language models (LLMs) towards this problem. We will utilize the inherent functional dependency structure encoded by genomic LLMs to create putatively functional orthologs of microbial genes, which can meaningfully aggregate sparse and under-annotated “dark-matter” genes. This will empower downstream analyses and unlock the potential of modern microbiome bioinformatics. Third, metatranscriptomic (MTX) protocols are increasingly available, which characterize the microbiome beyond its functional capacities (“what can microbes do”) and reveal functional bioactivities in situ (“what are microbes doing”). MTX expressions depend on the underlying gene abundances, which are dynamic and error prone. Existing methods do not account for this and can generate biased findings. We will develop a rigorous error-in-variable model that will properly adjust for dynamic and noisy gene abundances, thus enabling robust differential expression analysis in new MTX studies. Our developed methodologies will be validated in different populations, disease settings, and microbiome ecologies, and implemented as open-source software. Derived data resources will be provided as publicly available databases. Through this proposal, our overall vision is to address critically unmet analytical needs from recent advancements in microbiome epidemiology and bioinformatics, and provide a publicly available toolkit of novel statistical methods that will support the next phase of human microbiome research and catalyze translational discoveries. 1
Up to $434K
2030-12-31
Detailed requirements not yet analyzed
Have the NOFO? Paste it below for AI-powered requirement analysis.
One-time $749 fee · Includes AI drafting + templates + PDF export
Dynamic Cognitive Phenotypes for Prediction of Mental Health Outcomes in Serious Mental Illness
NIMH - National Institute of Mental Health — up to $18.3M
COORDINATED FACILITIES REQUIREMENTS FOR FY25 - FACILITIES TO I
NCI - National Cancer Institute — up to $15.1M
Leveraging Artificial Intelligence to Predict Mental Health Risk among Youth Presenting to Rural Primary Care Clinics
NIMH - National Institute of Mental Health — up to $15.0M
Feasibility of Genomic Newborn Screening Through Public Health Laboratories
OD - NIH Office of the Director — up to $14.4M
WOMEN'S HEALTH INITIATIVE (WHI) CLINICAL COORDINATING CENTER - TASK AREA A AND A2
NHLBI - National Heart Lung and Blood Institute — up to $10.2M
Metal Exposures, Omics, and AD/ADRD risk in Diverse US Adults
NIA - National Institute on Aging — up to $10.2M