NSF requires disclosure of AI tool usage in proposal preparation. Ensure you disclose the use of FindGrants' AI drafting in your application.
NSF
The goal of this research project is to build mathematical foundations for reasoning about the behavior of modern machine learning systems. Foundations are needed to direct future developments in Artificial Intelligence (AI) research, and also to diagnose and remedy problems that arise in existing AI systems. This project specifically focuses on how AI models represent data. For example, many language models are "trained" using vast collections of text from books, websites, and other sources, but these texts are not "stored" in the model in the same way that documents are stored on a computer. Rather, the texts are transformed in a way that seems to facilitate their use for a variety of tasks, ranging from generating code for a website to solving mathematical word problems.These representations, however, are not perfect, and they also seem to lead to embarrassing errors committed by AI models, such as miscounting the number of Rs in the word STRAWBERRY. Developing a mathematical theory of the representations used by AI models will help demystify how the models perform these tasks and reveal fundamental limitations that result in errors. The theory will also guide the development of next-generation models that go beyond the limitations of current models. Feature learning is a key ingredient in the success of modern machine learning systems, and thus its understanding is a essential in any theory of deep learning. The aims of this project are as follows. The first is to develop general principles of how features emerge by building on well-established mathematical structures, such as low rank structure and circulant structure. The second is to study these principles in analytically tractable and practical architectures--such as two-layer multilayer Perceptrons, kernel methods, and transformers--in the context of specific data frameworks/inference problems such as multi-index models and modular arithmetic. The third is to develop new architectures that are potentially viable alternatives to neural networks that are currently in widespread deployment, thereby mitigating risks of over-reliance on specific technologies while suggesting directions for improving or controlling current architectures. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Up to $475K
2028-09-30
Detailed requirements not yet analyzed
Have the NOFO? Paste it below for AI-powered requirement analysis.
One-time $49 fee · Includes AI drafting + templates + PDF export
Research Infrastructure: National Geophysical Facility (NGF): Advancing Earth Science Capabilities through Innovation - EAR Scope
NSF — up to $26.6M
AmLight: The Next Frontier Towards Discovery in the Americas and Africa
NSF — up to $9M
CREST Phase II Center for Complex Materials Design
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Energy Technologies
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Post-Transcriptional Regulation
NSF — up to $7.5M
EPSCoR CREST Phase I: Center for Semiconductors Research
NSF — up to $7.5M