NSF requires disclosure of AI tool usage in proposal preparation.
NSF
Large Language Models (LLMs) show great promise for generating source code and automating programming tasks. But these models are error-prone and can produce code with subtle bugs. This poses a risk for deploying LLMs in industrial settings for software engineering tasks: the subtly erroneous code generated by LLMs can expose vulnerabilities that compromise system security. It has been shown that the weakness of LLMs for code generation primarily stems from not accounting for the semantic properties of programs when training, using, and evaluating these models. This project aims to improve LLMs' ability to generate high-quality code by deeply integrating program analyses with all the stages in the life cycle of LLMs: training, code generation, and evaluation. This project develops novel quantitative program analysis techniques to provide feedback to LLMs during training and decoding. First, the project leverages symbolic execution and Bayesian program analysis to design meaningful metrics to evaluate LLM-generated code. The project then uses these program-analysis scores to train a differentiable reward model that can assess the quality of partial or complete generated code. At training time, inspired by Reinforcement Learning from Human Feedback (RLHF), this project uses the reward model for fine-tuning LLMs to generate high-quality code. To improve code generation at decoding time, this project leverages the reward model and similarity-based program ranking techniques to constrain and prune the decoding tree. Finally, this project develops semantics-guided metrics and collects new benchmarks consisting of realistic coding tasks for training and evaluating code LLMs. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Up to $225K
2028-09-30
Category I: CloudBank 2: Accelerating Science and Engineering Research in the Commercial Cloud
NSF — up to $24.0M
Category I: Nexus: A Confluence of High-Performance AI and Scientific Computing with Seamless Scaling from Local to National Resources
NSF — up to $24.0M
Research Infrastructure: Mid-scale RI-1 (MI:IP): Dual-Doppler 3D Mobile Ka-band Rapid-Scanning Volume Imaging Radar for Earth System Science
NSF — up to $20.0M
A Scientific Ocean Drilling Coordinating Office for the US Community
NSF — up to $17.6M
Category I: AMA27: Sustainable Cyber-infrastructure for Expanding Participation
NSF — up to $13.8M
Graduate Research Fellowship Program (GRFP)
NSF — up to $9.0M