NSF AI Disclosure Required
NSF requires disclosure of AI tool usage in proposal preparation. Ensure you disclose the use of FindGrants' AI drafting in your application.
Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages
NSF
About This Grant
This project introduces a new way to develop speech-to-text systems for language varieties in which little digital data is available, especially when a related variety already has plenty of digital data. Languages differ from one another, but they also show a great deal of variation within themselves. People in different regions or countries often pronounce words in the same language differently. For example, many people in the southern US pronounce the “i” in words like “ride” as an “a” sound (similar to the one found in “rad”), while most other Americans do not. Most speech-to-text systems—which turn spoken words into written text—focus on just one variety of each language. However, in everyday life, many people speak other varieties. Existing speech-to-text systems often do not work well for these varieties. Improving speech recognition for low-resource varieties represents a business opportunity, one that can help more Americans access voice-powered tools and services. This project takes one step towards this goal. It innovates by leveraging the fact that the differences between various varieties of the same language often follow predictable patterns. For example, since the pronunciations of words in different regions change following rules that apply to the whole vocabulary, one can often predict how a word will be pronounced in one variety if one knows how it is pronounced in another. The project will develop a powerful AI model (POWSM) that can transcribe pronunciation (using a universal system developed by linguists to represent sounds). This approach enables the development of speech-to-text systems for previously unsupported language variants, even when little recorded training data exists for them. It works by learning both the similarities and the systematic differences between well-resourced varieties and others. The project builds an encoder-decoder foundation speech model called POWSM (Phonetic Open Whisper-style Speech Model), which is trained to recognize speech as sequences of phones (consonants and vowels) in any language. The project will include three applications using this model: 1) Prompting POWSM with vector representations of the systematic sound correspondences between a low-resource variety (LRV) and a high-resource variety (HRV), enabling the model to recognize the LRV as a transformed variant of the HRV. 2) Constructing stochastic weighted finite-state transducers that can generate synthetic LRV data based on linguist-curated knowledge about sound changes in the HRV and LRV. Using this synthetic data to train a language model that can be used to decode LRV output from POWSM. 3) Learning phonetic correspondences between language varieties automatically from transcribed audio using a novel form of unsupervised bilingual lexicon induction (UBLI) that leverages both text and audio. Audio aligned in this way can be used to train basic speech-to-speech translation models that, when used in conjunction with POWSM, enable speech technologies for LRVs without requiring linguistic annotation beyond transcription. The proposed methods are intended to cover most HRV-LRV scenarios. They will be evaluated on major languages including varieties in English, Italian, Chinese, German, Arabic, German, and Dutch, English, as well as endangered languages like Nahuatl and Mixtec. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Grant Summary
Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages is a NSF grant providing up to $822K for university, nonprofit, small business. Applications are due 2029-08-31 (open). Check eligibility and apply with FindGrants.
Focus Areas
Eligibility
How to Apply
Up to $822K
2029-08-31
- 1Confirm your organization is eligible for Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages from NSF, checking organization type, location, and any population or project requirements.
- 2Gather the required documents and information, including your organization details, project plan, and budget figures.
- 3Draft your application narrative and budget addressing the funder's priorities and review criteria. FindGrants can draft each section for you to review and edit.
- 4Review every section against the requirements checklist, then export a submission-ready application pack and submit it to NSF before the deadline.
Don't want to draft it yourself?
We'll draft the complete application against NSF's requirements, run a quality review, and email you a submission-ready PDF plus an editable Word doc within 5 business days. Most orders deliver in 24-48 hours. Flat $399, any grant size.
AI Requirement Analysis
Detailed requirements not yet analyzed
Have the NOFO? Paste it below for AI-powered requirement analysis.
Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages: Frequently Asked Questions
Who is eligible for the Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages?
Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages is offered by NSF and is generally open to university, nonprofit, small business. It is open to organizations nationwide unless the funder specifies otherwise. Review the specific eligibility terms before applying, since funders set their own requirements around organization type, location, and the population or project being served.
How much funding does the Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages provide?
Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages provides up to $822K per award from NSF. Actual award sizes depend on the scope of your project, available program funds, and the number of applicants, so build a budget that reflects realistic, allowable costs rather than the maximum figure.
When is the Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages deadline?
Applications for Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages are due 2029-08-31 (open). Because deadlines can change, verify the date with the funder, NSF, and give yourself enough time to prepare a complete, competitive application before the close date.
How do you apply for the Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages?
To apply for Collaborative Research: RI: Medium: Using Systematic Relationships and Phonetic Speech Foundation Models for Universal Speech-to-Text across Varieties of Languages, confirm your eligibility, gather the required documents, and prepare a narrative and budget that address the funder's priorities. FindGrants guides you step by step and can draft each section, then exports a submission-ready application pack for this grant from NSF.