Music Flamingo: Scaling Music Understanding in Audio Language Models
Upload a song and ask anything — including captions, lyrics, genre, key, chords, or complex questions. Music Flamingo gives detailed answers.
Authors: Sreyan Ghosh1,2*, Arushi Goel1*, Lasha Koroshinadze2**, Sang-gil Lee1, Zhifeng Kong1, Joao Felipe Santos1,
Ramani Duraiswami2, Dinesh Manocha2, Wei Ping1, Mohammad Shoeybi1, Bryan Catanzaro1
1NVIDIA, CA, USA | 2University of Maryland, College Park, USA
*Equally contributed and led the project. Names randomly ordered. **Significant technical contribution.
Correspondence: sreyang@umd.edu, arushig@nvidia.com
🎵 Audio Input
OR
🎵 Example Prompts
| Upload Audio File | Prompt |
|---|
© 2025 NVIDIA | Powered by 🤗 Transformers + Gradio