Music Flamingo: Scaling Music Understanding in Audio Language Models

Upload a song and ask anything — including captions, lyrics, genre, key, chords, or complex questions. Music Flamingo gives detailed answers.

Authors: Sreyan Ghosh1,2*, Arushi Goel1*, Lasha Koroshinadze2**, Sang-gil Lee1, Zhifeng Kong1, Joao Felipe Santos1,
Ramani Duraiswami2, Dinesh Manocha2, Wei Ping1, Mohammad Shoeybi1, Bryan Catanzaro1

1NVIDIA, CA, USA | 2University of Maryland, College Park, USA

*Equally contributed and led the project. Names randomly ordered. **Significant technical contribution.

Correspondence: sreyang@umd.edu, arushig@nvidia.com

🎵 Audio Input

OR

🎵 Example Prompts
Upload Audio File Prompt