- Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Paper • 2507.08128 • Published • 15
- Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper • 2311.07919 • Published • 9
- Pengi: An Audio Language Model for Audio Tasks
Paper • 2305.11834 • Published • 2
park
woongvy
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
EMO: Pretraining Mixture of Experts for Emergent Modularity
upvoted a paper 5 months ago
Self-Improving VLM Judges Without Human Annotations
liked a dataset 6 months ago
gamma-lab-umd/MMAU-Pro
Organizations
None yet