SoundMind Model

The SoundMind Model is an audio language model (ALM) trained using SoundMind, a rule-based reinforcement learning (RL) algorithm designed to equip ALMs with advanced bimodal reasoning capabilities.

Github Paper Dataset Model

Citation

If you find our work helpful, feel free to give us a cite.

@article{diao2025soundmind,
  title={SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models},
  author={Diao, Xingjian and Zhang, Chunhui and Kong, Keyi and Wu, Weiyi and Ma, Chiyu and Ouyang, Zhongyu and Qing, Peijun and Vosoughi, Soroush and Gui, Jiang},
  journal={arXiv preprint arXiv:2506.12935},
  year={2025}
}
Downloads last month
15
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for SoundMind-RL/SoundMindModel

Finetuned
(26)
this model

Dataset used to train SoundMind-RL/SoundMindModel