SoundMindModel / README.md
SoundMind-RL's picture
Update README.md
52940bf verified
---
license: mit
datasets:
- SoundMind-RL/SoundMindDataset
language:
- en
base_model:
- Qwen/Qwen2.5-Omni-7B
pipeline_tag: audio-to-audio
---
# SoundMind Model
The SoundMind Model is an audio language model (ALM) trained using SoundMind, a rule-based reinforcement learning (RL) algorithm designed to equip ALMs with advanced bimodal reasoning capabilities.
[Github](https://github.com/xid32/SoundMind) [Paper](https://arxiv.org/abs/2506.12935) [Dataset](https://huggingface.co/datasets/SoundMind-RL/SoundMindDataset) [Model](https://huggingface.co/SoundMind-RL/SoundMindModel)
## Citation
If you find our work helpful, feel free to give us a cite.
```bibtex
@article{diao2025soundmind,
title={SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models},
author={Diao, Xingjian and Zhang, Chunhui and Kong, Keyi and Wu, Weiyi and Ma, Chiyu and Ouyang, Zhongyu and Qing, Peijun and Vosoughi, Soroush and Gui, Jiang},
journal={arXiv preprint arXiv:2506.12935},
year={2025}
}
```