FunAudioLLM
/

ThinkSound

Model card Files Files and versions

ThinkSound / README.md

liuhuadai's picture

Update README.md

b72d99a verified 3 months ago

|

775 Bytes

	---
	license: apache-2.0
	---

	This repository contains the weights of [ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing](https://arxiv.org/abs/2506.21448).

	Project Paper: https://thinksound-project.github.io/.

	If you find our work useful, please cite our paper:

	```bibtex
	@misc{liu2025thinksoundchainofthoughtreasoningmultimodal,
	title={ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing},
	author={Huadai Liu and Jialei Wang and Kaicheng Luo and Wen Wang and Qian Chen and Zhou Zhao and Wei Xue},
	year={2025},
	eprint={2506.21448},
	archivePrefix={arXiv},
	primaryClass={eess.AS},
	url={https://arxiv.org/abs/2506.21448},
	}
	```