This repository contains the model described in Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis.

Please note that the checkpoints are licensed under CC BY-NC 4.0 and can be used for non-commercial purposes only.

Code: https://github.com/hkchengrex/MMAudio.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 2 Ask for provider support

Model tree for hkchengrex/MMAudio

Finetunes
2 models

Spaces using hkchengrex/MMAudio 59

Paper for hkchengrex/MMAudio