FAcodec trained on 50k hours of speech data, with more timbre diversity and better reconstruction of speakers from podcasts, videos, games, and animations.
See the main repository for usage examples.

Paper: NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models (arXiv:2403.03100, published Mar 5, 2024)