Update README.md
Browse files
README.md
CHANGED
|
@@ -40,7 +40,13 @@ Model | Sample Rate | Frame Rate | Bit Rate | # Codebooks | Codebook Size | Em
|
|
| 40 |
[0.6kbps-12.5fps](https://huggingface.co/nvidia/nemo-nano-codec-22khz-0.6kbps-12.5fps) | 22050 | 12.5 | 0.6kpbs | 4 | 4032 | 16 | [9, 8, 8, 7] |
|
| 41 |
[1.89kbps-21.5fps](https://huggingface.co/nvidia/nemo-nano-codec-22khz-1.89kbps-21.5fps) | 22050 | 21.5 | 1.89kpbs | 8 | 2016 | 32 | [8, 7, 6, 6] |
|
| 42 |
|
|
|
|
|
|
|
|
|
|
| 43 |
|
|
|
|
|
|
|
|
|
|
| 44 |
|
| 45 |
## License/Terms of Use
|
| 46 |
[NVIDIA Open Model License Agreement](https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf)
|
|
|
|
| 40 |
[0.6kbps-12.5fps](https://huggingface.co/nvidia/nemo-nano-codec-22khz-0.6kbps-12.5fps) | 22050 | 12.5 | 0.6kpbs | 4 | 4032 | 16 | [9, 8, 8, 7] |
|
| 41 |
[1.89kbps-21.5fps](https://huggingface.co/nvidia/nemo-nano-codec-22khz-1.89kbps-21.5fps) | 22050 | 21.5 | 1.89kpbs | 8 | 2016 | 32 | [8, 7, 6, 6] |
|
| 42 |
|
| 43 |
+
⚠️ **Note on 0.6kbps-12.5fps**
|
| 44 |
+
This variant is designed for **fine-tuning with a limited set of speakers**, as shown in our [S2S Duplex paper](https://www.isca-archive.org/interspeech_2025/hu25f_interspeech.html).
|
| 45 |
+
It is **not recommended** for general-purpose audio encoding or decoding.
|
| 46 |
|
| 47 |
+
ℹ️ **Recommended Variants**
|
| 48 |
+
Both **1.78kbps-12.5fps** and **1.89kbps-21.5fps** achieve similar audio reconstruction quality.
|
| 49 |
+
However, our [Magpie TTS](https://build.nvidia.com/nvidia/magpie-tts-multilingual) model performs best with **1.89kbps-21.5fps**.
|
| 50 |
|
| 51 |
## License/Terms of Use
|
| 52 |
[NVIDIA Open Model License Agreement](https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf)
|