Update README.md
Browse files
README.md
CHANGED
|
@@ -20,7 +20,7 @@ padding: 0;
|
|
| 20 |
|
| 21 |
The NeMo Mel Codec is a neural audio codec which compresses mel-spectrograms into a quantized representation and reconstructs audio. The model can be used as a vocoder for speech synthesis.
|
| 22 |
|
| 23 |
-
The model works with full-bandwidth 44.1kHz speech. It might have lower performance with low-bandwidth speech (e.g. 16kHz speech upsampled to 44.
|
| 24 |
|
| 25 |
| Sample Rate | Frame Rate | Bit Rate | # Codebooks | Codebook Size | Embed Dim | FSQ Levels |
|
| 26 |
|:-----------:|:----------:|:----------:|:-----------:|:-------------:|:-----------:|:------------:|
|
|
|
|
| 20 |
|
| 21 |
The NeMo Mel Codec is a neural audio codec which compresses mel-spectrograms into a quantized representation and reconstructs audio. The model can be used as a vocoder for speech synthesis.
|
| 22 |
|
| 23 |
+
The model works with full-bandwidth 44.1kHz speech. It might have lower performance with low-bandwidth speech (e.g. 16kHz speech upsampled to 44.1kHz) or with non-speech audio.
|
| 24 |
|
| 25 |
| Sample Rate | Frame Rate | Bit Rate | # Codebooks | Codebook Size | Embed Dim | FSQ Levels |
|
| 26 |
|:-----------:|:----------:|:----------:|:-----------:|:-------------:|:-----------:|:------------:|
|