Moshi Streaming Speech-to-Text (Quantized)

This is a quantized version of Kyutai’s stt-1b-en_fr model. The original is a 1B-parameter streaming speech-to-text model for English and French. This fork contains the same model, quantized to the Q8_0 and Q4_K GGUF formats to reduce memory usage and speed up inference.
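
To give a rough sense of the memory reduction, the sketch below estimates weight storage from the block layouts ggml uses for these formats: Q8_0 stores 32 weights in 34 bytes (8.5 bits per weight) and Q4_K stores 256 weights in 144 bytes (4.5 bits per weight). It assumes a round 1.0B parameters and that every tensor is quantized, which is usually not the case in practice, so actual file sizes will differ. The F16 row is only a reference point, not a statement about the precision of the original checkpoint, and the helper name `estimate_weight_bytes` is hypothetical.

```python
# Rough estimate of weight storage for a ~1.0B-parameter model.
# Assumption: all tensors use the listed format; real GGUF files typically
# keep some tensors at higher precision, so treat these as ballpark figures.

# Effective bits per weight, derived from the ggml block layouts.
BITS_PER_WEIGHT = {
    "F16": 16.0,            # reference point only
    "Q8_0": 34 * 8 / 32,    # 34-byte block holding 32 weights
    "Q4_K": 144 * 8 / 256,  # 144-byte super-block holding 256 weights
}


def estimate_weight_bytes(n_params: int, fmt: str) -> float:
    """Return the approximate number of bytes needed to store n_params weights."""
    return n_params * BITS_PER_WEIGHT[fmt] / 8


if __name__ == "__main__":
    n_params = 1_000_000_000  # assumed round figure for a "1B" model
    for fmt in BITS_PER_WEIGHT:
        gib = estimate_weight_bytes(n_params, fmt) / 2**30
        print(f"{fmt:>5}: {gib:.2f} GiB")
```

Under these assumptions, Q8_0 needs roughly half the weight storage of F16 and Q4_K roughly a quarter, before accounting for activations and any streaming state.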

