---
license: cc-by-4.0
language:
  - en
  - fr
library_name: moshi
tags:
  - audio
  - automatic-speech-recognition
---

Moshi Streaming Speech-to-Text (Quantized)

This is a quantized version of Kyutai’s stt-1b-en_fr model. The original is a 1B-parameter streaming speech-to-text model for English and French. This fork contains the same model, quantized to the Q8_0 and Q4_K GGUF formats, which reduces memory usage and can speed up inference.
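
To give a rough sense of the memory savings, the sketch below estimates weight storage for each format from the ggml block layouts (Q8_0 stores 32 weights in 34 bytes; Q4_K stores 256 weights in 144 bytes). It assumes a nominal 1,000,000,000 parameters and that every tensor is quantized, neither of which is confirmed for these files. The names `BLOCK_LAYOUTS`, `bits_per_weight`, and `estimated_size_gb` are illustrative and not part of any library.

```python
# Rough estimate of weight storage per format; not a measurement of the files here.
# Each entry is (bytes per block, weights per block), following ggml's block layouts.
BLOCK_LAYOUTS = {
    "F32": (4, 1),
    "F16": (2, 1),
    "Q8_0": (34, 32),    # 32 int8 values plus one fp16 scale
    "Q4_K": (144, 256),  # 256 4-bit values plus packed scales/mins and two fp16 factors
}

# Assumption: "1B" taken as exactly one billion parameters.
NOMINAL_PARAMS = 1_000_000_000


def bits_per_weight(fmt: str) -> float:
    """Average number of bits used to store one weight in the given format."""
    block_bytes, block_weights = BLOCK_LAYOUTS[fmt]
    return 8 * block_bytes / block_weights


def estimated_size_gb(n_params: int, fmt: str) -> float:
    """Approximate weight storage in GB (10^9 bytes), assuming all tensors use fmt."""
    block_bytes, block_weights = BLOCK_LAYOUTS[fmt]
    return n_params * block_bytes / block_weights / 1e9


if __name__ == "__main__":
    for fmt in BLOCK_LAYOUTS:
        bpw = bits_per_weight(fmt)
        size = estimated_size_gb(NOMINAL_PARAMS, fmt)
        print(f"{fmt:>5}: {bpw:4.1f} bits/weight, ~{size:.2f} GB")
```

Actual file sizes will differ somewhat: GGUF conversions often keep some tensors at higher precision, and the file also carries metadata.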