Try IBM's Granite Speech 3.3 8B in a Space! It currently ranks #2 on the Open ASR Leaderboard (hf-audio/open_asr_leaderboard) by Word Error Rate.
My go-to transcription model is probably still nvidia/parakeet-tdt-0.6b-v2, since Granite does not output punctuation or capitalization. Interesting nonetheless!
randomblock1/granite-speech-3.3 (sorry it's slow, it runs on base CPU)
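
For anyone curious what a Word Error Rate ranking measures, here is a minimal sketch: WER is the word-level edit distance (substitutions, deletions, insertions) between the transcript and the reference, divided by the number of reference words. The `normalize` helper is my own simplification and an assumption, not the leaderboard's actual text normalizer. It lowercases and strips punctuation, which is roughly why missing punctuation and capitalization do not have to hurt a WER score.

```python
import re


def normalize(text: str) -> list[str]:
    # Illustrative normalization (an assumption, not the leaderboard's rules):
    # lowercase, then replace anything that is not a letter, digit,
    # apostrophe, or space with a space, and split into words.
    cleaned = re.sub(r"[^a-z0-9' ]+", " ", text.lower())
    return cleaned.split()


def word_error_rate(reference: str, hypothesis: str) -> float:
    ref, hyp = normalize(reference), normalize(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    # prev[j] is the distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,  # deletion (reference word missing)
                    curr[j - 1] + 1,  # insertion (extra hypothesis word)
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr

    return prev[-1] / len(ref)
```

With this sketch, `word_error_rate("Hello, world!", "hello world")` comes out to 0.0, since the only differences are case and punctuation.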