FluidInference/parakeet-tdt-0.6b-v2-coreml


bweng committed on Jul 27

Commit a95894b · verified · 1 Parent(s): 0cedc4b

Update README.md

Files changed (1)

README.md +32 -2


@@ -27,6 +27,36 @@ base_model:
 - nvidia/parakeet-tdt-0.6b-v2
 ---
-Parakeet TDT 0.6B V2 (En)
-Work in progress, follow this [repo](https://github.com/FluidInference/FluidAudio) for updates, models will continue to change as we tune them

+# Parakeet TDT 0.6B V2 - CoreML
+This is a CoreML-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model, designed for high-performance automatic speech recognition on Apple platforms.
+## Model Description
+This model has been converted to CoreML format for efficient on-device inference on Apple Silicon and iOS devices, enabling real-time speech recognition with a
+minimal memory footprint. Models will continue to evolve as we optimize performance and accuracy.
+## Usage in Swift
+See the [FluidAudio repository](https://github.com/FluidInference/FluidAudioSwift) for instructions.
+## Performance
+- Real-time factor: < 0.3x on M1 Pro
+- Memory usage: ~800 MB peak
+- Supported platforms: macOS 14+, iOS 17+
+- Optimized for: Apple Silicon
+## Model Details
+- Architecture: FastConformer-TDT
+- Parameters: 0.6B
+- Sample rate: 16 kHz
+## License
+This model is released under the CC-BY-4.0 license. See the LICENSE file for details.
+## Acknowledgments
+Based on NVIDIA's Parakeet TDT model. CoreML conversion and Swift integration by the FluidInference team.
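
The real-time factor and sample rate quoted in the new Performance and Model Details sections can be related with a little arithmetic. The sketch below assumes the conventional definition of real-time factor (processing time divided by audio duration, so values below 1 are faster than real time); the README does not state which definition it uses. The function names and the 60-second example clip are hypothetical and are not part of FluidAudio or the model card.

```python
SAMPLE_RATE_HZ = 16_000  # sample rate listed under Model Details


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return processing time divided by audio duration (assumed RTF definition)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def num_samples(audio_seconds: float, sample_rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Return how many mono samples a clip of the given length contains."""
    return round(audio_seconds * sample_rate_hz)


# Hypothetical measurement: a 60-second clip transcribed in 15 seconds.
clip_seconds = 60.0
elapsed_seconds = 15.0

rtf = real_time_factor(elapsed_seconds, clip_seconds)
samples = num_samples(clip_seconds)

# An RTF below 0.3 would be consistent with the figure quoted for M1 Pro.
print(f"RTF = {rtf:.2f}, samples = {samples}")
```

Under this reading, an RTF below 0.3 means a 60-second clip should finish in under roughly 18 seconds, and the model would consume 960,000 samples for that clip at 16 kHz.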